Mistral AI Game Jam


Recent Activity

Mistral-AI-Game-Jam's activity

MikeDoes 
posted an update about 20 hours ago
🚀 We are quite excited to announce the Ai4Privacy Python library! 🎉

pip install ai4privacy to anonymize short English text with OpenPII Masking 500k labels

📊 Day 5/7 of PII Masking 1M announcements complete! ⏰
MikeDoes 
posted an update 2 days ago
MikeDoes 
posted an update 5 days ago
📊 99%+ PII Masking Precision in English Straight to Your Browser! 🚀

ai4privacy/general-english-anonymiser-openpii-500k

Hard Facts:
🖥️ Runs in-browser—blazing fast, no server latency
👐 Open-source, MIT-licensed (even for commercial use)
📈 Full metrics on Hugging Face dataset and model pages

Day 3 out of 7 of PII-Masking-1M Announcements Complete!
*Accuracies reported from the new OpenPII-500k dataset

#DataPrivacy #AI #OpenSource
MikeDoes 
posted an update 7 days ago
#PII Masking Tech that does not **** around!

We are happy to release the OpenPII English Anonymiser, the most powerful open-source tool for redacting sensitive info from English text.

A ModernBERT model fine-tuned on 5.7 million+ PII examples, it's clocking 99%+ accuracy across emails, dates, social numbers, and more!

Why it’s a big deal:
✅ Top-tier precision: 100% for passport numbers, 99.96% for emails*.
✅ Totally free: MIT license for personal or commercial use.
✅ No secrets: Full metrics shared on Hugging Face.

#AI #OpenSource #DataSecurity @huggingface

Day 2 out of 7 of PII-Masking-1M Announcements Complete!

*Accuracies reported from the new OpenPII-500k dataset

ai4privacy/llama-ai4privacy-english-anonymiser-openpii
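
For readers wondering what "masking" means in practice, here is a minimal sketch of the redaction step only. It assumes a PII model has already returned character spans as `(start, end, label)` tuples; that span format, the label names, and the `mask_text` helper are hypothetical illustrations, not the Ai4Privacy library's API.

```python
from typing import List, Tuple

# Hypothetical span format: (start, end, label) as character offsets.
Span = Tuple[int, int, str]


def mask_text(text: str, spans: List[Span]) -> str:
    """Replace each detected span with a [LABEL] placeholder."""
    pieces = []
    cursor = 0
    for start, end, label in sorted(spans):
        if start < cursor:
            # Skip spans that overlap one we already masked.
            continue
        pieces.append(text[cursor:start])
        pieces.append(f"[{label}]")
        cursor = end
    pieces.append(text[cursor:])
    return "".join(pieces)


# Illustrative input; the spans are written by hand, not model output.
example = "Contact Jane at jane@example.com today."
spans = [(8, 12, "GIVENNAME"), (16, 32, "EMAIL")]
masked = mask_text(example, spans)
print(masked)
```

Sorting the spans and tracking a cursor keeps the offsets valid even though each replacement changes the length of the text.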
MikeDoes 
posted an update 9 days ago
🚀 Ai4Privacy Team is excited to unveil PII-Masking-1M, our most significant release yet! 🎉

This publication series 📦 includes datasets 📊, models 🤖, and applications ⚙️ to advance PII masking with AI systems 🛡️

Starting on Monday with daily posts at 7 PM CET ⏰
Tonic 
posted an update 19 days ago
🙋🏻‍♂️Hey there folks,

Did you know that you can use ModernBERT to detect model hallucinations?

Check out the demo: Tonic/hallucination-test

See here for the medical context demo: MultiTransformer/tonic-discharge-guard

Check out the model from KRLabs: KRLabsOrg/lettucedect-large-modernbert-en-v1

and the library they kindly open-sourced for it: https://github.com/KRLabsOrg/LettuceDetect

👆🏻if you like this topic please contribute code upstream 🚀

Tonic 
posted an update 20 days ago
Powered by KRLabsOrg/lettucedect-large-modernbert-en-v1 from KRLabsOrg.

Detect hallucinations in answers based on context and questions using ModernBERT with 8192-token context support!

### Model Details
- **Model Name**: [lettucedect-large-modernbert-en-v1](KRLabsOrg/lettucedect-large-modernbert-en-v1)
- **Organization**: [KRLabsOrg](https://huggingface.co/KRLabsOrg)
- **Github**: [https://github.com/KRLabsOrg/LettuceDetect](https://github.com/KRLabsOrg/LettuceDetect)
- **Architecture**: ModernBERT (Large) with extended context support up to 8192 tokens
- **Task**: Token Classification / Hallucination Detection
- **Training Dataset**: [RAGTruth](wandb/RAGTruth-processed)
- **Language**: English
- **Capabilities**: Detects hallucinated spans in answers, provides confidence scores, and calculates average confidence across detected spans.

LettuceDetect excels at processing long documents to determine if an answer aligns with the provided context, making it a powerful tool for ensuring factual accuracy.
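
The "average confidence across detected spans" capability can be sketched as below. This is an illustration only: it assumes the detector returns a list of dicts with a `confidence` key per hallucinated span, and the `summarize_spans` helper and the sample values are hypothetical, not LettuceDetect output.

```python
from typing import Dict, List, Optional


def summarize_spans(spans: List[Dict]) -> Dict[str, Optional[float]]:
    """Count detected spans and average their confidence scores."""
    if not spans:
        # No hallucinated spans detected: there is nothing to average.
        return {"count": 0, "average_confidence": None}
    total = sum(span["confidence"] for span in spans)
    return {"count": len(spans), "average_confidence": total / len(spans)}


# Hand-written example spans (assumed shape: start, end, text, confidence).
detected = [
    {"start": 10, "end": 24, "text": "in 1987 by NASA", "confidence": 0.875},
    {"start": 40, "end": 52, "text": "69 million", "confidence": 0.625},
]
summary = summarize_spans(detected)
print(summary)
```

Returning `None` for the empty case avoids a division by zero and makes "no hallucination found" distinguishable from a low score.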
ngxson 
posted an update 27 days ago
A comprehensive matrix of which format you should use.

Read more on my blog post: https://huggingface.co/blog/ngxson/common-ai-model-formats

| Hardware        | GGUF      | PyTorch                | Safetensors              | ONNX  |
|-----------------|-----------|------------------------|--------------------------|-------|
| CPU             | ✅ (best) | 🟡                      | 🟡                       |       |
| GPU             |           |                        |                          |       |
| Mobile          |           | 🟡 (via executorch)     |                          |       |
| Apple silicon   |           | 🟡                      | ✅ (via MLX framework)   |       |
Tonic 
posted an update about 2 months ago
🙋🏻‍♂️hey there folks ,

Goedel's Theorem Prover is now being demoed on Hugging Face: Tonic/Math

Give it a try!
Tonic 
posted an update about 2 months ago
🙋🏻‍♂️ Hey there folks ,

Our team made a game during the @mistral-game-jam and we're trying to win the community award!

Try our game out and drop us a ❤️ like, which basically counts as a vote for us!

Mistral-AI-Game-Jam/TextToSurvive

hope you like it !
ngxson 
posted an update 2 months ago
Tonic 
posted an update 2 months ago
🙋🏻‍♂️ Hey there folks ,

Facebook AI just released JASCO models that make music stems .

you can try it out here : Tonic/audiocraft

hope you like it
ngxson 
posted an update 2 months ago
Check out my collection of pre-made GGUF LoRA adapters!

This allows you to use both the normal and abliterated versions of popular models like llama, qwen, etc., without having to double the amount of VRAM used.

ngxson/gguf_lora_collection
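
A rough sketch of why an adapter is so much cheaper than a second full copy of the model, assuming the standard LoRA formulation in which a weight matrix W is adjusted by a low-rank product B·A. The dimensions and rank below are hypothetical and are not taken from any adapter in the collection.

```python
def full_matrix_params(d_out: int, d_in: int) -> int:
    """Parameter count of one dense weight matrix W (d_out x d_in)."""
    return d_out * d_in


def lora_adapter_params(d_out: int, d_in: int, rank: int) -> int:
    """Parameter count of the LoRA factors B (d_out x rank) and A (rank x d_in)."""
    return rank * (d_out + d_in)


# Hypothetical layer shape and adapter rank, for illustration only.
D_OUT, D_IN, RANK = 4096, 4096, 16

full = full_matrix_params(D_OUT, D_IN)
adapter = lora_adapter_params(D_OUT, D_IN, RANK)
ratio = full / adapter

print(f"full matrix: {full:,} params")
print(f"LoRA adapter: {adapter:,} params")
print(f"the adapter is {ratio:.0f}x smaller for this layer")
```

Because the base weights are shared and only the small factors differ, keeping both variants loaded costs roughly one model plus the adapter rather than two models.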
Tonic 
posted an update 2 months ago
🙋🏻‍♂️ Hey there folks, Open LLM Europe just released the Lucie 7B-Instruct model, a bilingual instruct model trained on open data! You can check out my unofficial demo here while we wait for the official inference API from the group: Tonic/Lucie-7B. Hope you like it 🚀
ngxson 
posted an update 3 months ago
Tonic 
posted an update 3 months ago
Microsoft just released Phi-4, check it out here: Tonic/Phi-4

hope you like it :-)
fracapuano 
posted an update 4 months ago
Sharing what we have built over the course of the weekend at the @llamameta hackathon, by Cerebral Valley in London 🇬🇧 👇

@gabrycina @calebgcc and I competed with 200+ participants and 50+ teams in a 24-hour sprint centered around hacking for impact! We focused on applications of robotics for people in need of assisted living, aiming to enable greater autonomy and accessibility of robotics in everyday life.

complete list of assets 👇
🤗 trained robotics policies
v1:
- fracapuano/moss-pills
- fracapuano/moss-cup
v2:
- fracapuano/meta-grasp

🤗 datasets
v1:
- fracapuano/pills
- fracapuano/cup
v2:
- fracapuano/cupim


You can find a live demo of our submission at: https://x.com/_fracapuano/status/1858102728691458554

If you want to know more about how we collected 100GB+ of data, trained multiple RL-policies using @lerobot and used Llama-3.2 models to handle user interactions and switch between tasks, go ahead and have a look! Also, don't be a stranger, and reach out 🦾

Our project is fully open-source, for the community (and ourselves, 👨‍🍳) to build! A huge thank you to @cadene for the help (and the robot 🤭) - truly feeling these hugs-vibes 🤗 , and to @thomwolf and @clem for sharing our work across

Little extra:
➡️ Our 🧠EEG waves🧠-based control of the 🦾robotic arm🦾
fracapuano 
posted an update 4 months ago
✍️ The last few weeks have been very intense!
🔴 I have been out all weekends
🔴 Participated in 4 hackathons in a row (2 more to come!)
🔴 Even threw a big hackathon myself!

Nonetheless, I am in school again 🏫, which meant... ✨homework✨

➡️ Head over to https://x.com/_fracapuano/status/1856415612202799243 to read more about how I used @mistralai models to help me with my assignments (not how you think I did hihi 😏)

➡️ Check out https://huggingface.co/spaces/fracapuano/texstral if you want to use the tool yourself!