AI & ML interests
Non-profit ML community
Recent Activity
View all activity
Post
430
Who's going to Raise Summit in Paris Tomorrow ?
If you're around , I would love to meet you :-)
If you're around , I would love to meet you :-)
Post
1828
Anyone know how to reset Claude web's MCP config? I connected mine when the HF MCP first released with just the default example spaces added. I added lots of other MCP spaces but Claude.ai doesn't update the available tools... "Disconnecting" the HF integration does nothing, deleting it and adding it again does nothing.
Refreshing tools works fine in VS Code because I can manually restart it in
Refreshing tools works fine in VS Code because I can manually restart it in
mcp.json
, but claude.ai has no such option. Anyone got any ideas?
kalelparkย
authored
a
paper
24 days ago
4n3moneย
authored
a
paper
30 days ago
kalelparkย
authored
a
paper
about 1 month ago
Post
675
๐๐ปโโ๏ธ hey there folks ,
So every bio/med/chem meeting i go to i always the same questions "why are you sharing a gdrive link with me for this?" and "Do you have any plans to publish your model weights and datasets on huggingface?" and finally i got a good answer today which explains everything :
basically there is some kind of government censorship on this (usa, but i'm sure others too) and they are told they are not allowed as it is considered a "dataleak" which is illegal !!!!
this is terrible ! but the good news is that we can do something about it !
so there is this "call for opinions and comments" here from the NIH (usa) , and here we can make our opinion on this topic known : https://osp.od.nih.gov/comment-form-responsibly-developing-and-sharing-generative-artificial-intelligence-tools-using-nih-controlled-access-data/
kindly consider dropping your opinion and thoughts about this censorship of science , and share this post , link or thoughts widely .
Together maybe we can start to share data and model weights appropriately and openly in a good way ๐๐ป๐
cc. @cyrilzakka
So every bio/med/chem meeting i go to i always the same questions "why are you sharing a gdrive link with me for this?" and "Do you have any plans to publish your model weights and datasets on huggingface?" and finally i got a good answer today which explains everything :
basically there is some kind of government censorship on this (usa, but i'm sure others too) and they are told they are not allowed as it is considered a "dataleak" which is illegal !!!!
this is terrible ! but the good news is that we can do something about it !
so there is this "call for opinions and comments" here from the NIH (usa) , and here we can make our opinion on this topic known : https://osp.od.nih.gov/comment-form-responsibly-developing-and-sharing-generative-artificial-intelligence-tools-using-nih-controlled-access-data/
kindly consider dropping your opinion and thoughts about this censorship of science , and share this post , link or thoughts widely .
Together maybe we can start to share data and model weights appropriately and openly in a good way ๐๐ป๐
cc. @cyrilzakka
kinam0252ย
authored
a
paper
about 1 month ago
Post
2525
๐๐ปโโ๏ธ Hey there folks ,
Yesterday the world's first "Learn to Vibe Code" application was released .
As vibe coding is the mainstream paradigm , so now the first educational app is there to support it .
You can try it out already :
https://vibe.takara.ai
and of course it's entirely open source, so i already made my issue and feature branch :-) ๐
Yesterday the world's first "Learn to Vibe Code" application was released .
As vibe coding is the mainstream paradigm , so now the first educational app is there to support it .
You can try it out already :
https://vibe.takara.ai
and of course it's entirely open source, so i already made my issue and feature branch :-) ๐
Post
2736
PSA for anyone using
Both of these themes have been updated to fix some of the long-standing inconsistencies ever since the transition to Gradio v5. Textboxes are no longer bright green and
If your space is already using one of these themes, you just need to restart your space to get the latest version. No code changes needed.
Nymbo/Nymbo_Theme
or Nymbo/Nymbo_Theme_5
in a Gradio space ~Both of these themes have been updated to fix some of the long-standing inconsistencies ever since the transition to Gradio v5. Textboxes are no longer bright green and
in-line code
is readable now! Both themes are now visually identical across versions.If your space is already using one of these themes, you just need to restart your space to get the latest version. No code changes needed.

juyoungmlย
authored
3
papers
3 months ago
MM-Eval: A Multilingual Meta-Evaluation Benchmark for LLM-as-a-Judge and Reward Models
Paper
โข
2410.17578
โข
Published
โข
1
LLM-as-an-Interviewer: Beyond Static Testing Through Dynamic LLM Evaluation
Paper
โข
2412.10424
โข
Published
โข
2
Trillion 7B Technical Report
Paper
โข
2504.15431
โข
Published
โข
37

emreย
authored
a
paper
4 months ago
Post
3552
having trouble with auto train
hello there this is the first time i am testing auto train with a 1.8k SFT dataset. Howevery i am not quite sure the training is going smooth. Logs seem quite confusing, token did not match can not auth, generates confusing train splits, do you know how i can check my running job properly?
what is being used for training as data?
any ideas?
hello there this is the first time i am testing auto train with a 1.8k SFT dataset. Howevery i am not quite sure the training is going smooth. Logs seem quite confusing, token did not match can not auth, generates confusing train splits, do you know how i can check my running job properly?
what is being used for training as data?
any ideas?
Post
1609
๐๐ปโโ๏ธHey there folks,
Did you know that you can use ModernBERT to detect model hallucinations ?
Check out the Demo : Tonic/hallucination-test
See here for Medical Context Demo : MultiTransformer/tonic-discharge-guard
check out the model from KRLabs : KRLabsOrg/lettucedect-large-modernbert-en-v1
and the library they kindly open sourced for it : https://github.com/KRLabsOrg/LettuceDetect
๐๐ปif you like this topic please contribute code upstream ๐
Did you know that you can use ModernBERT to detect model hallucinations ?
Check out the Demo : Tonic/hallucination-test
See here for Medical Context Demo : MultiTransformer/tonic-discharge-guard
check out the model from KRLabs : KRLabsOrg/lettucedect-large-modernbert-en-v1
and the library they kindly open sourced for it : https://github.com/KRLabsOrg/LettuceDetect
๐๐ปif you like this topic please contribute code upstream ๐
Post
856
Powered by
KRLabsOrg/lettucedect-large-modernbert-en-v1 from KRLabsOrg.
Detect hallucinations in answers based on context and questions using ModernBERT with 8192-token context support!
### Model Details
- **Model Name**: [lettucedect-large-modernbert-en-v1]( KRLabsOrg/lettucedect-large-modernbert-en-v1)
- **Organization**: [KRLabsOrg](
KRLabsOrg
)
- **Github**: [https://github.com/KRLabsOrg/LettuceDetect](https://github.com/KRLabsOrg/LettuceDetect)
- **Architecture**: ModernBERT (Large) with extended context support up to 8192 tokens
- **Task**: Token Classification / Hallucination Detection
- **Training Dataset**: [RagTruth]( wandb/RAGTruth-processed)
- **Language**: English
- **Capabilities**: Detects hallucinated spans in answers, provides confidence scores, and calculates average confidence across detected spans.
LettuceDetect excels at processing long documents to determine if an answer aligns with the provided context, making it a powerful tool for ensuring factual accuracy.
Detect hallucinations in answers based on context and questions using ModernBERT with 8192-token context support!
### Model Details
- **Model Name**: [lettucedect-large-modernbert-en-v1]( KRLabsOrg/lettucedect-large-modernbert-en-v1)
- **Organization**: [KRLabsOrg](

- **Github**: [https://github.com/KRLabsOrg/LettuceDetect](https://github.com/KRLabsOrg/LettuceDetect)
- **Architecture**: ModernBERT (Large) with extended context support up to 8192 tokens
- **Task**: Token Classification / Hallucination Detection
- **Training Dataset**: [RagTruth]( wandb/RAGTruth-processed)
- **Language**: English
- **Capabilities**: Detects hallucinated spans in answers, provides confidence scores, and calculates average confidence across detected spans.
LettuceDetect excels at processing long documents to determine if an answer aligns with the provided context, making it a powerful tool for ensuring factual accuracy.
Post
2440
๐๐ปโโ๏ธhey there folks ,
Goedel's Theorem Prover is now being demo'ed on huggingface : Tonic/Math
give it a try !
Goedel's Theorem Prover is now being demo'ed on huggingface : Tonic/Math
give it a try !
Post
3016
๐๐ปโโ๏ธ Hey there folks ,
our team made a game during the @mistral-game-jam and we're trying to win the community award !
try our game out and drop us a โค๏ธ like basically to vote for us !
Mistral-AI-Game-Jam/TextToSurvive
hope you like it !
our team made a game during the @mistral-game-jam and we're trying to win the community award !
try our game out and drop us a โค๏ธ like basically to vote for us !
Mistral-AI-Game-Jam/TextToSurvive
hope you like it !