LangChainDatasets

community

https://langchain.readthedocs.io/en/latest/

AI & ML interests

None defined yet.

Recent Activity

ImranzamanML authored a paper 6 days ago

A Robust Deep Networks based Multi-Object MultiCamera Tracking System for City Scale Traffic

1024m authored a paper 23 days ago

Robust and Fine-Grained Detection of AI Generated Texts

1024m authored a paper 25 days ago

Improving Multilingual Capabilities with Cultural and Local Knowledge in Large Language Models While Enhancing Native Performance

LangChainDatasets's activity

1024m

authored a paper 23 days ago

Robust and Fine-Grained Detection of AI Generated Texts

Paper • 2504.11952 • Published 24 days ago • 11

1024m

authored a paper 25 days ago

Improving Multilingual Capabilities with Cultural and Local Knowledge in Large Language Models While Enhancing Native Performance

Paper • 2504.09753 • Published 26 days ago • 4

1024m

authored a paper 30 days ago

Kaleidoscope: In-language Exams for Massively Multilingual Vision Evaluation

Paper • 2504.07072 • Published about 1 month ago • 8

Tonic

posted an update 2 months ago

Post

1445

🙋🏻‍♂️Hey there folks,

Did you know that you can use ModernBERT to detect model hallucinations?

Check out the demo: Tonic/hallucination-test

See here for the medical context demo: MultiTransformer/tonic-discharge-guard

Check out the model from KRLabs: KRLabsOrg/lettucedect-large-modernbert-en-v1

and the library they kindly open-sourced for it: https://github.com/KRLabsOrg/LettuceDetect

👆🏻if you like this topic please contribute code upstream 🚀

2 replies

Tonic

posted an update 2 months ago

Post

787

Powered by KRLabsOrg/lettucedect-large-modernbert-en-v1 from KRLabsOrg.

Detect hallucinations in answers based on context and questions using ModernBERT with 8192-token context support!

### Model Details
- **Model Name**: [lettucedect-large-modernbert-en-v1](KRLabsOrg/lettucedect-large-modernbert-en-v1)
- **Organization**: [KRLabsOrg](KRLabsOrg)
- **GitHub**: [https://github.com/KRLabsOrg/LettuceDetect](https://github.com/KRLabsOrg/LettuceDetect)
- **Architecture**: ModernBERT (Large) with extended context support up to 8192 tokens
- **Task**: Token Classification / Hallucination Detection
- **Training Dataset**: [RAGTruth](wandb/RAGTruth-processed)
- **Language**: English
- **Capabilities**: Detects hallucinated spans in answers, provides confidence scores, and calculates average confidence across detected spans.

LettuceDetect excels at processing long documents to determine if an answer aligns with the provided context, making it a powerful tool for ensuring factual accuracy.
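As a rough illustration of the "average confidence across detected spans" capability listed above, here is a minimal Python sketch. It does not call the LettuceDetect library: the `Span` structure, the `summarize` helper, the 0.5 threshold, and the confidence values are all hypothetical, chosen only to show how span-level scores could be filtered and averaged.

```python
from dataclasses import dataclass
from typing import Dict, List


@dataclass
class Span:
    """A flagged character range in the answer (hypothetical structure)."""
    start: int
    end: int
    confidence: float


def average_confidence(spans: List[Span]) -> float:
    """Mean confidence over the spans; 0.0 when nothing was flagged."""
    if not spans:
        return 0.0
    return sum(s.confidence for s in spans) / len(spans)


def summarize(answer: str, spans: List[Span], threshold: float = 0.5) -> Dict[str, object]:
    """Keep spans at or above the threshold; report their text and average score."""
    kept = [s for s in spans if s.confidence >= threshold]
    return {
        "flagged_text": [answer[s.start:s.end] for s in kept],
        "average_confidence": average_confidence(kept),
    }


# Made-up example data: in practice the spans and scores would come from the model.
answer = "The capital of France is Paris. The population of France is 69 million."
first = "69 million"
second = "Paris"
spans = [
    Span(answer.index(first), answer.index(first) + len(first), 0.75),
    Span(answer.index(second), answer.index(second) + len(second), 0.25),
]

report = summarize(answer, spans)
print(report)
```

Averaging only the spans that pass the threshold keeps low-confidence detections from diluting the summary score; whether the actual model applies a threshold this way is an assumption of this sketch.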

Tonic

posted an update 3 months ago

Post

2408

🙋🏻‍♂️hey there folks ,

Goedel's Theorem Prover is now being demoed on Hugging Face: Tonic/Math

Give it a try!

Tonic

posted an update 3 months ago

Post

2995

🙋🏻‍♂️ Hey there folks ,

Our team made a game during the @mistral-game-jam and we're trying to win the community award!

Try our game out and drop us a ❤️ like to vote for us!

Mistral-AI-Game-Jam/TextToSurvive

hope you like it !

Tonic

posted an update 4 months ago

Post

1919

🙋🏻‍♂️ Hey there folks ,

Facebook AI just released JASCO models that make music stems.

You can try it out here: Tonic/audiocraft

hope you like it

Tonic

posted an update 4 months ago

Post

2479

🙋🏻‍♂️ Hey there folks,

Open LLM Europe just released the Lucie 7B-Instruct model, a bilingual instruct model trained on open data!

You can check out my unofficial demo here while we wait for the official inference API from the group: Tonic/Lucie-7B

Hope you like it 🚀

Tonic

posted an update 4 months ago

Post

1736

Microsoft just released Phi-4, check it out here: Tonic/Phi-4

hope you like it :-)

valeriaWong

authored 2 papers 4 months ago

Xmodel-1.5: An 1B-scale Multilingual LLM

Paper • 2411.10083 • Published Nov 15, 2024 • 14

Xmodel-2 Technical Report

Paper • 2412.19638 • Published Dec 27, 2024 • 27

Tonic

posted an update 6 months ago

Post

3611

🙋🏻‍♂️hey there folks,

Periodic reminder: if you are experiencing ⚠️ 500 errors ⚠️ or ⚠️ abnormal Spaces behavior on load or launch ⚠️

we have a thread 👉🏻 https://discord.com/channels/879548962464493619/1295847667515129877

If you can record the problem and share it there, or on the forums in your own post, please don't be shy. I'm not sure, but I do think it helps 🤗🤗🤗

2 replies

Tonic

posted an update 7 months ago

Post

1193

boomers still pick zenodo.org instead of huggingface ??? absolutely clownish nonsense , my random datasets have 30x more downloads and views than front page zenodos ... gonna write a comparison blog , but yeah... cringe.

1 reply

Tonic

posted an update 7 months ago

Post

864

🙋🏻‍♂️ hey there folks ,

really enjoying sharing cool genomics and protein datasets on the hub these days , check out our cool new org :

seq-to-pheno

Scroll down for the datasets. We're still figuring out how to optimize for discoverability, but I do think on that front it will be better than zenodo.org. It would be nice to write a tutorial about that and compare: we already have more downloads than most Zenodo datasets from famous researchers!

1024m

authored 4 papers 7 months ago

RKadiyala at SemEval-2024 Task 8: Black-Box Word-Level Text Boundary Detection in Partially Machine Generated Texts

Paper • 2410.16659 • Published Oct 22, 2024

Large Language Models for Cross-lingual Emotion Detection

Paper • 2410.15974 • Published Oct 21, 2024 • 1

1024m at SMM4H 2024: Tasks 3, 5 & 6 -- Ensembles of Transformers and Large Language Models for Medical Text Classification

Paper • 2410.15998 • Published Oct 21, 2024 • 1

Augmenting Legal Decision Support Systems with LLM-based NLI for Analyzing Social Media Evidence

Paper • 2410.15990 • Published Oct 21, 2024 • 1

Tonic

posted an update 7 months ago

Post

1480

hey there folks,

Twitter is awful, isn't it? Just getting into the habit of using hf/posts for shares 🦙🦙

Tonic/on-device-granite-3.0-1b-a400m-instruct

new granite on device instruct model demo , hope you like it 🚀🚀

Team members 125
