ONNXConfig for all

non-profit

Activity Feed Request to join this org

AI & ML interests

Make all hub models available for conversion to ONNX format.

Recent Activity

wannaphong authored a paper about 1 month ago

Mangosteen: An Open Thai Corpus for Language Model Pretraining

MaziyarPanahi authored a paper about 1 month ago

OpenMed NER: Open-Source, Domain-Adapted State-of-the-Art Transformers for Biomedical NER Across 12 Public Datasets

lewtun authored a paper 4 months ago

Kimina-Prover Preview: Towards Large Formal Reasoning Models with Reinforcement Learning

View all activity

MaziyarPanahi

authored a paper about 1 month ago

INTELLECT-1 Technical Report

Paper • 2412.01152 • Published Dec 2, 2024 • 3

wannaphong

authored a paper about 1 month ago

Mangosteen: An Open Thai Corpus for Language Model Pretraining

Paper • 2507.14664 • Published Jul 19 • 6

MaziyarPanahi

authored a paper about 1 month ago

OpenMed NER: Open-Source, Domain-Adapted State-of-the-Art Transformers for Biomedical NER Across 12 Public Datasets

Paper • 2508.01630 • Published Aug 3 • 9

AtAndDev

posted an update about 2 months ago

Post

465

Qwen 3 Coder is a personal attack to k2, and I love it.
It achieves near SOTA on LCB while not having reasoning.
Finally people are understanding that reasoning isnt necessary for high benches...

Qwen ftw!

DECENTRALIZE DECENTRALIZE DECENTRALIZE

MaziyarPanahi

posted an update about 2 months ago

Post

8805

🧬 Breaking news in Clinical AI: Introducing the OpenMed NER Model Discovery App on Hugging Face 🔬

OpenMed is back! 🔥 Finding the right biomedical NER model just became as precise as a PCR assay!

I'm thrilled to unveil my comprehensive OpenMed Named Entity Recognition Model Discovery App that puts 384 specialized biomedical AI models at your fingertips.

🎯 Why This Matters in Healthcare AI:
Traditional clinical text mining required hours of manual model evaluation. My Discovery App instantly connects researchers, clinicians, and data scientists with the exact NER models they need for their biomedical entity extraction tasks.

🔬 What You Can Discover:
✅ Pharmacological Models - Extract "chemical compounds", "drug interactions", and "pharmaceutical" entities from clinical notes
✅ Genomics & Proteomics - Identify "DNA sequences", "RNA transcripts", "gene variants", "protein complexes", and "cell lines"
✅ Pathology & Disease Detection - Recognize "pathological formations", "cancer types", and "disease entities" in medical literature
✅ Anatomical Recognition - Map "anatomical systems", "tissue types", "organ structures", and "cellular components"
✅ Clinical Entity Extraction - Detect "organism species", "amino acids", 'protein families", and "multi-tissue structures"

💡 Advanced Features:
🔍 Intelligent Entity Search - Find models by specific biomedical entities (e.g., "Show me models detecting CHEM + DNA + Protein")
🏥 Domain-Specific Filtering - Browse by Oncology, Pharmacology, Genomics, Pathology, Hematology, and more
📊 Model Architecture Insights - Compare BERT, RoBERTa, and DeBERTa implementations
⚡ Real-Time Search - Auto-filtering as you type, no search buttons needed
🎨 Clinical-Grade UI - Beautiful, intuitive interface designed for medical professionals

Ready to revolutionize your biomedical NLP pipeline?

🔗 Try it now: OpenMed/openmed-ner-models
🧬 Built with: Gradio, Transformers, Advanced Entity Mapping

5 replies

gagan3012

authored a paper 3 months ago

Leveraging Vision-Language Pre-training for Human Activity Recognition in Still Images

Paper • 2506.13458 • Published Jun 16

AtAndDev

posted an update 3 months ago

Post

3002

deepseek-ai/DeepSeek-R1-0528

This is the end

1 reply

hazemessam

authored 2 papers 3 months ago

Ankh3: Multi-Task Pretraining with Sequence Denoising and Completion Enhances Protein Representations

Paper • 2505.20052 • Published May 26

Beyond Simple Concatenation: Fairly Assessing PLM Architectures for Multi-Chain Protein-Protein Interactions Prediction

Paper • 2505.20036 • Published May 26

gagan3012

authored a paper 4 months ago

Date Fragments: A Hidden Bottleneck of Tokenization for Temporal Reasoning

Paper • 2505.16088 • Published May 22 • 3

lewtun

authored a paper 4 months ago

Kimina-Prover Preview: Towards Large Formal Reasoning Models with Reinforcement Learning

Paper • 2504.11354 • Published Apr 15 • 6

shivalikasingh

authored a paper 4 months ago

The Leaderboard Illusion

Paper • 2504.20879 • Published Apr 29 • 70

lewtun

authored a paper 5 months ago

SmolVLM: Redefining small and efficient multimodal models

Paper • 2504.05299 • Published Apr 7 • 199

AtAndDev

posted an update 5 months ago

Post

3134

Llama 4 is out...

3 replies

AtAndDev

posted an update 6 months ago

Post

4361

There seems to multiple paid apps shared here that are based on models on hf, but some ppl sell their wrappers as "products" and promote them here. For a long time, hf was the best and only platform to do oss model stuff but with the recent AI website builders anyone can create a product (really crappy ones btw) and try to sell it with no contribution to oss stuff. Please dont do this, or try finetuning the models you use...
Sorry for filling yall feed with this bs but yk...

6 replies

AtAndDev

posted an update 6 months ago

Post

1665

Gemma 3 seems to be really good at human preference. Just waiting for ppl to see it.

lewtun

authored a paper 6 months ago

Optimizing Test-Time Compute via Meta Reinforcement Fine-Tuning

Paper • 2503.07572 • Published Mar 10 • 47

lewtun

posted an update 6 months ago

Post

3407

Introducing OlympicCoder: a series of open reasoning models that can solve olympiad-level programming problems 🧑‍💻

- 7B open-r1/OlympicCoder-7B
- 32B open-r1/OlympicCoder-32B

We find that OlympicCoder models outperform Claude 3.7 Sonnet, as well as others over 100x larger 💪

Together with the models, we are releasing:

📊CodeForces-CoTs: new dataset of code problems from the most popular competitive coding platform, with R1 traces in C++ and Python open-r1/codeforces-cots

🏆 IOI'2024: a new benchmark of VERY hard programming problems where even frontier models struggle to match human performance open-r1/ioi

For links to the models and datasets, check out our latest progress report from Open R1: https://huggingface.co/blog/open-r1/update-3

1 reply

wannaphong

authored a paper 7 months ago

Sailor2: Sailing in South-East Asia with Inclusive Multilingual LLMs

Paper • 2502.12982 • Published Feb 18 • 18

AtAndDev

posted an update 7 months ago

Post

2497

@nroggendorff is that you sama?

2 replies

AI & ML interests

Recent Activity

Team members 163

OWG's activity