Niklas Muennighoff's picture

Niklas Muennighoff

Muennighoff

·

https://muennighoff.github.io/

AI & ML interests

None yet

Recent Activity

upvoted a paper 1 day ago

ATLAS: Adaptive Transfer Scaling Laws for Multilingual Pretraining, Finetuning, and Decoding the Curse of Multilinguality

updated a model 8 days ago

ryzax/1.5B-v96

updated a model 8 days ago

ryzax/1.5B-v103

View all activity

Organizations

upvoted a paper 1 day ago

ATLAS: Adaptive Transfer Scaling Laws for Multilingual Pretraining, Finetuning, and Decoding the Curse of Multilinguality

Paper • 2510.22037 • Published 9 days ago • 16

upvoted an article 13 days ago

Article

Introducing MTEB v2: Evaluation of embedding and retrieval systems for more than just text

By

and 2 others •

13 days ago

• 33

upvoted a paper 19 days ago

HUME: Measuring the Human-Model Performance Gap in Text Embedding Task

Paper • 2510.10062 • Published 22 days ago • 8

upvoted an article about 1 month ago

Article

Introducing RTEB: A New Standard for Retrieval Evaluation

Oct 1

• 120

upvoted a paper about 1 month ago

Humanline: Online Alignment as Perceptual Loss

Paper • 2509.24207 • Published Sep 29 • 11

upvoted a paper 2 months ago

UQ: Assessing Language Models on Unsolved Questions

Paper • 2508.17580 • Published Aug 25 • 15

upvoted a paper 4 months ago

FlexOlmo: Open Language Models for Flexible Data Use

Paper • 2507.07024 • Published Jul 9 • 7

upvoted 2 papers 5 months ago

Datasheets Aren't Enough: DataRubrics for Automated Quality Metrics and Accountability

Paper • 2506.01789 • Published Jun 2 • 14

OpenThoughts: Data Recipes for Reasoning Models

Paper • 2506.04178 • Published Jun 4 • 48

upvoted 2 papers 6 months ago

Crosslingual Reasoning through Test-Time Scaling

Paper • 2505.05408 • Published May 8 • 8

ReasonIR: Training Retrievers for Reasoning Tasks

Paper • 2504.20595 • Published Apr 29 • 53

upvoted a paper 7 months ago

MIEB: Massive Image Embedding Benchmark

Paper • 2504.10471 • Published Apr 14 • 20

upvoted a paper 8 months ago

MMTEB: Massive Multilingual Text Embedding Benchmark

Paper • 2502.13595 • Published Feb 19 • 41

upvoted a paper 9 months ago

s1: Simple test-time scaling

Paper • 2501.19393 • Published Jan 31 • 124

upvoted 2 collections 12 months ago

📈 Scaling Laws with Vocabulary

Increase your vocabulary size when you scale up your language model • 5 items • Updated Aug 11, 2024 • 6

🧬 RegMix: Data Mixture as Regression

Automatic data mixture method for large language model pre-training • 10 items • Updated Jul 26, 2024 • 8

upvoted a collection about 1 year ago

BGE

31 items • Updated Sep 23 • 137

upvoted a paper about 1 year ago

Molmo and PixMo: Open Weights and Open Data for State-of-the-Art Multimodal Models

Paper • 2409.17146 • Published Sep 25, 2024 • 121

upvoted a collection about 1 year ago

Molmo

Artifacts for open multimodal language models. • 5 items • Updated Apr 30 • 308

upvoted a paper about 1 year ago

OLMoE: Open Mixture-of-Experts Language Models

Paper • 2409.02060 • Published Sep 3, 2024 • 78