Hugo Laurençon

HugoLaurencon

AI & ML interests

None yet

Recent Activity

upvoted a paper 11 days ago

Scaling Laws for Optimal Data Mixtures

Paper • 2507.09404 • Published 15 days ago • 32

upvoted a paper about 1 month ago

Reinforcement Learning with Verifiable Rewards Implicitly Incentivizes Correct Reasoning in Base LLMs

Paper • 2506.14245 • Published Jun 17 • 39

upvoted 2 papers about 2 months ago

REASONING GYM: Reasoning Environments for Reinforcement Learning with Verifiable Rewards

Paper • 2505.24760 • Published May 30 • 66

Universal Reasoner: A Single, Composable Plug-and-Play Reasoner for Frozen LLMs

Paper • 2505.19075 • Published May 25 • 21

upvoted a paper 2 months ago

BLIP3-o: A Family of Fully Open Unified Multimodal Models-Architecture, Training and Dataset

Paper • 2505.09568 • Published May 14 • 95

upvoted a paper 3 months ago

Does Reinforcement Learning Really Incentivize Reasoning Capacity in LLMs Beyond the Base Model?

Paper • 2504.13837 • Published Apr 18 • 132

upvoted 2 papers 4 months ago

Scaling Laws for Native Multimodal Models

Paper • 2504.07951 • Published Apr 10 • 29

OLMoTrace: Tracing Language Model Outputs Back to Trillions of Training Tokens

Paper • 2504.07096 • Published Apr 9 • 75

upvoted a collection 4 months ago

Llama 4

Collection

Llama 4 release • 13 items • Updated Apr 29 • 587

upvoted a paper 4 months ago

DAPO: An Open-Source LLM Reinforcement Learning System at Scale

Paper • 2503.14476 • Published Mar 18 • 135

upvoted an article 5 months ago

Open-source DeepResearch – Freeing our search agents

Article • and 4 others • Feb 4 • 1.28k

upvoted 3 papers 5 months ago

Nougat: Neural Optical Understanding for Academic Documents

Paper • 2308.13418 • Published Aug 25, 2023 • 39

AlphaDrive: Unleashing the Power of VLMs in Autonomous Driving via Reinforcement Learning and Reasoning

Paper • 2503.07608 • Published Mar 10 • 23

MLGym: A New Framework and Benchmark for Advancing AI Research Agents

Paper • 2502.14499 • Published Feb 20 • 193

upvoted an article 6 months ago

π0 and π0-FAST: Vision-Language-Action Models for General Robot Control

Article • and 3 others • Feb 4 • 167

upvoted 3 papers 6 months ago

upvoted 2 papers 7 months ago

2.5 Years in Class: A Multimodal Textbook for Vision-Language Pretraining

Paper • 2501.00958 • Published Jan 1 • 107

Qwen2.5 Technical Report

Paper • 2412.15115 • Published Dec 19, 2024 • 372