plaguss (Agustín Piqueres Lajarín)

upvoted 2 articles about 1 year ago

Article

Open R1: Update #3

Mar 11, 2025

•

297

Article

Open R1: Update #2

Feb 10, 2025

•

218

upvoted a paper about 1 year ago

SmolLM2: When Smol Goes Big -- Data-Centric Training of a Small Language Model

Paper • 2502.02737 • Published Feb 4, 2025 • 258

upvoted 2 articles about 1 year ago

Article

FuseO1-Preview: System-II Reasoning Fusion of LLMs

Jan 20, 2025

•

22

Article

Open-R1: Update #1

Feb 2, 2025

•

305

upvoted an article over 1 year ago

Article

Open-R1: a fully open reproduction of DeepSeek-R1

+1

Jan 28, 2025

•

887

upvoted a paper over 1 year ago

The Lessons of Developing Process Reward Models in Mathematical Reasoning

Paper • 2501.07301 • Published Jan 13, 2025 • 100

upvoted an article over 1 year ago

Article

Python Is All You Need? Introducing Dria-Agent-α

Jan 10, 2025

•

27

upvoted a collection over 1 year ago

Scaling Test-Time Compute with Open Models

Collection

Models and datasets used in our blog post: https://huggingface.co/spaces/HuggingFaceH4/blogpost-scaling-test-time-compute • 10 items • Updated Jan 6, 2025 • 30

upvoted an article over 1 year ago

Article

Process Reinforcement through Implicit Rewards

Jan 3, 2025

•

31

upvoted 3 papers over 1 year ago

upvoted a collection over 1 year ago

SmolVLM

Collection

State-of-the-art compact VLMs for on-device applications: Base, Synthetic, and Instruct. Check our blog: https://huggingface.co/blog/smolvlm • 5 items • Updated May 5, 2025 • 42

upvoted 2 articles over 1 year ago

Article

Introducing Observers: AI Observability with Hugging Face datasets through a lightweight SDK

Nov 21, 2024

•

35

Article

Halo: Open Source Health Tracking with Wearables

Nov 19, 2024

•

117

upvoted a paper over 1 year ago

Aligning Large Language Models via Self-Steering Optimization

Paper • 2410.17131 • Published Oct 22, 2024 • 24

upvoted 3 articles over 1 year ago

Article

Releasing the largest multilingual open pretraining dataset

Nov 13, 2024

•

106

Article

Releasing Outlines-core 0.1.0: structured generation in Rust and Python

+5

Oct 22, 2024

•

44

Article

How to build a custom text classifier without days of human labeling

Oct 17, 2024

•

57

Agustín Piqueres Lajarín

AI & ML interests

Organizations

Open R1: Update #3

Open R1: Update #2

SmolLM2: When Smol Goes Big -- Data-Centric Training of a Small Language Model

FuseO1-Preview: System-II Reasoning Fusion of LLMs

Open-R1: Update #1

Open-R1: a fully open reproduction of DeepSeek-R1

The Lessons of Developing Process Reward Models in Mathematical Reasoning

Python Is All You Need? Introducing Dria-Agent-α

Scaling Test-Time Compute with Open Models

Process Reinforcement through Implicit Rewards

ProcessBench: Identifying Process Errors in Mathematical Reasoning

Free Process Rewards without Process Labels

Solving math word problems with process- and outcome-based feedback

SmolVLM

Introducing Observers: AI Observability with Hugging Face datasets through a lightweight SDK

Halo: Open Source Health Tracking with Wearables

Aligning Large Language Models via Self-Steering Optimization

Releasing the largest multilingual open pretraining dataset

Releasing Outlines-core 0.1.0: structured generation in Rust and Python

How to build a custom text classifier without days of human labeling

Agustín Piqueres Lajarín

AI & ML interests

Organizations

plaguss's activity

Open R1: Update #3

Open R1: Update #2

FuseO1-Preview: System-II Reasoning Fusion of LLMs

Open-R1: Update #1

Open-R1: a fully open reproduction of DeepSeek-R1

Python Is All You Need? Introducing Dria-Agent-α

Process Reinforcement through Implicit Rewards

Introducing Observers: AI Observability with Hugging Face datasets through a lightweight SDK

Halo: Open Source Health Tracking with Wearables

Releasing the largest multilingual open pretraining dataset

Releasing Outlines-core 0.1.0: structured generation in Rust and Python

How to build a custom text classifier without days of human labeling