jungyup2

izlley

1 11 41

izlley

AI & ML interests

None yet

Organizations

upvoted a collection about 2 months ago

DNA 3.0

Collection

DNA 3.0 preserves Qwen 3.5/3.6 strengths while removing censorship on China-related topics. • 11 items • Updated 12 days ago • 5

upvoted a paper 2 months ago

Soohak: A Mathematician-Curated Benchmark for Evaluating Research-level Math Capabilities of LLMs

Paper • 2605.09063 • Published May 9 • 82

upvoted 2 collections 7 months ago

Nemotron-Pre-Training-Datasets

Collection

Large scale pre-training datasets used in the Nemotron family of models. • 15 items • Updated 8 days ago • 178

Nemotron-Post-Training-v3

Collection

Collection of datasets used in the post-training phase of Nemotron Nano, Super, and Ultra v3. • 50 items • Updated 8 days ago • 181

upvoted an article 12 months ago

Article

SmolLM3: smol, multilingual, long-context reasoner

eliebak, cmpatino, anton-l, edbeeching, m-ric, nouamanetazi, akseljoonas, guipenedo, hynky, clefourrier, SaylorTwift, kashif, qgallouedec, hlarcher, glutamatt, Xenova, reach-vb, ngxson, craffel, lewtun, loubnabnl, lvwerra, thomwolf

•

Jul 8, 2025

• 783

upvoted a paper over 1 year ago

Inference-Time Scaling for Generalist Reward Modeling

Paper • 2504.02495 • Published Apr 3, 2025 • 58

upvoted 2 collections over 1 year ago

DNA-R1

Collection

Reasoning model distilled from DeepSeek-R1, enhanced with GRPO using supplementary reasoning datasets. • 1 item • Updated Jan 26 • 2

🧠 Reasoning datasets

Collection

Datasets with reasoning traces for math and code released by the community • 24 items • Updated May 19, 2025 • 190

upvoted an article over 1 year ago

Article

Open R1: Update #2

open-r1

•

Feb 10, 2025

• 219

upvoted a paper over 2 years ago

Mamba: Linear-Time Sequence Modeling with Selective State Spaces

Paper • 2312.00752 • Published Dec 1, 2023 • 152

jungyup2

AI & ML interests

Organizations

izlley's activity

SmolLM3: smol, multilingual, long-context reasoner

Open R1: Update #2