Leon Lee

Leon-Leee · yucc-leon

AI & ML interests

LLMs, code generation, chatbot, workflows

Recent Activity

liked a model 6 days ago

IAAR-Shanghai/xVerify-8B-I

liked a dataset 14 days ago

ByteDance-Seed/Code-Contests-Plus

upvoted a paper 7 days ago

OctoThinker: Mid-training Incentivizes Reinforcement Learning Scaling

Paper • 2506.20512 • Published 8 days ago • 42

upvoted a paper 15 days ago

MegaMath: Pushing the Limits of Open Math Corpora

Paper • 2504.02807 • Published Apr 3 • 33

upvoted a paper 17 days ago

SWE-Factory: Your Automated Factory for Issue Resolution Training Data and Evaluation Benchmarks

Paper • 2506.10954 • Published 21 days ago • 51

upvoted a collection 29 days ago

Qwen3

72 items • Updated 18 days ago • 818

upvoted a paper about 1 month ago

Learn to Reason Efficiently with Adaptive Length-based Reward Shaping

Paper • 2505.15612 • Published May 21 • 33

upvoted 2 articles about 2 months ago

wHy DoNt YoU jUsT uSe ThE lLaMa ToKeNiZeR??

Article • Sep 27, 2024 • 46

I trained a Language Model to schedule events with GRPO!

Article • Apr 29 • 80

upvoted a paper 4 months ago

Can Large Language Models Detect Errors in Long Chain-of-Thought Reasoning?

Paper • 2502.19361 • Published Feb 26 • 28

upvoted an article 5 months ago

Revisiting TemplateGSM: Advancing Mathematical Reasoning in Language Models with Template-based Data Generation

Article • Nov 14, 2024 • 2

upvoted a collection 5 months ago

🧠 Reasoning datasets

Datasets with reasoning traces for math and code released by the community • 24 items • Updated May 19 • 154

upvoted a collection 7 months ago

Tulu 3 Datasets

All datasets released with Tulu 3 -- state of the art open post-training recipes. • 33 items • Updated Apr 30 • 86

upvoted a collection 9 months ago

ProX Refining Models

Adapted small language models used to generate data refining programs • 5 items • Updated Oct 10, 2024 • 4

upvoted a collection 12 months ago

Magpie-Qwen2 Datasets

Dataset built with Qwen2 72B and Qwen2 7B. • 6 items • Updated Jan 13 • 10

upvoted 3 papers about 1 year ago

The FineWeb Datasets: Decanting the Web for the Finest Text Data at Scale

Paper • 2406.17557 • Published Jun 25, 2024 • 98

DeepSeek-Coder-V2: Breaking the Barrier of Closed-Source Models in Code Intelligence

Paper • 2406.11931 • Published Jun 17, 2024 • 65

Instruction Pre-Training: Language Models are Supervised Multitask Learners

Paper • 2406.14491 • Published Jun 20, 2024 • 94

upvoted 2 articles about 1 year ago

Cosmopedia: how to create large-scale synthetic data for pre-training Large Language Models

Article • By … and 2 others • Mar 20, 2024 • 96

LLM Data Engineering 3: Data Collection Magic, Methods for Obtaining Top-Tier Training Data

Article • Jun 4, 2024 • 23

upvoted 2 papers about 1 year ago

StarCoder 2 and The Stack v2: The Next Generation

Paper • 2402.19173 • Published Feb 29, 2024 • 147

AutoCoder: Enhancing Code Large Language Model with AIEV-Instruct

Paper • 2405.14906 • Published May 23, 2024 • 28