Tulu 3 Datasets Collection All datasets released with Tulu 3 -- state-of-the-art open post-training recipes. • 33 items • Updated Apr 30 • 88
OpenCoder Collection OpenCoder is an open and reproducible code LLM family that matches the performance of top-tier code LLMs. • 8 items • Updated Nov 23, 2024 • 85
Pythia Scaling Suite Collection Pythia is the first LLM suite designed specifically to enable scientific research on LLMs. To learn more, see https://github.com/EleutherAI/pythia • 18 items • Updated Feb 26 • 31
Qwen2.5 Collection Qwen2.5 language models, including pretrained and instruction-tuned models in 7 sizes: 0.5B, 1.5B, 3B, 7B, 14B, 32B, and 72B. • 46 items • Updated Jul 21 • 632
Article Preference Tuning LLMs with Direct Preference Optimization Methods By kashif and 4 others • Jan 18, 2024 • 70
Article A failed experiment: Infini-Attention, and why we should keep trying? By neuralink and 2 others • Aug 14, 2024 • 69
Article SmolLM - blazingly fast and remarkably powerful By loubnabnl and 2 others • Jul 16, 2024 • 411
Article Cosmopedia: how to create large-scale synthetic data for pre-training Large Language Models By loubnabnl and 2 others • Mar 20, 2024 • 102