Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
106
39
23
TY.Zheng
aaabiao
Follow
SivilTaram's profile picture
thomwolf's profile picture
mkj69's profile picture
20 followers
·
9 following
https://scholar.google.com/citations?user=Vq-VZnUAAAAJ&hl=zh-CN
Zheng0428
AI & ML interests
None yet
Recent Activity
upvoted
a
paper
3 days ago
Mini-o3: Scaling Up Reasoning Patterns and Interaction Turns for Visual Search
upvoted
a
paper
4 days ago
Reverse-Engineered Reasoning for Open-Ended Generation
updated
a collection
9 days ago
Code Synthetic RL Rollout
View all activity
Organizations
aaabiao
's models
62
Sort: Recently updated
aaabiao/qwen3_14b_think_32B_math_reject_sampling_150step_0706
15B
•
Updated
Jul 5
•
12
aaabiao/qwen3_14b_no_think_32B_math_reject_sampling_150step_0706
15B
•
Updated
Jul 5
•
11
aaabiao/qwen3_14b_think_32B_math_reject_sampling_150step
15B
•
Updated
Jul 2
•
11
aaabiao/qwen3_14b_no_think_32B_math_reject_sampling_150step
15B
•
Updated
Jul 2
•
12
aaabiao/qwen3_14b_distill_no_think_32b_5e5_150step_fix2
15B
•
Updated
Jun 29
•
13
aaabiao/verl-8B-100step-v1
8B
•
Updated
Jun 29
•
13
aaabiao/verl-4B-100step-v1
4B
•
Updated
Jun 28
•
11
aaabiao/qwen3_14b_distill_no_think_32b_5e5_150step_fix
15B
•
Updated
Jun 28
•
11
aaabiao/verl-14B-60step-v1
15B
•
Updated
Jun 27
•
12
aaabiao/verl-14B-120step-v1
15B
•
Updated
Jun 27
•
13
aaabiao/verl-14B-40step-v1
15B
•
Updated
Jun 27
•
10
aaabiao/verl-14B-100step-v1
15B
•
Updated
Jun 27
•
10
aaabiao/verl-14B-20step-v1
15B
•
Updated
Jun 27
•
9
aaabiao/verl-14B-80step-v1
15B
•
Updated
Jun 27
•
9
aaabiao/qwen3_14b_distill_think_32b_5e5_285step
15B
•
Updated
Jun 27
•
10
aaabiao/qwen3_14b_distill_think_32b_5e5_250step
15B
•
Updated
Jun 27
•
10
aaabiao/qwen3_14b_distill_think_32b_5e5_200step
15B
•
Updated
Jun 27
•
9
aaabiao/qwen3_14b_distill_think_32b_5e5_100step
15B
•
Updated
Jun 27
•
9
aaabiao/qwen3_14b_distill_think_32b_5e5_50step
15B
•
Updated
Jun 27
•
10
aaabiao/qwen3_4b_math_rl_60step
4B
•
Updated
Jun 27
•
9
aaabiao/qwen3_4b_rl_60step
Updated
Jun 27
aaabiao/qwen3_14b_distill_think_32b_5e5_150step
15B
•
Updated
Jun 20
•
8
aaabiao/qwen3_14b_distill_no_think_32b_5e5_150step
15B
•
Updated
Jun 20
•
8
aaabiao/qwen3_14b_distill_no_think_32b_5e5
Text Generation
•
15B
•
Updated
Jun 20
•
11
aaabiao/qwen3_14b_distill_32b_5e5_700step
15B
•
Updated
Jun 17
•
8
aaabiao/qwen3_14b_distill_32b_5e5_450step
15B
•
Updated
Jun 17
•
8
aaabiao/qwen3_14b_distill_32b_5e5
Text Generation
•
15B
•
Updated
Jun 17
•
18
aaabiao/verl-14B-140step-v1
15B
•
Updated
Jun 16
•
8
aaabiao/verl-14B-160step
15B
•
Updated
Jun 15
•
7
aaabiao/verl-14B-140step
15B
•
Updated
Jun 15
•
8
Previous
1
2
3
Next