Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
174.4
TFLOPS
3
15
88
rasdani
PRO
rasdani
Follow
vaibhavad's profile picture
consdi's profile picture
mike-jiang's profile picture
19 followers
·
71 following
rasdani_
rasdani
rasdani
AI & ML interests
None yet
Recent Activity
updated
a dataset
about 6 hours ago
rasdani/deepseek_r1_qwen14b_swe_rl_8k_56_steps_preds
published
a dataset
about 6 hours ago
rasdani/deepseek_r1_qwen14b_swe_rl_8k_56_steps_preds
updated
a dataset
about 8 hours ago
rasdani/SWE-bench_Verified_oracle_32k_v2_100
View all activity
Organizations
rasdani
's models
36
Sort: Recently updated
rasdani/deepseek_r1_qwen14b_swe_rl_8k
15B
•
Updated
about 8 hours ago
rasdani/qwen3_8b_swe_rl_8k
8B
•
Updated
5 days ago
•
15
rasdani/deepseek_r1_7b_gh_patches_2k_fixed_reward
8B
•
Updated
13 days ago
•
9
rasdani/deepseek_r1_7b_gh_patches_2k
8B
•
Updated
14 days ago
•
11
rasdani/crux-eval_math-eval-logs
Updated
17 days ago
rasdani/git-diff-Qwen-4B-10k
4B
•
Updated
17 days ago
•
23
rasdani/git-diff-Qwen-4B-10k-checkpoints
Updated
17 days ago
rasdani/git-diff-Qwen-4B-32k-checkpoints
Updated
19 days ago
rasdani/git-diff-Qwen-4B-30k
4B
•
Updated
20 days ago
•
12
rasdani/git-diff-Qwen-4B
4B
•
Updated
25 days ago
•
79
rasdani/git-diff-Qwen-1.7B
2B
•
Updated
26 days ago
•
60
rasdani/git-diff-Qwen-1.7-B
2B
•
Updated
26 days ago
•
15
rasdani/simple-math-Qwen-1.5B
2B
•
Updated
27 days ago
•
5
rasdani/qwen3_0_6b_function_rm
0.8B
•
Updated
May 22
•
2
rasdani/Qwen2.5-0.5B-simpleRL-Zoo-8192k
0.5B
•
Updated
Apr 8
•
5
rasdani/Qwen2.5-0.5B-simpleRL-Zoo
Text Generation
•
0.5B
•
Updated
Apr 6
•
13
rasdani/smolR1-Qwen2.5-0.5B
Text Generation
•
0.5B
•
Updated
Mar 31
•
9
•
rasdani/Qwen2.5-0.5B-simpleRL-Zoo-no-KL
Updated
Mar 31
rasdani/Qwen2.5-0.5B-simpleRL-Zoo-3072k
Updated
Mar 31
rasdani/Qwen2.5-0.5B-simpleRL-Zoo-4096k
Updated
Mar 31
rasdani/Qwen2.5-0.5B-simpleRL-Zoo-2560k
Updated
Mar 31
rasdani/Qwen2.5-0.5B-simpleRL-Zoo-2048k
Updated
Mar 31
rasdani/Qwen2.5-0.5B-simpleRL-Zoo-first-try
0.5B
•
Updated
Mar 29
•
3
rasdani/Qwen-1.5B-Distill-GRPO
Text Generation
•
2B
•
Updated
Mar 28
•
14
rasdani/Qwen-0.5B-Instruct-GRPO
Updated
Mar 27
rasdani/gsm8k_qwen2.5-0.5b
0.5B
•
Updated
Mar 11
•
2
rasdani/Qwen2.5-1.5B-Open-R1-Code-GRPO
Updated
Mar 9
rasdani/Qwen2.5-0.5B-Open-R1-Code-GRPO
Text Generation
•
0.6B
•
Updated
Mar 8
•
3
rasdani/Qwen2.5-7B-Instruct-GRPO-unsloth
Text Generation
•
8B
•
Updated
Mar 2
•
5
rasdani/Qwen2.5-3B-Instruct-GRPO-unsloth
Text Generation
•
3B
•
Updated
Mar 1
•
331
Previous
1
2
Next