Anirudh Buvanesh
anirudhb11
AI & ML interests
None yet
Recent Activity
updated
a dataset
about 7 hours ago
anirudhb11/R1-1.5b-Par-Temp-0.7-Ans-40-16384-s-42-deg-64-path-3-n-16000-s-400-e-500
published
a dataset
about 7 hours ago
anirudhb11/R1-1.5b-Par-Temp-0.7-Ans-40-16384-s-42-deg-64-path-3-n-16000-s-400-e-500
updated
a dataset
about 7 hours ago
anirudhb11/R1-1.5b-Par-Temp-0.7-Ans-40-16384-s-42-deg-64-path-3-n-16000-s-15300-e-15400
Organizations
None yet
anirudhb11's activity
[Experiment] Applying GRPO to DeepSeek-R1-Distill-Qwen-1.5B with LIMO
🔥
😎
22
22
#15 opened 4 months ago
by
lewtun
