Aurelien Lucchi
alucchi
AI & ML interests
None yet
Recent Activity
updated a dataset about 13 hours ago
alucchi/Qwen3-4B_n1000_e10_oadam0.0001_b20_1_a10
published a dataset about 13 hours ago
alucchi/Qwen3-4B_n1000_e10_oadam0.0001_b20_1_a10
updated a dataset about 19 hours ago
alucchi/Qwen3-4B_n1000_e20_oadam0.0001_b20_1_a0
Organizations
None yet
alucchi's activity
[Experiment] Applying GRPO to DeepSeek-R1-Distill-Qwen-1.5B with LIMO
#15 opened 4 months ago by lewtun
