AI & ML interests
None yet
Organizations
None yet
Models
10
saswatach/Qwen2-1.5B-GRPO_1024
saswatach/grpo_countdown_test_3
saswatach/grpo_countdown_test_2
saswatach/grpo_countdown_test_1
saswatach/qwen-r1-aha-moment
saswatach/Qwen2-1.5B-GRPO-test
saswatach/Qwen2-1.5B-GRPO-test2
saswatach/Qwen2-0.5B-GRPO-test2
saswatach/Qwen2-0.5B-GRPO-test