reproducing DeepSeek R1 Zero with Qwen2.5-0.5B on two 4090 GPUs
rasdani PRO
rasdani
AI & ML interests
None yet
Recent Activity
updated
a dataset
29 minutes ago
rasdani/SWE-bench_Verified_oracle_32k_v2_100
published
a dataset
29 minutes ago
rasdani/SWE-bench_Verified_oracle_32k_v2_100
updated
a dataset
29 minutes ago
rasdani/SWE-bench_Verified_oracle_32k_v2_50