Yuxiao Qu PRO
CohenQu
AI & ML interests
None yet
Recent Activity
Published a dataset about 17 hours ago: CohenQu/RLAD-DeepScalaR_SolGen_BATCH
Flexible Ordering
CohenQu/llama3_3b-finemath-4plus-flexible-ordering.00.02
3B • Updated • 42
CohenQu/llama3_3b-finemath-4plus-flexible-ordering.00.03
3B • Updated • 9
CohenQu/llama3_3b-finemath-4plus-flexible-ordering.00.04
3B • Updated • 30
CohenQu/sft_completion_only_finemath-4plus-flexible-ordering.00.00-Step2000_numina-cot-100k_babel
Text Generation • 4B • Updated • 22
Hint Generation
models
296

CohenQu/Joint-Train-deepscalar_RL_hard_500_verl_0.35_0.001_0.001_32_32_20k_4
2B • Updated
CohenQu/sft_llama3_3b-finemath-4plus-flexible-ordering.00.07-8000_numina-cot-100k_orchard
Text Generation • 4B • Updated
CohenQu/sft_llama3_3b-finemath-4plus-flexible-ordering.00.07-4000_numina-cot-100k_orchard
Text Generation • 4B • Updated
CohenQu/sft_llama3_3b-finemath-4plus-flexible-ordering.00.07-2000_numina-cot-100k_babel
Text Generation • 4B • Updated
CohenQu/sft_llama3_3b-finemath-4plus-flexible-ordering.00.07-2000_numina-cot-100k_orchard
Text Generation • 4B • Updated
CohenQu/sft_llama3_3b-finemath-4plus-flexible-ordering.00.05-8000_numina-cot-100k_orchard
Text Generation • 4B • Updated
CohenQu/sft_llama3_3b-finemath-4plus-flexible-ordering.00.05-6000_numina-cot-100k_orchard
Text Generation • 4B • Updated
CohenQu/sft_llama3_3b-finemath-4plus-flexible-ordering.00.05-4000_numina-cot-100k_orchard
Text Generation • 4B • Updated
CohenQu/Qwen3-1.7B_Continue_vs_Terminate.03.00
Text Generation • 2B • Updated
CohenQu/sft_llama3_3b-finemath-4plus-flexible-ordering.00.05-2000_numina-cot-100k_orchard
Text Generation • 4B • Updated
datasets
159
CohenQu/Continue_vs_Terminate.03.00
Viewer • Updated • 3.58k
CohenQu/Continue_vs_Terminate.03.01
Viewer • Updated • 3.58k
CohenQu/continue_vs_terminate_Qwen3-1.7B_DAPO-Math-en_0-2000_counterfactual_reason
Viewer • Updated • 3.58k
CohenQu/continue_vs_terminate_Qwen3-1.7B_DAPO-Math-en_0-2000_counterfactual
Viewer • Updated • 3.58k
CohenQu/continue_vs_terminate_Qwen3-1.7B_DAPO-Math-en_0-2000
Viewer • Updated • 76.2k
CohenQu/finemath-4plus-flexible-ordering.00.05
Viewer • Updated • 10M • 50
CohenQu/finemath-4plus-flexible-ordering.00.06
Viewer • Updated • 6.7M • 32
CohenQu/finemath-4plus-flexible-ordering.00.07
Viewer • Updated • 6.7M • 51
CohenQu/finemath-4plus-flexible-ordering.00.02_length_BATCH
Viewer • Updated • 1k • 36
CohenQu/e3-math-medhard
Viewer • Updated • 2.5k • 74