Yuxiao Qu PRO
CohenQu
AI & ML interests
None yet
Recent Activity
updated a model 20 minutes ago
CohenQu/sft_llama3_3b-finemath-4plus-flexible-ordering.02.01_long-15000_numina-cot-100k_orchard
updated a dataset about 7 hours ago
CohenQu/arxiv_rlad_math_reasoning_benchmark_hints
published a dataset about 7 hours ago
CohenQu/arxiv_rlad_math_reasoning_benchmark_hints
Organizations
Flexible Ordering
- CohenQu/llama3_3b-finemath-4plus-flexible-ordering.00.02
  3B • Updated • 2
- CohenQu/llama3_3b-finemath-4plus-flexible-ordering.00.03
  3B • Updated • 2
- CohenQu/llama3_3b-finemath-4plus-flexible-ordering.00.04
  3B • Updated • 2
- CohenQu/sft_completion_only_finemath-4plus-flexible-ordering.00.00-Step2000_numina-cot-100k_babel
  Text Generation • 4B • Updated • 3
RLAD
- CohenQu/Qwen3-4B-Base_HintGen-withSol.00.00
  Text Generation • 4B • Updated
- CohenQu/Qwen3-4B-Base_HintGen-withSol.00.01
  Text Generation • 4B • Updated • 1
- CohenQu/Qwen3-4B-Base_HintGen-withSol.01.00
  Text Generation • 4B • Updated • 1
- CohenQu/Qwen3-4B-Base_HintGen-withSol.01.01
  Text Generation • 4B • Updated • 1
models
340

CohenQu/sft_llama3_3b-finemath-4plus-flexible-ordering.02.01_long-15000_numina-cot-100k_orchard
Text Generation • 4B • Updated

CohenQu/sft_llama3_3b-finemath-4plus-flexible-ordering.02.01_long-20000_numina-cot-100k_orchard
4B • Updated

CohenQu/sft_llama3_3b-finemath-4plus-flexible-ordering.02.01_long-10000_numina-cot-100k_orchard
Text Generation • 4B • Updated

CohenQu/sft_llama3_3b-finemath-4plus-flexible-ordering.02.01_long-5000_numina-cot-100k_orchard
Text Generation • 4B • Updated

CohenQu/sft_llama3_3b-finemath-4plus-flexible-ordering.00.00_long-5000_numina-cot-100k_orchard
Text Generation • 4B • Updated

CohenQu/sft_llama3_3b-finemath-4plus-flexible-ordering.00.00_long-15000_numina-cot-100k_orchard
Text Generation • 4B • Updated

CohenQu/sft_llama3_3b-finemath-4plus-flexible-ordering.00.00_long-25000_numina-cot-100k_orchard
Text Generation • 4B • Updated

CohenQu/sft_llama3_3b-finemath-4plus-flexible-ordering.00.00_long-45000_numina-cot-100k_orchard
Text Generation • 4B • Updated

CohenQu/sft_llama3_3b-finemath-4plus-flexible-ordering.00.00_long-40000_numina-cot-100k_orchard
Text Generation • 4B • Updated

CohenQu/sft_llama3_3b-finemath-4plus-flexible-ordering.00.00_long-20000_numina-cot-100k_orchard
Text Generation • 4B • Updated
datasets
185
CohenQu/arxiv_rlad_math_reasoning_benchmark_hints
Viewer • Updated • 600
CohenQu/Joint_train_AceReason_AIME_HMMT_filter_RL
Viewer • Updated • 3.9k • 137
CohenQu/Joint_train_stage3_filter_RL
Viewer • Updated • 4.14k • 106
CohenQu/Joint_train_AIME_HMMT_RL
Viewer • Updated • 190 • 111
CohenQu/finemath-4plus-flexible-ordering.02.02
Viewer • Updated • 13.4M • 313
CohenQu/finemath-4plus-flexible-ordering.02.01
Viewer • Updated • 6.7M • 264
CohenQu/finemath-4plus-flexible-ordering.02.04
Viewer • Updated • 26.8M • 462
CohenQu/Continue_vs_Terminate.04.01
Viewer • Updated • 6.98k • 110
CohenQu/Continue_vs_Terminate.04.00
Viewer • Updated • 6.98k • 107
CohenQu/continue_vs_terminate_Qwen3-1.7B_DAPO-Math-en_0-2000_8000-10000_counterfactual_reason
Viewer • Updated • 6.98k • 119