mlfoundations-dev/Qwen2.5-7B-Instruct_qwq_mix_r1_science
Text Generation
•
8B
•
Updated
•
6
•
1
mlfoundations-dev/dclm_baseline_openthoughts1
Text Generation
•
7B
•
Updated
•
6
mlfoundations-dev/dclm_baseline_it_openthoughts3_30k
7B
•
Updated
•
4
mlfoundations-dev/openthoughts3_100k_qwen25_1b_bsz256_lr16e5_epochs5
Text Generation
•
2B
•
Updated
•
7
mlfoundations-dev/openthoughts3_100k_qwen25_1b_bsz256_lr2e5_epochs5
Text Generation
•
2B
•
Updated
•
7
mlfoundations-dev/openthoughts3_100k_qwen25_1b_bsz256_lr8e5_epochs5
Text Generation
•
2B
•
Updated
•
5
mlfoundations-dev/openthoughts3_100k_qwen25_1b_bsz1024_lr16e5_epochs5
Text Generation
•
2B
•
Updated
•
9
mlfoundations-dev/openthoughts3_100k_qwen25_1b_bsz1024_lr8e5_epochs5
Text Generation
•
2B
•
Updated
•
24
mlfoundations-dev/openthoughts3_100k_qwen25_1b_bsz1024_lr4e5_epochs5
Text Generation
•
2B
•
Updated
•
6
mlfoundations-dev/openthoughts3_100k_qwen25_1b_bsz1024_lr2e5_epochs5
Text Generation
•
2B
•
Updated
•
7
mlfoundations-dev/openthoughts3_100k_qwen25_1b_bsz512_lr8e5_epochs5
Text Generation
•
2B
•
Updated
•
4
mlfoundations-dev/openthoughts3_100k_qwen25_1b_bsz512_lr4e5_epochs5
Text Generation
•
2B
•
Updated
•
5
mlfoundations-dev/openthoughts3_100k_qwen25_1b_bsz512_lr16e5_epochs5
Text Generation
•
2B
•
Updated
•
6
mlfoundations-dev/openthoughts3_100k_qwen25_1b_bsz256_lr16e5_epochs7
Text Generation
•
2B
•
Updated
•
4
mlfoundations-dev/openthoughts3_100k_qwen25_1b_bsz256_lr8e5_epochs7
Text Generation
•
2B
•
Updated
•
3
mlfoundations-dev/openthoughts3_100k_qwen25_1b_bsz256_lr2e5_epochs7
Text Generation
•
2B
•
Updated
•
3
mlfoundations-dev/openthoughts3_100k_qwen25_1b_bsz256_lr4e5_epochs7
Text Generation
•
2B
•
Updated
•
5
mlfoundations-dev/openthoughts3_100k_qwen25_1b_bsz256_lr4e5_epochs5
Text Generation
•
2B
•
Updated
•
4
mlfoundations-dev/openthoughts3_100k_qwen25_1b_bsz512_lr2e5_epochs5
Text Generation
•
2B
•
Updated
•
5
mlfoundations-dev/QwQ-32B_openthoughts3_300k
Text Generation
•
33B
•
Updated
•
7
mlfoundations-dev/Qwen2.5-7B-Instruct_openthoughts3_300k_annotated_Qwen3-32B
Text Generation
•
8B
•
Updated
•
8
•
1
mlfoundations-dev/DeepSeek-R1-Distill-Qwen-7B_OpenThoughts3
Text Generation
•
8B
•
Updated
•
8
mlfoundations-dev/DeepSeek-R1-Distill-Qwen-1.5B_OpenThoughts3
Text Generation
•
2B
•
Updated
•
5
mlfoundations-dev/QwQ-32B_enable-liger-kernel_False_OpenThoughts3_10k
Text Generation
•
33B
•
Updated
•
5
mlfoundations-dev/QwQ-32B_enable-liger-kernel_False_OpenThoughts3_3k
Text Generation
•
33B
•
Updated
•
7
mlfoundations-dev/QwQ-32B_enable-liger-kernel_False_OpenThoughts3_1k
Text Generation
•
33B
•
Updated
•
7
mlfoundations-dev/Qwen2.5-7B-Instruct_openthoughts3_math_100k_annotated_QwQ-32B
Text Generation
•
8B
•
Updated
•
4
mlfoundations-dev/OpenThoughts3_1.5B
Text Generation
•
2B
•
Updated
•
7
mlfoundations-dev/QwQ-32B_openthoughts3_100k
Text Generation
•
33B
•
Updated
•
5
mlfoundations-dev/openthoughts3_30k_llama3
Text Generation
•
8B
•
Updated
•
10
•
1