Temporal Sampling UWNSL/Qwen2.5-7B-deepscaler_4k_step_224 8B • Updated about 1 month ago • 5 UWNSL/Qwen2.5-7B-deepscaler_4k_step_160 8B • Updated about 1 month ago • 5 UWNSL/Qwen2.5-7B-deepscaler_4k_step_192 8B • Updated about 1 month ago • 5 UWNSL/Qwen2.5-7B-deepscaler_4k_step_256 8B • Updated about 1 month ago • 5
Small Model Learnability Gap: Models UWNSL/Qwen2.5-32B-Instruct_Short_CoT_lora Updated Dec 22, 2024 • 4 UWNSL/Qwen2.5-32B-Instruct_Long_CoT_lora Updated Dec 22, 2024 • 7 UWNSL/Qwen2.5-0.5B-Instruct_Long_CoT Text Generation • 0.5B • Updated Dec 22, 2024 • 13 UWNSL/Qwen2.5-0.5B-Instruct_Short_CoT Text Generation • 0.5B • Updated Dec 22, 2024 • 16
SafeChain UWNSL/SafeChain Viewer • Updated Feb 18 • 40k • 87 • 7 UWNSL/WildJailbreakEval Viewer • Updated Feb 22 • 300 • 199 UWNSL/DeepSeek-R1-Distill-Qwen-7B-SafeChain Text Generation • 8B • Updated Apr 2 • 46 UWNSL/DeepSeek-R1-Distill-Llama-8B-SafeChain Text Generation • 8B • Updated Apr 2 • 33
Small Model Learnability Gap: Dataset UWNSL/MATH_training_split_distill_large_teacher Viewer • Updated Feb 21 • 4.54k • 36 UWNSL/MATH_training_split_long_cot Viewer • Updated Feb 21 • 5.38k • 129 UWNSL/MATH_training_split_short_cot Viewer • Updated Feb 21 • 5.38k • 144 • 2 UWNSL/MATH_training_split_distill_small_teacher Viewer • Updated Feb 21 • 4.54k • 38
Temporal Sampling UWNSL/Qwen2.5-7B-deepscaler_4k_step_224 8B • Updated about 1 month ago • 5 UWNSL/Qwen2.5-7B-deepscaler_4k_step_160 8B • Updated about 1 month ago • 5 UWNSL/Qwen2.5-7B-deepscaler_4k_step_192 8B • Updated about 1 month ago • 5 UWNSL/Qwen2.5-7B-deepscaler_4k_step_256 8B • Updated about 1 month ago • 5
SafeChain UWNSL/SafeChain Viewer • Updated Feb 18 • 40k • 87 • 7 UWNSL/WildJailbreakEval Viewer • Updated Feb 22 • 300 • 199 UWNSL/DeepSeek-R1-Distill-Qwen-7B-SafeChain Text Generation • 8B • Updated Apr 2 • 46 UWNSL/DeepSeek-R1-Distill-Llama-8B-SafeChain Text Generation • 8B • Updated Apr 2 • 33
Small Model Learnability Gap: Models UWNSL/Qwen2.5-32B-Instruct_Short_CoT_lora Updated Dec 22, 2024 • 4 UWNSL/Qwen2.5-32B-Instruct_Long_CoT_lora Updated Dec 22, 2024 • 7 UWNSL/Qwen2.5-0.5B-Instruct_Long_CoT Text Generation • 0.5B • Updated Dec 22, 2024 • 13 UWNSL/Qwen2.5-0.5B-Instruct_Short_CoT Text Generation • 0.5B • Updated Dec 22, 2024 • 16
Small Model Learnability Gap: Dataset UWNSL/MATH_training_split_distill_large_teacher Viewer • Updated Feb 21 • 4.54k • 36 UWNSL/MATH_training_split_long_cot Viewer • Updated Feb 21 • 5.38k • 129 UWNSL/MATH_training_split_short_cot Viewer • Updated Feb 21 • 5.38k • 144 • 2 UWNSL/MATH_training_split_distill_small_teacher Viewer • Updated Feb 21 • 4.54k • 38