A small Qwen3 1.7B model trained on a DeepSeek R1 0528 distill, with an 8192-token context length.
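
Because the context length is 8192 tokens, the prompt and the generated tokens have to share that window. The following is a minimal sketch of one way to budget for this, assuming the prompt is already tokenized into a list of IDs; the helper name `fit_to_context` and the keep-the-most-recent-tokens policy are illustrative assumptions, not part of this model's tooling.

```python
MAX_CONTEXT = 8192  # context length stated in the description above


def fit_to_context(prompt_ids, max_new_tokens, max_context=MAX_CONTEXT):
    """Trim a tokenized prompt so prompt + generation fits in the window.

    Hypothetical helper: keeps the most recent tokens and drops the oldest
    ones, which is only one of several possible truncation policies.
    """
    if not 0 < max_new_tokens < max_context:
        raise ValueError("max_new_tokens must be between 1 and max_context - 1")
    budget = max_context - max_new_tokens  # tokens left for the prompt
    return list(prompt_ids[-budget:])


if __name__ == "__main__":
    fake_prompt = list(range(10_000))  # stand-in for real token IDs
    trimmed = fit_to_context(fake_prompt, max_new_tokens=1024)
    print(len(trimmed))
```

Left truncation suits chat-style use, where the latest turns usually matter most; a system prompt at the start would need to be preserved separately.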