A small qwen 3 model trained on 34000 data collected from open-r1/mixture-of-thought.
Chat template
4-bit
5-bit
8-bit
Base model