
deepseek-ai/DeepSeek-R1-Distill-Llama-8B
Text Generation
•
Updated
•
817k
•
•
568
This is a collection of Llama and Qwen-based models ranging from 1.5B to 70B parameters with are distilled from DeepSeek's new R1 models.