opria123
/

SmolGRPO-135M

Text Generation

Reasoning-Course

text-generation-inference

Model card Files Files and versions Community

Model Card for Model ID

Model Details

Model Description

This model is from the GRPO section of the 🤗 LLM Course.

Downloads last month: 146

Safetensors

Model size

135M params

Tensor type

BF16

·

Inference Providers NEW

Text Generation

This model isn't deployed by any Inference Provider. 🙋 Ask for provider support