Model Card for Model ID

Model Details

Model Description

This model is from the GRPO section of the ๐Ÿค— LLM Course.

Downloads last month
146
Safetensors
Model size
135M params
Tensor type
BF16
ยท
Inference Providers NEW
This model isn't deployed by any Inference Provider. ๐Ÿ™‹ Ask for provider support