Novel training procedure to deslopify instruct/assistant models.

No SFT.

Pure RL with a good signal.

Format: Safetensors
Model size: 3.09B params
Tensor type: FP16