rshwndsz/Llama-3.2-1B-SFT-DPO-bm
Safetensors
llama
No model card
Downloads last month: 3
Safetensors
Model size: 1.24B params
Tensor type: BF16
Chat template
Collection including rshwndsz/Llama-3.2-1B-SFT-DPO-bm:
Janus (Analysing the RLHF pipeline), 104 items, updated Jul 2