Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
hdong0
/
deepseek-Llama-8B-baseline-Open-R1-GRPO_deepscaler_acc_mu_8_constant_lr_warmed_no_kl
like
0
Safetensors
llama
custom_code
Model card
Files
Files and versions
Community
No model card
Downloads last month
17
Safetensors
Model size
8.3B params
Tensor type
BF16
·
Chat template
Files info
Inference Providers
NEW
This model isn't deployed by any Inference Provider.
🙋
Ask for provider support