Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
LM-Parallel
/
grpo_llama-hs-v3_bs64_rollout5-lr1e-5-seq-weighted-kl0.01-20250318005512_global_step_100
like
0
Follow
LM-Parallel
4
Safetensors
llama
Model card
Files
Files and versions
Community
Train
main
grpo_llama-hs-v3_bs64_rollout5-lr1e-5-seq-weighted-kl0.01-20250318005512_global_step_100
Commit History
Upload folder using huggingface_hub
a1f9881
verified
longlian
commited on
12 days ago
initial commit
7cdb40f
verified
longlian
commited on
12 days ago