Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
LM-Parallel
/
grpo_llama-hs-v3_bs64_rollout5-lr1e-5-seq-weighted-kl0.01-20250318005512_global_step_100
like
0
Follow
LM-Parallel
4
Safetensors
llama
Model card
Files
Files and versions
Community
Train
main
grpo_llama-hs-v3_bs64_rollout5-lr1e-5-seq-weighted-kl0.01-20250318005512_global_step_100
/
tokenizer.json
longlian
Upload folder using huggingface_hub
a1f9881
verified
12 days ago
raw
Copy download link
history
contribute
delete
3.62 MB
File too large to display, you can
check the raw version
instead.