Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
LM-Parallel
/
grpo_llama-hs-v3_bs64_rollout5-lr1e-5-seq-weighted-kl0.01-20250318005512_global_step_100
like
0
Follow
LM-Parallel
4
Safetensors
llama
Model card
Files
Files and versions
Community
Train
main
grpo_llama-hs-v3_bs64_rollout5-lr1e-5-seq-weighted-kl0.01-20250318005512_global_step_100
1 contributor
History:
2 commits
longlian
Upload folder using huggingface_hub
a1f9881
verified
9 days ago
.gitattributes
Safe
1.52 kB
initial commit
9 days ago
config.json
1.05 kB
Upload folder using huggingface_hub
9 days ago
generation_config.json
Safe
132 Bytes
Upload folder using huggingface_hub
9 days ago
model.safetensors
587 MB
LFS
Upload folder using huggingface_hub
9 days ago
special_tokens_map.json
Safe
552 Bytes
Upload folder using huggingface_hub
9 days ago
tokenizer.json
3.62 MB
Upload folder using huggingface_hub
9 days ago
tokenizer.model
Safe
500 kB
LFS
Upload folder using huggingface_hub
9 days ago
tokenizer_config.json
Safe
979 Bytes
Upload folder using huggingface_hub
9 days ago