Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
alfredcs
/
torchrun-gemma-3-12b-grpo-firstaid-merged
like
0
Image-Text-to-Text
Transformers
Safetensors
gemma3
trl
grpo
GRPO
Reasoning-Course
conversational
text-generation-inference
arxiv:
1910.09700
Model card
Files
Files and versions
Community
Train
Deploy
Use this model
main
torchrun-gemma-3-12b-grpo-firstaid-merged
Ctrl+K
Ctrl+K
2 contributors
History:
4 commits
Alfred Shen
Update
717734f
21 days ago
.gitattributes
Safe
1.57 kB
Upload tokenizer
21 days ago
README.md
Safe
5.21 kB
Upload Gemma3ForConditionalGeneration
22 days ago
added_tokens.json
Safe
35 Bytes
Upload tokenizer
21 days ago
chat_template.jinja
Safe
1.53 kB
Update
21 days ago
chat_template.json
Safe
1.62 kB
Update
21 days ago
config.json
Safe
1.61 kB
Upload Gemma3ForConditionalGeneration
22 days ago
generation_config.json
Safe
215 Bytes
Update
21 days ago
model-00001-of-00005.safetensors
Safe
4.98 GB
LFS
Upload Gemma3ForConditionalGeneration
22 days ago
model-00002-of-00005.safetensors
Safe
4.93 GB
LFS
Upload Gemma3ForConditionalGeneration
22 days ago
model-00003-of-00005.safetensors
Safe
4.93 GB
LFS
Upload Gemma3ForConditionalGeneration
22 days ago
model-00004-of-00005.safetensors
Safe
4.93 GB
LFS
Upload Gemma3ForConditionalGeneration
22 days ago
model-00005-of-00005.safetensors
Safe
4.6 GB
LFS
Upload Gemma3ForConditionalGeneration
22 days ago
model.safetensors.index.json
Safe
109 kB
Upload Gemma3ForConditionalGeneration
22 days ago
preprocessor_config.json
Safe
570 Bytes
Update
21 days ago
processor_config.json
Safe
70 Bytes
Update
21 days ago
special_tokens_map.json
Safe
662 Bytes
Upload tokenizer
21 days ago
tokenizer.json
Safe
33.4 MB
LFS
Upload tokenizer
21 days ago
tokenizer.model
Safe
4.69 MB
LFS
Upload tokenizer
21 days ago
tokenizer_config.json
Safe
1.16 MB
Upload tokenizer
21 days ago