ujjwal1996/Qwen3_4B_GRPO_updated
Tags: Transformers, Safetensors, English, text-generation-inference, unsloth, qwen3, trl
License: apache-2.0
main
1 contributor
History: 4 commits
Latest commit: 552e4f1 (verified) by ujjwal1996, "Upload model trained with Unsloth", 16 days ago
File                       Size       LFS  Scan  Last commit                            Updated
.gitattributes             1.57 kB    -    Safe  Upload model trained with Unsloth      16 days ago
README.md                  565 Bytes  -    -     Upload README.md with huggingface_hub  16 days ago
adapter_config.json        851 Bytes  -    -     Upload model trained with Unsloth      16 days ago
adapter_model.safetensors  264 MB     LFS  -     Upload model trained with Unsloth      16 days ago
added_tokens.json          707 Bytes  -    Safe  Upload model trained with Unsloth      16 days ago
merges.txt                 1.67 MB    -    Safe  Upload model trained with Unsloth      16 days ago
special_tokens_map.json    617 Bytes  -    Safe  Upload model trained with Unsloth      16 days ago
tokenizer.json             11.4 MB    LFS  Safe  Upload model trained with Unsloth      16 days ago
tokenizer_config.json      6.11 kB    -    -     Upload model trained with Unsloth      16 days ago
vocab.json                 2.78 MB    -    Safe  Upload model trained with Unsloth      16 days ago