tranbaninh/Qwen2.5-0.5B-Instruct-Gensyn-Swarm-hoarse_sedate_marmot
Text Generation
Transformers
Safetensors
qwen2
Generated from Trainer
rl-swarm
grpo
gensyn
I am hoarse sedate marmot
unsloth
trl
genrl-swarm
I am hoarse_sedate_marmot
conversational
text-generation-inference
arxiv:2402.03300
Commit History (branch: main)
Message | Commit | Status | Author | Committed on
End of training | 7d8d865 | verified | tranbaninh | Apr 7
End of training | 4fd3f79 | verified | tranbaninh | Apr 7
End of training | 1cadb7f | verified | tranbaninh | Apr 7
End of training | ce3245a | verified | tranbaninh | Apr 7
End of training | 939aa15 | verified | tranbaninh | Apr 7
End of training | 4198ea2 | verified | tranbaninh | Apr 6
End of training | 0ce62d9 | verified | tranbaninh | Apr 6
End of training | 95eb367 | verified | tranbaninh | Apr 6
End of training | fd3fd90 | verified | tranbaninh | Apr 6
End of training | 55432c0 | verified | tranbaninh | Apr 6
End of training | f5e076c | verified | tranbaninh | Apr 6
End of training | 3990c78 | verified | tranbaninh | Apr 6
End of training | 3cc812c | verified | tranbaninh | Apr 6
End of training | 534eb4f | verified | tranbaninh | Apr 6
End of training | 55ea9f6 | verified | tranbaninh | Apr 6
End of training | 8244711 | verified | tranbaninh | Apr 5
End of training | 03d1de0 | verified | tranbaninh | Apr 5
End of training | 6984830 | verified | tranbaninh | Apr 5
initial commit | c7ee44f | verified | tranbaninh | Apr 5