Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up

vkasera
/
qwen-2.5-0.5b-r1-countdown-phil

Text Generation
Transformers
TensorBoard
Safetensors
qwen2
Generated from Trainer
trl
grpo
conversational
text-generation-inference
Model card Files Files and versions
xet
Metrics Training metrics Community
qwen-2.5-0.5b-r1-countdown-phil / runs
518 kB
  • 1 contributor
History: 18 commits
vkasera's picture
vkasera
Training in progress, step 450
0b03911 verified about 1 month ago
  • Oct05_15-27-25_192-222-57-219
    Training in progress, step 25 about 1 month ago
  • Oct05_15-35-38_192-222-57-219
    Training in progress, step 25 about 1 month ago
  • Oct05_15-38-31_192-222-57-219
    Training in progress, step 25 about 1 month ago
  • Oct05_15-44-27_192-222-57-219
    Training in progress, step 25 about 1 month ago
  • Oct05_15-49-11_192-222-57-219
    Training in progress, step 450 about 1 month ago