Prathyusha101/qwen2-0.5b-REINFORCE-no-baseline-kl-disabled

Text Generation
Transformers
TensorBoard
Safetensors
qwen2
Generated from Trainer
rloo
trl
conversational
text-generation-inference
qwen2-0.5b-REINFORCE-no-baseline-kl-disabled / runs
82.6 kB
  • 1 contributor
History: 1 commit
Prathyusha101
Training in progress, step 500
ca38bbc verified about 2 months ago
  • Sep04_15-25-19_234f257957b9
    Training in progress, step 500 about 2 months ago