Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up

nghind
/
grpo-llama-3-1-8b-math-ep3

Transformers
TensorBoard
Safetensors
Generated from Trainer
trl
grpo
Model card Files Files and versions Metrics Training metrics Community
grpo-llama-3-1-8b-math-ep3 / runs
Ctrl+K
Ctrl+K
  • 1 contributor
History: 4 commits
nghind's picture
nghind
Model save
355d9d3 verified 4 months ago
  • Feb20_08-20-21_koa-dgxa-b11-u17
    Training in progress, epoch 1 4 months ago
  • Feb20_08-29-49_koa-dgxa-b11-u17
    Training in progress, epoch 1 4 months ago
  • Feb20_08-39-07_koa-dgxa-b11-u17
    Training in progress, epoch 1 4 months ago
  • Feb20_08-49-07_koa-dgxa-b11-u17
    Training in progress, epoch 1 4 months ago
  • Feb20_08-53-00_koa-dgxa-b11-u17
    Training in progress, epoch 1 4 months ago
  • Feb20_08-59-40_koa-dgxa-b11-u17
    Training in progress, epoch 1 4 months ago
  • Feb20_09-04-28_koa-dgxa-b11-u17
    Training in progress, epoch 1 4 months ago
  • Feb20_09-22-20_koa-dgxa-b11-u17
    Training in progress, epoch 1 4 months ago
  • Feb20_09-35-40_koa-dgxa-b11-u17
    Training in progress, epoch 1 4 months ago
  • Feb20_09-57-31_koa-dgxa-b11-u17
    Training in progress, epoch 1 4 months ago
  • Feb20_10-03-26_koa-dgxa-b11-u17
    Training in progress, epoch 1 4 months ago
  • Feb20_10-08-19_koa-dgxa-b11-u17
    Model save 4 months ago