MrAnton
/

SmolVLM-256M-Instruct-carrots-and-plates-GRPO-warmup_grpo_carrot_plate_dist_task

Generated from Trainer

Model card Files Files and versions

SmolVLM-256M-Instruct-carrots-and-plates-GRPO-warmup_grpo_carrot_plate_dist_task

16.9 MB

1 contributor

History: 19 commits

MrAnton's picture

Training in progress, step 10

bef041c verified about 1 month ago