FINGU-AI's picture
Trained with Unsloth
ca6ea6e verified
metadata
license: mit
tags:
  - unsloth
  - trl
  - grpo