Smol-reason

Sweaterdog 's Collections

updated Apr 16

My first ever usage of GRPO fine tuning techniques, information learned from this model will be used on future Andy models.