@mkurman on Hugging Face: "I've been working on something cool: a GRPO with an LLM evaluator that can…"

Hugging Face

Join the conversation

Join the community of Machine Learners and AI enthusiasts.

Back to feed

mkurman

posted an update Feb 13, 2025

Post

2054

I've been working on something cool: a GRPO with an LLM evaluator that can also perform SFT on the feedback data - if you want. Check it out 😊

Any 🌟are more than welcome 🤗

https://github.com/mkurman/grpo-llm-evaluator

In this post

mkurman Mariusz Kurman