Join the conversation

Join the community of Machine Learners and AI enthusiasts.

Sign Up
mkurmanΒ 
posted an update Feb 13
Post
2042
I've been working on something cool: a GRPO with an LLM evaluator that can also perform SFT on the feedback data - if you want. Check it out 😊

Any 🌟are more than welcome πŸ€—

https://github.com/mkurman/grpo-llm-evaluator
In this post