Source code available at https://github.com/phhusson/llm-rl/blob/main/grpo-tldr.py