Critique-out-Loud Reward Models

Paper: https://arxiv.org/abs/2408.11791
Code: https://github.com/zankner/CLoud

Models:
  ankner/Llama3-8B-CLoud-RM
  ankner/Llama3-8B-Classic-RM
  ankner/Llama3-70B-CLoud-RM
  ankner/Llama3-70B-Classic-RM