Models NousResearch/Nous-Hermes-2-Mixtral-8x7B-DPO Text Generation • Updated Apr 30, 2024 • 3.77k • • 423 Self-Rewarding Language Models Paper • 2401.10020 • Published Jan 18, 2024 • 146