GenRM: Generative Reward Models

community

https://www.synthlabs.ai/research/generative-reward-models

synth_labs

SynthLabsAI

AI & ML interests

None defined yet.

Recent Activity

nlile authored a paper 3 days ago

Big-Math: A Large-Scale, High-Quality Math Dataset for Reinforcement Learning in Language Models

nlile authored a paper 6 days ago

Cognitive Behaviors that Enable Self-Improving Reasoners, or, Four Habits of Highly Effective STaRs

nlile authored a paper about 2 months ago

Towards System 2 Reasoning in LLMs: Learning How to Think With Meta Chain-of-Though

View all activity

GenRM's activity

nlile

authored a paper 3 days ago

Big-Math: A Large-Scale, High-Quality Math Dataset for Reinforcement Learning in Language Models

Paper • 2502.17387 • Published 13 days ago • 5

nlile

authored a paper 6 days ago

Cognitive Behaviors that Enable Self-Improving Reasoners, or, Four Habits of Highly Effective STaRs

Paper • 2503.01307 • Published 7 days ago • 30

nlile

authored a paper about 2 months ago

Towards System 2 Reasoning in LLMs: Learning How to Think With Meta Chain-of-Though

Paper • 2501.04682 • Published Jan 8 • 91

nlile

authored a paper 5 months ago

Generative Reward Models

Paper • 2410.12832 • Published Oct 2, 2024 • 6

nlile

authored a paper 8 months ago

PERSONA: A Reproducible Testbed for Pluralistic Alignment

Paper • 2407.17387 • Published Jul 24, 2024 • 20

nlile

authored a paper 12 months ago

Suppressing Pink Elephants with Direct Principle Feedback

Paper • 2402.07896 • Published Feb 12, 2024 • 11