R-PRM: Reasoning-Driven Process Reward Modeling
Shuaijie She
kevinpro
AI & ML interests
Reasoning, Chain of Thoughts, Alignment, Factual Consistency, Summarization
Recent Activity
updated
a Space
about 6 hours ago
kevinpro/R-PRM-Demo
new activity
about 9 hours ago
nvidia/OpenReasoning-Nemotron-7B:Has the model undergone the RL process or just SFT on R1-0558 reasoning trajectory?
new activity
2 days ago
kevinpro/R-PRM-Demo:Apply for community grant: Academic project (gpu)