R-PRM: Reasoning-Driven Process Reward Modeling
Shuaijie She
kevinpro
AI & ML interests
Reasoning, Chain of Thoughts, Alignment, Factual Consistency, Summarization
Recent Activity
updated
a Space
3 days ago
kevinpro/uniapi
published
a Space
3 days ago
kevinpro/uniapi
published
a Space
6 days ago
kevinpro/geminitest