R-PRM: Reasoning-Driven Process Reward Modeling

Shuaijie She (kevinpro)
AI & ML interests: Reasoning, Chain of Thoughts, Alignment, Factual Consistency, Summarization



