Demystifying Reinforcement Learning in Agentic Reasoning
			
	
	AI & ML interests
LLM, Diffusion, and Beyond
Recent Activity
	View all activity
	
			Organization Card
		
		Welcome to AI Research Lab at Princeton University
Contact Us: Interested in learning more or getting involved? Reach out to us at [email protected] or visit our website at https://github.com/Gen-Verse.
			models
			18
		
			
	
	
	
	
	 
				Gen-Verse/Qwen3-4B-RA-SFT
		
				4B
			• 
	
				Updated
					
				
				• 
					
					33
				
	
				• 
					
					2
				
 
				Gen-Verse/Qwen2.5-7B-RA-SFT
		
				8B
			• 
	
				Updated
					
				
				• 
					
					17
				
	
				
				
 
				Gen-Verse/DemyAgent-4B
		
				4B
			• 
	
				Updated
					
				
				• 
					
					1.3k
				
	
				• 
					
					8
				
 
				Gen-Verse/TraDo-8B-Thinking
		
				8B
			• 
	
				Updated
					
				
				• 
					
					934
				
	
				• 
					
					12
				
 
				Gen-Verse/TraDo-4B-Instruct
		
				4B
			• 
	
				Updated
					
				
				• 
					
					72
				
	
				• 
					
					9
				
 
				Gen-Verse/TraDo-8B-Instruct
		
				8B
			• 
	
				Updated
					
				
				• 
					
					301
				
	
				• 
					
					10
				
 
				Gen-Verse/MMaDA-8B-MixCoT
			Any-to-Any
			• 
		
				8B
			• 
	
				Updated
					
				
				• 
					
					7.8k
				
	
				• 
					
					27
				
 
				Gen-Verse/ReasonFlux-PRM-7B
			Text Generation
			• 
		
				7B
			• 
	
				Updated
					
				
				• 
					
					713
				
	
				• 
					
					8
				
 
				Gen-Verse/ReasonFlux-PRM-Qwen-2.5-7B
			Text Generation
			• 
		
				8B
			• 
	
				Updated
					
				
				• 
					
					4
				• 
			
	
				• 
					
					3
				
 
				Gen-Verse/ReasonFlux-PRM-1.5B
			Text Generation
			• 
		
				2B
			• 
	
				Updated
					
				
				• 
					
					2
				
	
				• 
					
					3
				
			datasets
			27
		
			
	
	
	
	
	Gen-Verse/Open-AgentRL-30K
			Viewer
			• 
	
				Updated
					
				• 
			
			30.1k
	
				• 
					
					212
				
				• 
					
					2
				
Gen-Verse/Open-AgentRL-SFT-3K
			Viewer
			• 
	
				Updated
					
				• 
			
			3k
	
				• 
					
					168
				
				• 
					
					2
				
Gen-Verse/Open-AgentRL-Eval
			Viewer
			• 
	
				Updated
					
				• 
			
			433
	
				• 
					
					46
				
				
				
Gen-Verse/PrimeIntellect
			Viewer
			• 
	
				Updated
					
				• 
			
			5.95k
	
				• 
					
					85
				
				
				
Gen-Verse/demon_openr1math
			Viewer
			• 
	
				Updated
					
				• 
			
			2k
	
				• 
					
					77
				
				
				
Gen-Verse/LiveBench
			Viewer
			• 
	
				Updated
					
				• 
			
			128
	
				• 
					
					78
				
				
				
Gen-Verse/MATH_train
			Viewer
			• 
	
				Updated
					
				• 
			
			8.52k
	
				• 
					
					88
				
				
				
Gen-Verse/LiveCodeBench
			Preview
			• 
	
				Updated
					
				
	
				• 
					
					79
				
				
				
Gen-Verse/AIME2024
			Viewer
			• 
	
				Updated
					
				• 
			
			30
	
				• 
					
					60
				
				
				
Gen-Verse/GSM8K
			Viewer
			• 
	
				Updated
					
				• 
			
			1.32k
	
				• 
					
					76