Models and datasets for Elastic Reset (NeurIPS 2023), code at https://github.com/mnoukhov/elastic-reset
			
	
	Michael N
mnoukhov
		AI & ML interests
Representation learning for functional language
		Recent Activity
						updated
								a dataset
							
						28 days ago
						
					
						
						
						
						mnoukhov/MATH_3000_final_filter
						
						published
								a dataset
							
						28 days ago
						
					
						
						
						
						mnoukhov/MATH_3000_final_filter
						Organizations
			models
			39
		
			
	
	
	
	
	mnoukhov/test
		
	
				Updated
					
				
				
				
	
				
				
mnoukhov/SmolLM2-135M-tldr-sft
			Text Generation
			• 
		
				0.1B
			• 
	
				Updated
					
				
				• 
					
					1
				
	
				
				
mnoukhov/SmolLM2-360M-tldr-sft
			Text Generation
			• 
		
				0.4B
			• 
	
				Updated
					
				
				
				
	
				
				
mnoukhov/SmolLM2-135M-Instruct_tldr-sft
			Text Generation
			• 
		
				0.1B
			• 
	
				Updated
					
				
				• 
					
					2
				
	
				
				
mnoukhov/SmolLM2-135M-Instruct_tldr-rm
			Text Classification
			• 
		
				0.1B
			• 
	
				Updated
					
				
				
				
	
				
				
mnoukhov/pythia2.8b-rm-tldr6.9b
			Text Classification
			• 
		
				3B
			• 
	
				Updated
					
				
				
				
	
				
				
mnoukhov/pythia2.8b-sft-tldr
			Text Generation
			• 
		
				3B
			• 
	
				Updated
					
				
				
				
	
				
				
mnoukhov/pythia160m-sft-tldr
			Text Generation
			• 
		
				0.2B
			• 
	
				Updated
					
				
				• 
					
					1
				
	
				
				
mnoukhov/pythia160m-rm-tldr6.9b
			Text Classification
			• 
		
				0.1B
			• 
	
				Updated
					
				
				
				
	
				
				
mnoukhov/pythia1b-rm-tldr6.9b
			Text Classification
			• 
		
				0.9B
			• 
	
				Updated
					
				
				
				
	
				
				
			datasets
			54
		
			
	
	
	
	
	mnoukhov/MATH_3000_final_filter
			Viewer
			• 
	
				Updated
					
				• 
			
			2.3k
	
				• 
					
					21
				
				
				
mnoukhov/deepscaler_20k_medhard_nolatex_rlvr
			Viewer
			• 
	
				Updated
					
				• 
			
			19.5k
	
				• 
					
					2
				
				
				
mnoukhov/aime2024-25-rlvr
			Viewer
			• 
	
				Updated
					
				• 
			
			60
	
				• 
					
					18
				
				
				
mnoukhov/DAPO-Math-14k-Processed-RLVR
			Viewer
			• 
	
				Updated
					
				• 
			
			14.1k
	
				• 
					
					6
				
				
				
mnoukhov/rlvr_countdown
			Viewer
			• 
	
				Updated
					
				• 
			
			490k
	
				• 
					
					3
				
				
				
mnoukhov/summarize_from_feedback_oai_preprocessing_1706381144_relabel_pythia6.9b
			Viewer
			• 
	
				Updated
					
				• 
			
			177k
	
				• 
					
					8
				
				
				
mnoukhov/summarize_from_feedback_oai_preprocessing_1706381144_relabel2_llama8b
			Viewer
			• 
	
				Updated
					
				• 
			
			92.1k
	
				• 
					
					9
				
				
				
mnoukhov/summarize_from_feedback_oai_preprocessing_1706381144_relabel_llama8b
			Viewer
			• 
	
				Updated
					
				• 
			
			176k
	
				• 
					
					2
				
				
				
mnoukhov/summarize_from_feedback_tldr3_unlabelled_vllm_pythia410m-dpo-tldr_relabel_pythia1b
			Viewer
			• 
	
				Updated
					
				• 
			
			107k
	
				• 
					
					6
				
				
				
mnoukhov/summarize_from_feedback_tldr3_unlabelled_vllm_pythia410m-dpo-tldr
			Viewer
			• 
	
				Updated
					
				• 
			
			107k
	
				• 
					
					9