- 
	
	
	587
Scaling test-time compute
📈Implement test-time compute scaling for math problems
 - 
	
	
	1.14k
FineWeb: decanting the web for the finest text data at scale
🍷Generate high-quality text data for LLMs using FineWeb
 - 
	
	
	3.42k
The Ultra-Scale Playbook
🌌The ultimate guide to training LLM on large GPU Clusters
 - 
	
	
	197
FineVision: Open Data is All You Need
📝A new open-source dataset for training VLMs
 
ldwang
ldwang
		AI & ML interests
LLM, MLLM, Infra
		Recent Activity
						liked
								a model
							
						1 day ago
						
					
						
						
						
						BlinkDL/rwkv7-g1
						
						liked
								a model
							
						2 days ago
						
					
						
						
						
						moonshotai/Kimi-Linear-48B-A3B-Instruct
						
						upvoted 
								a
								collection
							
						4 days ago
						
					Emu3.5
						Organizations
MiscIndustry
			
			
	
	MiscR1
			
			
	
	MiscDatasets
			
			
	
	MiscSpaces
			
			
	
	- 
	
	
	Running587587
Scaling test-time compute
📈Implement test-time compute scaling for math problems
 - 
	
	
	Running1.14k1.14k
FineWeb: decanting the web for the finest text data at scale
🍷Generate high-quality text data for LLMs using FineWeb
 - 
	
	
	Running3.42k3.42k
The Ultra-Scale Playbook
🌌The ultimate guide to training LLM on large GPU Clusters
 - 
	
	
	Running197197
FineVision: Open Data is All You Need
📝A new open-source dataset for training VLMs
 
MiscAgentic
			
			
	
	MiscIndustry
			
			
	
	MiscKernel
			
			
	
	MiscR1
			
			
	
	MiscModels
			
			
	
	MiscDatasets
			
			
	
	MiscTools
			Misc tools for llm & vlm.