GeometryZero: Improving Geometry Solving for LLM with Group Contrastive Policy Optimization • Paper • 2506.07160 • Published 4 days ago • 3
Autoregressive Semantic Visual Reconstruction Helps VLMs Understand Better • Paper • 2506.09040 • Published 2 days ago • 30
Sailing AI by the Stars: A Survey of Learning from Rewards in Post-Training and Test-Time Scaling of Large Language Models • Paper • 2505.02686 • Published May 5 • 15
MagicFace: Training-free Universal-Style Human Image Customized Synthesis • Paper • 2408.07433 • Published Aug 14, 2024 • 1
Unified Multimodal Chain-of-Thought Reward Model through Reinforcement Fine-Tuning • Paper • 2505.03318 • Published May 6 • 93
CoMP: Continual Multimodal Pre-training for Vision Foundation Models • Paper • 2503.18931 • Published Mar 24 • 30
LiFT-HRA • Collection • LiFT: Leveraging Human Feedback for Text-to-Video Model Alignment • 3 items • Updated Mar 27 • 2
Unified Reward Model for Multimodal Understanding and Generation • Paper • 2503.05236 • Published Mar 7 • 124
LiFT-Critic • Collection • LiFT: Leveraging Human Feedback for Text-to-Video Model Alignment • 5 items • Updated Dec 22, 2024 • 3
LiFT: Leveraging Human Feedback for Text-to-Video Model Alignment • Paper • 2412.04814 • Published Dec 6, 2024 • 49