SynthLight: Portrait Relighting with Diffusion Model by Learning to Re-render Synthetic Faces Paper • 2501.09756 • Published 1 day ago • 13 • 2
FAST: Efficient Action Tokenization for Vision-Language-Action Models Paper • 2501.09747 • Published 1 day ago • 12 • 2
AnyStory: Towards Unified Single and Multiple Subject Personalization in Text-to-Image Generation Paper • 2501.09503 • Published 2 days ago • 6 • 2
CaPa: Carve-n-Paint Synthesis for Efficient 4K Textured Mesh Generation Paper • 2501.09433 • Published 2 days ago • 10 • 3
Learnings from Scaling Visual Tokenizers for Reconstruction and Generation Paper • 2501.09755 • Published 1 day ago • 20 • 3
Inference-Time Scaling for Diffusion Models beyond Scaling Denoising Steps Paper • 2501.09732 • Published 1 day ago • 35 • 2
Towards Large Reasoning Models: A Survey of Reinforced Reasoning with Large Language Models Paper • 2501.09686 • Published 1 day ago • 14 • 2
RepVideo: Rethinking Cross-Layer Representation for Video Generation Paper • 2501.08994 • Published 3 days ago • 13 • 2
XMusic: Towards a Generalized and Controllable Symbolic Music Generation Framework Paper • 2501.08809 • Published 3 days ago • 9 • 2
Tarsier2: Advancing Large Vision-Language Models from Detailed Video Description to Comprehensive Video Understanding Paper • 2501.07888 • Published 4 days ago • 12 • 2
Diffusion Adversarial Post-Training for One-Step Video Generation Paper • 2501.08316 • Published 4 days ago • 29 • 4
ChemAgent: Self-updating Library in Large Language Models Improves Chemical Reasoning Paper • 2501.06590 • Published 7 days ago • 7 • 2
MinMo: A Multimodal Large Language Model for Seamless Voice Interaction Paper • 2501.06282 • Published 8 days ago • 32 • 5
O1 Replication Journey -- Part 3: Inference-time Scaling for Medical Reasoning Paper • 2501.06458 • Published 7 days ago • 29 • 2
Multiagent Finetuning: Self Improvement with Diverse Reasoning Chains Paper • 2501.05707 • Published 8 days ago • 18 • 2
Towards System 2 Reasoning in LLMs: Learning How to Think With Meta Chain-of-Though Paper • 2501.04682 • Published 10 days ago • 83 • 2
Agent Laboratory: Using LLM Agents as Research Assistants Paper • 2501.04227 • Published 10 days ago • 77 • 6
Diffusion as Shader: 3D-aware Video Diffusion for Versatile Video Generation Control Paper • 2501.03847 • Published 11 days ago • 22 • 2