Go to Zero: Towards Zero-shot Motion Generation with Million-scale Data Paper • 2507.07095 • Published about 13 hours ago • 18
StreamVLN: Streaming Vision-and-Language Navigation via SlowFast Context Modeling Paper • 2507.05240 • Published 3 days ago • 36
How to Train Your LLM Web Agent: A Statistical Diagnosis Paper • 2507.04103 • Published 5 days ago • 42
MedGen: Unlocking Medical Video Generation by Scaling Granularly-annotated Medical Videos Paper • 2507.05675 • Published 2 days ago • 25
RLVER: Reinforcement Learning with Verifiable Emotion Rewards for Empathetic Agents Paper • 2507.03112 • Published 7 days ago • 28
CriticLean: Critic-Guided Reinforcement Learning for Mathematical Formalization Paper • 2507.06181 • Published 1 day ago • 36
StreamDiT: Real-Time Streaming Text-to-Video Generation Paper • 2507.03745 • Published 6 days ago • 23
DreamVLA: A Vision-Language-Action Model Dreamed with Comprehensive World Knowledge Paper • 2507.04447 • Published 4 days ago • 35
Agent KB: Leveraging Cross-Domain Experience for Agentic Problem Solving Paper • 2507.06229 • Published 1 day ago • 59
Tora2: Motion and Appearance Customized Diffusion Transformer for Multi-Entity Video Generation Paper • 2507.05963 • Published 1 day ago • 8
Is Diversity All You Need for Scalable Robotic Manipulation? Paper • 2507.06219 • Published 1 day ago • 18
OmniPart: Part-Aware 3D Generation with Semantic Decoupling and Structural Cohesion Paper • 2507.06165 • Published 1 day ago • 47
Enhance Your Models in 5 Minutes with the Hugging Face Kernel Hub Article • By drbh and 6 others • 28 days ago • 109
WebSailor: Navigating Super-human Reasoning for Web Agent Paper • 2507.02592 • Published 7 days ago • 86
LangScene-X: Reconstruct Generalizable 3D Language-Embedded Scenes with TriMap Video Diffusion Paper • 2507.02813 • Published 7 days ago • 55
Skywork-Reward-V2: Scaling Preference Data Curation via Human-AI Synergy Paper • 2507.01352 • Published 8 days ago • 49