Yu's picture

Yu PRO

Shoubin

·

AI & ML interests

None yet

Recent Activity

updated a model 5 days ago

Shoubin/AVIC-Qwen2.5-VL-7B-policy

published a model 5 days ago

Shoubin/AVIC-Qwen2.5-VL-7B-policy

published a model about 2 months ago

View all activity

Organizations

upvoted a paper 2 months ago

Ego2Web: A Web Agent Benchmark Grounded in Egocentric Videos

Paper • 2603.22529 • Published Mar 23 • 7

upvoted a paper 3 months ago

VisionCoach: Reinforcing Grounded Video Reasoning via Visual-Perception Prompting

Paper • 2603.14659 • Published Mar 15 • 6

upvoted a paper 4 months ago

When and How Much to Imagine: Adaptive Test-Time Scaling with World Models for Visual Spatial Reasoning

Paper • 2602.08236 • Published Feb 9 • 9

upvoted a paper 7 months ago

Agent0: Unleashing Self-Evolving Agents from Zero Data via Tool-Integrated Reasoning

Paper • 2511.16043 • Published Nov 20, 2025 • 110

upvoted a paper 11 months ago

Video-RTS: Rethinking Reinforcement Learning and Test-Time Scaling for Efficient and Enhanced Video Reasoning

Paper • 2507.06485 • Published Jul 9, 2025 • 5

upvoted 2 papers 12 months ago

4D-LRM: Large Space-Time Reconstruction Model From and To Any View at Any Time

Paper • 2506.18890 • Published Jun 23, 2025 • 6

MEXA: Towards General Multimodal Reasoning with Dynamic Multi-Expert Aggregation

Paper • 2506.17113 • Published Jun 20, 2025 • 5

upvoted 2 papers about 1 year ago

Executable Functional Abstractions: Inferring Generative Programs for Advanced Math Problems

Paper • 2504.09763 • Published Apr 14, 2025 • 12

Training-free Guidance in Text-to-Video Generation via Multimodal Planning and Structured Noise Initialization

Paper • 2504.08641 • Published Apr 11, 2025 • 6

upvoted a collection over 1 year ago

Molmo

Artifacts for open multimodal language models. • 5 items • Updated Dec 23, 2025 • 310