Xiyao Wang's picture

5 21 6

Xiyao Wang

russwang

·

AI & ML interests

None yet

Recent Activity

updated a Space about 5 hours ago

lmms-lab/README

liked a model 4 days ago

lmms-lab/LLaVA-Critic-R1-7B-Plus-Qwen

upvoted a paper 5 days ago

OpenVision 2: A Family of Generative Pretrained Visual Encoders for Multimodal Learning

View all activity

Organizations

commented 2 papers 3 months ago

ViCrit: A Verifiable Reinforcement Learning Proxy Task for Visual Perception in VLMs

Paper • 2506.10128 • Published Jun 11 • 23 •

MORSE-500: A Programmatically Controllable Video Benchmark to Stress-Test Multimodal Reasoning

Paper • 2506.05523 • Published Jun 5 • 34 •

commented a paper 5 months ago

SoTA with Less: MCTS-Guided Sample Selection for Data-Efficient Visual Reasoning Self-Improvement

Paper • 2504.07934 • Published Apr 10 • 20 •

commented a paper 9 months ago

Scaling Inference-Time Search with Vision Value Model for Improved Visual Comprehension

Paper • 2412.03704 • Published Dec 4, 2024 • 7 •