Wang Chengyao

wcy1122

https://wcy1122.github.io/

AI & ML interests

None yet

Recent Activity

upvoted a paper 13 days ago

VisionThink: Smart and Efficient Vision Language Model via Reinforcement Learning

upvoted a paper 20 days ago

Scaling RL to Long Videos

liked a model 6 months ago

Qwen/Qwen2.5-VL-7B-Instruct

authored a paper 8 months ago

Lyra: An Efficient and Speech-Centric Framework for Omni-Cognition

Paper • 2412.09501 • Published Dec 12, 2024 • 49

authored 2 papers over 1 year ago

Mini-Gemini: Mining the Potential of Multi-modality Vision Language Models

Paper • 2403.18814 • Published Mar 27, 2024 • 48

LLaMA-VID: An Image is Worth 2 Tokens in Large Language Models

Paper • 2311.17043 • Published Nov 28, 2023