LeeSXian's picture

4

LeeSXian

LEE0v0

·

AI & ML interests

None yet

Recent Activity

upvoted a collection about 2 months ago

upvoted a paper 7 months ago

Unicorn: Text-Only Data Synthesis for Vision Language Model Training

upvoted a paper 7 months ago

DropletVideo: A Dataset and Approach to Explore Integral Spatio-Temporal Consistent Video Generation

View all activity

Organizations

upvoted a collection about 2 months ago

EO-Robotics

EmbodiedOneVision is a unified framework for multimodal embodied reasoning and robot control, featuring interleaved vision-text-action pretraining. • 5 items • Updated Sep 16 • 8

upvoted 2 papers 7 months ago

Unicorn: Text-Only Data Synthesis for Vision Language Model Training

Paper • 2503.22655 • Published Mar 28 • 39

DropletVideo: A Dataset and Approach to Explore Integral Spatio-Temporal Consistent Video Generation

Paper • 2503.06053 • Published Mar 8 • 138

upvoted a paper over 1 year ago

RULE: Reliable Multimodal RAG for Factuality in Medical Vision Language Models

Paper • 2407.05131 • Published Jul 6, 2024 • 27

updated a dataset over 1 year ago

fnlp/hh-rlhf-strength-cleaned

Viewer • Updated Jan 31, 2024 • 168k • 62 • 23