Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
LeeSXian's picture
4

LeeSXian

LEE0v0
21world's profile picture
·

AI & ML interests

None yet

Recent Activity

upvoted a collection about 2 months ago
EO-Robotics
upvoted a paper 7 months ago
Unicorn: Text-Only Data Synthesis for Vision Language Model Training
upvoted a paper 7 months ago
DropletVideo: A Dataset and Approach to Explore Integral Spatio-Temporal Consistent Video Generation
View all activity

Organizations

OpenMOSS (SII, Fudan NLP)'s profile picture

upvoted a collection about 2 months ago

EO-Robotics

Collection
EmbodiedOneVision is a unified framework for multimodal embodied reasoning and robot control, featuring interleaved vision-text-action pretraining. • 5 items • Updated Sep 16 • 8
upvoted 2 papers 7 months ago

Unicorn: Text-Only Data Synthesis for Vision Language Model Training

Paper • 2503.22655 • Published Mar 28 • 39

DropletVideo: A Dataset and Approach to Explore Integral Spatio-Temporal Consistent Video Generation

Paper • 2503.06053 • Published Mar 8 • 138
upvoted a paper over 1 year ago

RULE: Reliable Multimodal RAG for Factuality in Medical Vision Language Models

Paper • 2407.05131 • Published Jul 6, 2024 • 27
updated a dataset over 1 year ago

fnlp/hh-rlhf-strength-cleaned

Viewer • Updated Jan 31, 2024 • 168k • 62 • 23
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs