Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
Liangyu Chen's picture
3 8 7

Liangyu Chen

liangyuch
cangjiu's profile picture 21world's profile picture yehors-cv's profile picture
·
https://cliangyu.com/
  • cliangyu_
  • cliangyu

AI & ML interests

Multimodal AI, Computer vision

Organizations

CVPR Demo Track's profile picture ICML 2022's profile picture ECCV 2022's profile picture NAACL 2022's profile picture MMLab@NTU's profile picture Aurora-M/MDEL's profile picture Aurora-M's profile picture  ML Foundations Development's profile picture

upvoted a paper 4 months ago

SmolVLM: Redefining small and efficient multimodal models

Paper • 2504.05299 • Published Apr 7 • 196
upvoted a paper 5 months ago

Video Action Differencing

Paper • 2503.07860 • Published Mar 10 • 34
upvoted a paper 7 months ago

BIOMEDICA: An Open Biomedical Image-Caption Archive, Dataset, and Vision-Language Models Derived from Scientific Literature

Paper • 2501.07171 • Published Jan 13 • 56
upvoted 5 papers almost 2 years ago

Large Language Models are Visual Reasoning Coordinators

Paper • 2310.15166 • Published Oct 23, 2023 • 2

Making Your First Choice: To Address Cold Start Problem in Vision Active Learning

Paper • 2210.02442 • Published Oct 5, 2022 • 1

MIMIC-IT: Multi-Modal In-Context Instruction Tuning

Paper • 2306.05425 • Published Jun 8, 2023 • 11

Otter: A Multi-Modal Model with In-Context Instruction Tuning

Paper • 2305.03726 • Published May 5, 2023 • 6

Deep Geometrized Cartoon Line Inbetweening

Paper • 2309.16643 • Published Sep 28, 2023 • 25
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs