Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Posts
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
Jihan Yang's picture
3 5 25

Jihan Yang

jihanyang
omerkartli's profile picture khang119966's profile picture 21world's profile picture
·
https://jihanyang.github.io/
  • jihanyang13
  • jihanyang

AI & ML interests

Computer Vision, Multimodality, Embodied AI

Recent Activity

updated a dataset 15 days ago
jihanyang/tomato
published a dataset 15 days ago
jihanyang/tomato
liked a dataset 27 days ago
allenai/pixmo-points
View all activity

Organizations

NYU VisionX's profile picture Amasia NYU's profile picture Space's profile picture

jihanyang's activity

upvoted a paper 3 months ago

SFT Memorizes, RL Generalizes: A Comparative Study of Foundation Model Post-training

Paper • 2501.17161 • Published Jan 28 • 121
upvoted a paper 5 months ago

Thinking in Space: How Multimodal Large Language Models See, Remember, and Recall Spaces

Paper • 2412.14171 • Published Dec 18, 2024 • 24
upvoted a paper 11 months ago

Cambrian-1: A Fully Open, Vision-Centric Exploration of Multimodal LLMs

Paper • 2406.16860 • Published Jun 24, 2024 • 61
upvoted a paper 12 months ago

What matters when building vision-language models?

Paper • 2405.02246 • Published May 3, 2024 • 104
upvoted a paper over 1 year ago

V-IRL: Grounding Virtual Intelligence in Real Life

Paper • 2402.03310 • Published Feb 5, 2024 • 16
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs