Baifeng Shi

bfshi
https://bfshi.github.io
  • baifeng_shi
  • bfshi

AI & ML interests

computer vision

Recent Activity

updated a Space 21 days ago
bfshi/VILA-HD-demo
updated a dataset about 1 month ago
bfshi/video_r1
published a dataset about 1 month ago
bfshi/video_r1

Organizations

  • UC Berkeley
  • Efficient-Large-Model

authored a paper 3 months ago

Scaling Vision Pre-Training to 4K Resolution

Paper • 2503.19903 • Published Mar 25 • 42
authored a paper 7 months ago

NVILA: Efficient Frontier Visual Language Models

Paper • 2412.04468 • Published Dec 5, 2024 • 60
authored 3 papers over 1 year ago

When Do We Not Need Larger Vision Models?

Paper • 2403.13043 • Published Mar 19, 2024 • 27

Humanoid Locomotion as Next Token Prediction

Paper • 2402.19469 • Published Feb 29, 2024 • 29

Rethinking Patch Dependence for Masked Autoencoders

Paper • 2401.14391 • Published Jan 25, 2024 • 27
authored a paper about 2 years ago

Robot Learning with Sensorimotor Pre-training

Paper • 2306.10007 • Published Jun 16, 2023 • 13