Baifeng Shi PRO

bfshi

https://bfshi.github.io

AI & ML interests

computer vision

Recent Activity

new activity 19 days ago

nvidia/NVILA-8B-HD-Video:Update README.md

liked a Space 24 days ago

fffiloni/NVILA-HD-Video-AutoGaze

authored a paper 24 days ago

Attend Before Attention: Efficient and Scalable Video Understanding via Autoregressive Gazing

Paper • 2603.12254 • Published Mar 12 • 22

submitted a paper to Daily Papers 25 days ago

Attend Before Attention: Efficient and Scalable Video Understanding via Autoregressive Gazing

Paper • 2603.12254 • Published Mar 12 • 22

authored a paper 9 months ago

Scaling RL to Long Videos

Paper • 2507.07966 • Published Jul 10, 2025 • 162

authored a paper about 1 year ago

Scaling Vision Pre-Training to 4K Resolution

Paper • 2503.19903 • Published Mar 25, 2025 • 41

authored a paper over 1 year ago

NVILA: Efficient Frontier Visual Language Models

Paper • 2412.04468 • Published Dec 5, 2024 • 60

authored 3 papers about 2 years ago

When Do We Not Need Larger Vision Models?

Paper • 2403.13043 • Published Mar 19, 2024 • 26

Humanoid Locomotion as Next Token Prediction

Paper • 2402.19469 • Published Feb 29, 2024 • 29

Rethinking Patch Dependence for Masked Autoencoders

Paper • 2401.14391 • Published Jan 25, 2024 • 26

authored a paper almost 3 years ago

Robot Learning with Sensorimotor Pre-training

Paper • 2306.10007 • Published Jun 16, 2023 • 14