8 24 326

Gaurang Bharti PRO

gbharti

https://gaurangbharti.netlify.app/

AI & ML interests

GPTs, Computer Vision, NLP

Recent Activity

liked a model 22 days ago

apple/Sharp

liked a model 22 days ago

DiffSynth-Studio/Qwen-Image-i2L

liked a model 29 days ago

depth-anything/DA3-BASE

View all activity

Organizations

liked 2 models 22 days ago

apple/Sharp

Image-to-3D • Updated 21 days ago • 5.63k • 318

DiffSynth-Studio/Qwen-Image-i2L

Updated 24 days ago • 241

liked a model 29 days ago

depth-anything/DA3-BASE

Image-to-3D • 0.1B • Updated Nov 15, 2025 • 18.6k • 21

New activity in gbharti/finance-alpaca about 2 months ago

Add LICENSE file

🤝 1

#6 opened about 2 months ago by

jewittje

liked a Space about 2 months ago

Depth Anything 3

🏢

339

Create detailed depth maps from images using Depth Anything 3

liked a dataset 2 months ago

nvidia/PhysicalAI-Robotics-GR00T-X-Embodiment-Sim

Updated 30 days ago • 853k • 183

upvoted a paper 3 months ago

OmniVideoBench: Towards Audio-Visual Understanding Evaluation for Omni MLLMs

Paper • 2510.10689 • Published Oct 12, 2025 • 46

New activity in gbharti/finance-alpaca 3 months ago

good

#5 opened 3 months ago by

Jackrong

liked a Space 5 months ago

OmniAvatar

🐨

273

Generate podcast and tiktok style video avatars

liked a dataset 6 months ago

Vchitect/ShotBench

Viewer • Updated Jul 1, 2025 • 3.57k • 194 • 11

liked a model 6 months ago

Vchitect/ShotVL-7B

Image-Text-to-Text • 8B • Updated Sep 19, 2025 • 326 • 15

upvoted a paper 6 months ago

VideoPrism: A Foundational Visual Encoder for Video Understanding

Paper • 2402.13217 • Published Feb 20, 2024 • 38

liked a model 6 months ago

google/videoprism-base-f16r288

Video Classification • Updated Jul 29, 2025 • 157k • 92

upvoted a paper 6 months ago

Vision as a Dialect: Unifying Visual Understanding and Generation via Text-Aligned Representations

Paper • 2506.18898 • Published Jun 23, 2025 • 33

liked a model 7 months ago

ByteDance/LatentSync-1.6

Updated Jun 12, 2025 • 23.6k • 56

liked a dataset 7 months ago

opencompass/MMBench-Video

Preview • Updated Oct 9, 2024 • 372 • 9

liked a Space 8 months ago

Keysync Demo

📈

Generate synchronized video from audio and video inputs

liked a model 8 months ago

chancharikm/qwen2.5-vl-7b-cam-motion

Video-Text-to-Text • 8B • Updated Sep 19, 2025 • 220 • 17

upvoted 2 papers 8 months ago

Towards Understanding Camera Motions in Any Video

Paper • 2504.15376 • Published Apr 21, 2025 • 155

NORA: A Small Open-Sourced Generalist Vision Language Action Model for Embodied Tasks

Paper • 2504.19854 • Published Apr 28, 2025 • 7