Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
Hanan Gani's picture
5

Hanan Gani

hanangani
Nadas31's profile picture 21world's profile picture
·
https://hananshafi.github.io/

AI & ML interests

Deep Learning

Recent Activity

authored a paper 18 days ago
VideoMolmo: Spatio-Temporal Grounding Meets Pointing
upvoted a paper 19 days ago
VideoMolmo: Spatio-Temporal Grounding Meets Pointing
upvoted a paper 4 months ago
LLMVoX: Autoregressive Streaming Text-to-Speech Model for Any LLM
View all activity

Organizations

Mohamed Bin Zayed University of Artificial Intelligence's profile picture

upvoted a paper 19 days ago

VideoMolmo: Spatio-Temporal Grounding Meets Pointing

Paper • 2506.05336 • Published Jun 5 • 10
upvoted a paper 4 months ago

LLMVoX: Autoregressive Streaming Text-to-Speech Model for Any LLM

Paper • 2503.04724 • Published Mar 6 • 69
upvoted 3 papers 8 months ago

VideoGLaMM: A Large Multimodal Model for Pixel-Level Visual Grounding in Videos

Paper • 2411.04923 • Published Nov 7, 2024 • 24

VANE-Bench: Video Anomaly Evaluation Benchmark for Conversational LMMs

Paper • 2406.10326 • Published Jun 14, 2024 • 1

LLM Blueprint: Enabling Text-to-Image Generation with Complex and Detailed Prompts

Paper • 2310.10640 • Published Oct 16, 2023 • 2
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs