Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
HaoLi's picture
1 4 1

HaoLi

OzymandisLi
21world's profile picture
·
https://scholar.google.com/citations?user=y4va91AAAAAJ&hl=en
  • HowardLi1984

AI & ML interests

Multi-modal Learning, AI4Science

Recent Activity

upvoted a paper about 2 months ago
GPT-ImgEval: A Comprehensive Benchmark for Diagnosing GPT4o in Image Generation
upvoted a paper 4 months ago
VideoLLaMA 3: Frontier Multimodal Foundation Models for Image and Video Understanding
new activity 4 months ago
PharMolix/BioMedGPT-10B:How to use the model files?
View all activity

Organizations

Peking University's profile picture OpenMol's profile picture

OzymandisLi's activity

upvoted a paper about 2 months ago

GPT-ImgEval: A Comprehensive Benchmark for Diagnosing GPT4o in Image Generation

Paper • 2504.02782 • Published Apr 3 • 56
upvoted a paper 4 months ago

VideoLLaMA 3: Frontier Multimodal Foundation Models for Image and Video Understanding

Paper • 2501.13106 • Published Jan 22 • 91
upvoted 2 papers 7 months ago

Breaking the Memory Barrier: Near Infinite Batch Size Scaling for Contrastive Loss

Paper • 2410.17243 • Published Oct 22, 2024 • 95

The Curse of Multi-Modalities: Evaluating Hallucinations of Large Multimodal Models across Language, Visual, and Audio

Paper • 2410.12787 • Published Oct 16, 2024 • 32
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs