HaoLi

OzymandisLi

https://scholar.google.com/citations?user=y4va91AAAAAJ&hl=en

HowardLi1984

AI & ML interests

Multi-modal Learning, AI4Science

Recent Activity

upvoted a paper about 2 months ago

GPT-ImgEval: A Comprehensive Benchmark for Diagnosing GPT4o in Image Generation

upvoted a paper 4 months ago

VideoLLaMA 3: Frontier Multimodal Foundation Models for Image and Video Understanding

new activity 4 months ago

PharMolix/BioMedGPT-10B: How to use the model files?

OzymandisLi's activity

upvoted a paper about 2 months ago

GPT-ImgEval: A Comprehensive Benchmark for Diagnosing GPT4o in Image Generation

Paper • 2504.02782 • Published Apr 3 • 56

upvoted a paper 4 months ago

VideoLLaMA 3: Frontier Multimodal Foundation Models for Image and Video Understanding

Paper • 2501.13106 • Published Jan 22 • 91

upvoted 2 papers 7 months ago

Breaking the Memory Barrier: Near Infinite Batch Size Scaling for Contrastive Loss

Paper • 2410.17243 • Published Oct 22, 2024 • 95

The Curse of Multi-Modalities: Evaluating Hallucinations of Large Multimodal Models across Language, Visual, and Audio

Paper • 2410.12787 • Published Oct 16, 2024 • 32