Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
LUO MENG's picture
2 7 6

LUO MENG

Eureka-Leo
21world's profile picture
·
https://eurekaleo.github.io/

AI & ML interests

None yet

Organizations

National University of Singapore's profile picture Path to Multimodal Generalist's profile picture FakeNews's profile picture

upvoted an article 4 months ago
view article
Article

Vision Language Models (Better, Faster, Stronger)

By merve and 4 others •
May 12
• 522
upvoted a paper 4 months ago

On Path to Multimodal Generalist: General-Level and General-Bench

Paper • 2505.04620 • Published May 7 • 83
upvoted 2 papers 5 months ago

VistaDPO: Video Hierarchical Spatial-Temporal Direct Preference Optimization for Large Video Models

Paper • 2504.13122 • Published Apr 17 • 21

Any2Caption:Interpreting Any Condition to Caption for Controllable Video Generation

Paper • 2503.24379 • Published Mar 31 • 77
upvoted 2 papers 6 months ago

A Survey on Benchmarks of Multimodal Large Language Models

Paper • 2408.08632 • Published Aug 16, 2024 • 2

PAD: Personalized Alignment at Decoding-Time

Paper • 2410.04070 • Published Oct 5, 2024 • 1
upvoted a paper about 1 year ago

PanoSent: A Panoptic Sextuple Extraction Benchmark for Multimodal Conversational Aspect-based Sentiment Analysis

Paper • 2408.09481 • Published Aug 18, 2024 • 1
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs