Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Posts
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
josang1204 's Collections
llm

llm

updated Mar 17
Upvote
-

  • ZeroBench: An Impossible Visual Benchmark for Contemporary Large Multimodal Models

    Paper • 2502.09696 • Published Feb 13 • 44

  • MM-RLHF: The Next Step Forward in Multimodal LLM Alignment

    Paper • 2502.10391 • Published Feb 14 • 35

  • Autellix: An Efficient Serving Engine for LLM Agents as General Programs

    Paper • 2502.13965 • Published Feb 19 • 19

  • SuperGPQA: Scaling LLM Evaluation across 285 Graduate Disciplines

    Paper • 2502.14739 • Published Feb 20 • 103

  • li-lab/MMLU-ProX

    Viewer • Updated 2 days ago • 343k • 1.05k • 6
Upvote
-
  • Collection guide
  • Browse collections
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs