Sequential Modeling Enables Scalable Learning for Large Vision Models Paper • 2312.00785 • Published Dec 1, 2023 • 1
EgoPet: Egomotion and Interaction Data from an Animal's Perspective Paper • 2404.09991 • Published Apr 15, 2024
Forgotten Polygons: Multimodal Large Language Models are Shape-Blind Paper • 2502.15969 • Published Feb 21 • 2
Linguini: A benchmark for language-agnostic linguistic reasoning Paper • 2409.12126 • Published Sep 18, 2024
LCFO: Long Context and Long Form Output Dataset and Benchmarking Paper • 2412.08268 • Published Dec 11, 2024
Large Concept Models: Language Modeling in a Sentence Representation Space Paper • 2412.08821 • Published Dec 11, 2024 • 14
BOUQuET: dataset, Benchmark and Open initiative for Universal Quality Evaluation in Translation Paper • 2502.04314 • Published Feb 6
MLLM-as-a-Judge for Image Safety without Human Labeling Paper • 2501.00192 • Published Dec 31, 2024 • 30
Observation-Centric SORT: Rethinking SORT for Robust Multi-Object Tracking Paper • 2203.14360 • Published Mar 27, 2022
Adaptive Decoding via Latent Preference Optimization Paper • 2411.09661 • Published Nov 14, 2024 • 10