OpenVision: A Fully-Open, Cost-Effective Family of Advanced Vision Encoders for Multimodal Learning Paper • 2505.04601 • Published May 7 • 28
AHELM: A Holistic Evaluation of Audio-Language Models Paper • 2508.21376 • Published about 1 month ago • 9
GPT-IMAGE-EDIT-1.5M: A Million-Scale, GPT-Generated Image Dataset Paper • 2507.21033 • Published Jul 28 • 20