Video-MMMU: Evaluating Knowledge Acquisition from Multi-Discipline Professional Videos Paper • 2501.13826 • Published Jan 2025 • 24
SAR3D: Autoregressive 3D Object Generation and Understanding via Multi-scale 3D VQVAE Paper • 2411.16856 • Published Nov 25, 2024 • 12
Fantasia3D: Disentangling Geometry and Appearance for High-quality Text-to-3D Content Creation Paper • 2303.13873 • Published Mar 24, 2023
ComboVerse: Compositional 3D Assets Creation Using Spatially-Aware Diffusion Guidance Paper • 2403.12409 • Published Mar 19, 2024 • 10
MvDrag3D: Drag-based Creative 3D Editing via Multi-view Generation-Reconstruction Priors Paper • 2410.16272 • Published Oct 21, 2024
Adaptive Image Quality Assessment via Teaching Large Multimodal Model to Compare Paper • 2405.19298 • Published May 29, 2024
VideoAutoArena: An Automated Arena for Evaluating Large Multimodal Models in Video Analysis through User Simulation Paper • 2411.13281 • Published Nov 20, 2024 • 20
Multi-expert Prompting Improves Reliability, Safety, and Usefulness of Large Language Models Paper • 2411.00492 • Published Nov 1, 2024 • 6
Aria: An Open Multimodal Native Mixture-of-Experts Model Paper • 2410.05993 • Published Oct 8, 2024 • 108
Jailbreaking ChatGPT via Prompt Engineering: An Empirical Study Paper • 2305.13860 • Published May 23, 2023
Prompt Injection attack against LLM-integrated Applications Paper • 2306.05499 • Published Jun 8, 2023 • 1
Efficient Detection of Toxic Prompts in Large Language Models Paper • 2408.11727 • Published Aug 21, 2024 • 13
Facing the Music: Tackling Singing Voice Separation in Cinematic Audio Source Separation Paper • 2408.03588 • Published Aug 7, 2024 • 7
Latte: Cross-framework Python Package for Evaluation of Latent-Based Generative Models Paper • 2112.10638 • Published Dec 20, 2021
ARAUS: A Large-Scale Dataset and Baseline Models of Affective Responses to Augmented Urban Soundscapes Paper • 2207.01078 • Published Jul 3, 2022
LongVideoBench: A Benchmark for Long-context Interleaved Video-Language Understanding Paper • 2407.15754 • Published Jul 22, 2024 • 20
LMMs-Eval: Reality Check on the Evaluation of Large Multimodal Models Paper • 2407.12772 • Published Jul 17, 2024 • 34