Eagle 2.5: Boosting Long-Context Post-Training for Frontier Vision-Language Models Paper • 2504.15271 • Published 5 days ago • 62
Kimi-VL-A3B Collection Moonshot's efficient MoE VLMs, exceptional on agent, long-context, and thinking • 6 items • Updated 15 days ago • 62
Efficient Inference for Large Reasoning Models: A Survey Paper • 2503.23077 • Published 29 days ago • 46
Efficient Inference for Large Reasoning Models: A Survey Paper • 2503.23077 • Published 29 days ago • 46
Exploring Hallucination of Large Multimodal Models in Video Understanding: Benchmark, Analysis and Mitigation Paper • 2503.19622 • Published Mar 25 • 30 • 4
Exploring Hallucination of Large Multimodal Models in Video Understanding: Benchmark, Analysis and Mitigation Paper • 2503.19622 • Published Mar 25 • 30
Exploring Hallucination of Large Multimodal Models in Video Understanding: Benchmark, Analysis and Mitigation Paper • 2503.19622 • Published Mar 25 • 30 • 4
AdaMoE: Token-Adaptive Routing with Null Experts for Mixture-of-Experts Language Models Paper • 2406.13233 • Published Jun 19, 2024 • 1
StruEdit: Structured Outputs Enable the Fast and Accurate Knowledge Editing for Large Language Models Paper • 2409.10132 • Published Sep 16, 2024
Spider 2.0: Evaluating Language Models on Real-World Enterprise Text-to-SQL Workflows Paper • 2411.07763 • Published Nov 12, 2024 • 2
Exploring the Universal Vulnerability of Prompt-based Learning Paradigm Paper • 2204.05239 • Published Apr 11, 2022
Spider2-V: How Far Are Multimodal Agents From Automating Data Science and Engineering Workflows? Paper • 2407.10956 • Published Jul 15, 2024 • 7