Interpretable and Reliable Detection of AI-Generated Images via Grounded Reasoning in MLLMs Paper • 2506.07045 • Published 5 days ago • 7
Qwen3 Embedding: Advancing Text Embedding and Reranking Through Foundation Models Paper • 2506.05176 • Published 8 days ago • 55
SmolVLA: A Vision-Language-Action Model for Affordable and Efficient Robotics Paper • 2506.01844 • Published 11 days ago • 90
Kimi-VL-A3B Collection Moonshot's efficient MoE VLMs, exceptional on agent, long-context, and thinking • 6 items • Updated Apr 12 • 65
Wan: Open and Advanced Large-Scale Video Generative Models Paper • 2503.20314 • Published Mar 26 • 52
Cosmos-Reason1: From Physical Common Sense To Embodied Reasoning Paper • 2503.15558 • Published Mar 18 • 50
GoT: Unleashing Reasoning Capability of Multimodal Large Language Model for Visual Generation and Editing Paper • 2503.10639 • Published Mar 13 • 50
PipeOffload: Improving Scalability of Pipeline Parallelism with Memory Optimization Paper • 2503.01328 • Published Mar 3 • 16