OmniSpatial: Towards Comprehensive Spatial Reasoning Benchmark for Vision Language Models Paper • 2506.03135 • Published 11 days ago • 37
view article Article NVIDIA's GTC 2025 Announcement for Physical AI Developers: New Open Models and Datasets By mingyuliutw and 4 others • Mar 18 • 41
view article Article Preference Optimization for Vision Language Models By qgallouedec and 3 others • Jul 10, 2024 • 76
Vid2World: Crafting Video Diffusion Models to Interactive World Models Paper • 2505.14357 • Published 26 days ago • 26
view article Article Training and Finetuning Reranker Models with Sentence Transformers v4 By tomaarsen • Mar 26 • 135
MutaGReP: Execution-Free Repository-Grounded Plan Search for Code-Use Paper • 2502.15872 • Published Feb 21 • 5