MedGemma Release Collection Collection of Gemma 3 variants for performance on medical text and image comprehension to accelerate building healthcare-based AI applications. • 9 items • Updated about 17 hours ago • 389
FAST: Efficient Action Tokenization for Vision-Language-Action Models Paper • 2501.09747 • Published Jan 16, 2025 • 28
Runtime error 45 Leaderboard: Physical Reasoning from Video 🏃 45 Submit model evaluations and view leaderboard results
Running on Zero Featured 96 Qwen Image Edit Inpaint ✒ 96 inapint with Qwen Image Edit for super precise edits
Reasoning Datasets Collection Reasoning datasets that are trending 🔥 • 10 items • Updated Jan 3, 2025 • 26
Cosmos-Reason2 Collection Cosmos Reason 2 is an open, customizable, reasoning vision language model (VLM) for physical AI and robotics • 14 items • Updated 1 day ago • 15
view article Article Generalist Robot Policy Evaluation in Simulation with NVIDIA Isaac Lab-Arena and LeRobot 9 days ago • 18