Vision-Language-Action Models in Minecraft.
- CraftJarvis/JarvisVLA-Qwen2-VL-7B
  Image-Text-to-Text
- JARVIS-VLA: Post-Training Large-Scale Vision Language Models to Play Visual Games with Keyboards and Mouse
  Paper • 2503.16365
- Minecraft VLM Leaderboard
  Display and filter LLM leaderboard for Minecraft models
- CraftJarvis/minecraft-vla-sft