microsoft/Phi-4-multimodal-instruct Automatic Speech Recognition β’ Updated 4 days ago β’ 767k β’ 1.23k
Tulu 3 Models Collection All models released with Tulu 3 -- state of the art open post-training recipes. β’ 11 items β’ Updated 12 days ago β’ 96
LLaVa-NeXT-Video Collection LLaVa-NeXT-Video extends LLaVa-NeXT for video understanding. β’ 5 items β’ Updated Jun 10, 2024 β’ 9
Running 543 543 Vision Arena (Testing VLMs side-by-side) πΌ Analyze images to detect and label objects