Qwen/Qwen2.5-VL-7B-Instruct Image-Text-to-Text • 8B • Updated Apr 6, 2025 • 2.44M • • 1.41k
pyannote/speaker-diarization-3.1 Automatic Speech Recognition • Updated May 10, 2024 • 15.6M • 1.4k
Salesforce/blip-image-captioning-large Image-to-Text • 0.5B • Updated Feb 3, 2025 • 1.34M • 1.44k