How Well Does GPT-4o Understand Vision? Evaluating Multimodal Foundation Models on Standard Computer Vision Tasks Paper • 2507.01955 • Published 8 days ago • 29
re-skill/whisper-large-v3-turbo-tj Automatic Speech Recognition • 0.8B • Updated 15 days ago • 13 • 2
muhtasham/whisper-non-verbal-new-aug-low-lr-fixed Automatic Speech Recognition • 0.8B • Updated 4 days ago • 55
muhtasham/whisper-non-verbal-new-aug-low-lr-fixed Automatic Speech Recognition • 0.8B • Updated 4 days ago • 55
muhtasham/whisper-non-verbal-new-aug-low-lr-elise Automatic Speech Recognition • 0.8B • Updated 5 days ago • 11
muhtasham/whisper-non-verbal-new-aug-low-lr Automatic Speech Recognition • 0.8B • Updated 4 days ago • 19
muhtasham/whisper-non-verbal-new-aug-low-lr Automatic Speech Recognition • 0.8B • Updated 4 days ago • 19
muhtasham/whisper-non-verbal-new-aug-low-lr-elise Automatic Speech Recognition • 0.8B • Updated 5 days ago • 11