Sergio Paniego PRO
AI & ML interests
Recent Activity
Organizations
-
Running4141
comparevlms
πCompare Vision Language Models
-
Running on Zero6363
OCR Time Machine
πExtract text from images and XML files using OCR models
-
Running2525
Compare Docvqa Models
π¦Compare different visual question answering
-
Running on CPU Upgrade2323
Compare Clip Siglip
πCompare strong zero-shot image classification models
-
Qwen/Qwen2.5-Omni-7B
Any-to-Any β’ 11B β’ Updated β’ 230k β’ 1.81k -
Running355355
Qwen2.5 Omni 7B Demo
πGenerate text and speech from text, audio, images, and videos
-
Qwen2.5-Omni Technical Report
Paper β’ 2503.20215 β’ Published β’ 167 -
openbmb/MiniCPM-o-2_6
Any-to-Any β’ 9B β’ Updated β’ 100k β’ 1.25k
-
Running4141
comparevlms
πCompare Vision Language Models
-
Runtime error44
Gemma3 License Plate Detection
πGemma 3 for license plate detection
-
Running on Zero137137
Gemma 3n E4B It
β‘Generate text responses to images, videos, and audio
-
Running on Zero3535
Moondream3
π’Image and video tasks with moondream3.
-
Running4141
comparevlms
πCompare Vision Language Models
-
Running on Zero6363
OCR Time Machine
πExtract text from images and XML files using OCR models
-
Running2525
Compare Docvqa Models
π¦Compare different visual question answering
-
Running on CPU Upgrade2323
Compare Clip Siglip
πCompare strong zero-shot image classification models
-
Running4141
comparevlms
πCompare Vision Language Models
-
Runtime error44
Gemma3 License Plate Detection
πGemma 3 for license plate detection
-
Running on Zero137137
Gemma 3n E4B It
β‘Generate text responses to images, videos, and audio
-
Running on Zero3535
Moondream3
π’Image and video tasks with moondream3.
-
Qwen/Qwen2.5-Omni-7B
Any-to-Any β’ 11B β’ Updated β’ 230k β’ 1.81k -
Running355355
Qwen2.5 Omni 7B Demo
πGenerate text and speech from text, audio, images, and videos
-
Qwen2.5-Omni Technical Report
Paper β’ 2503.20215 β’ Published β’ 167 -
openbmb/MiniCPM-o-2_6
Any-to-Any β’ 9B β’ Updated β’ 100k β’ 1.25k