Shamik
AI & ML interests
CV, NLP, Speech, Python
Organizations
Audio Models
Benchmark
- OCRBenchv2 Leaderboard 🏆 (Running, 43 likes): Display OCRBench leaderboard for text recognition models
- Vidore Leaderboard 🥇 (Running, 188 likes): Explore visual document retrieval model rankings
- Open ASR Leaderboard 🏆 (Running on CPU Upgrade, Featured, 1.15k likes): Display and request speech recognition model benchmarks
Interesting Spaces
- Attention Visualization 🔥 (Running, Featured, 192 likes): Vision Transformer Attention Visualization
- Open NotebookLM 🎙 (Runtime error, 142 likes): Generate a podcast to discuss the topic of your choice!
- Multimodal OCR 🍍 (Running on Zero, MCP, 379 likes): nanonets ocr2 / olmocr / qwen2vl ocr / aya vision / rolmocr
- Multimodal OCR2 💻 (Running on Zero, MCP, Featured, 139 likes): nanonets ocr / smoldocling / monkey ocr / typhoon ocr
Text Generation Models
Vision Models
Audio Models
MCP Servers
Benchmark
Multi modal Document Parser
Interesting Spaces