Qwen/Qwen3-VL-235B-A22B-Thinking Image-Text-to-Text • 236B • Updated Nov 26, 2025 • 39.2k • • 357
Running Featured 151 DINOv3 Web 🦖 151 Visualize rich, dense image features locally in your browser
The Prism Hypothesis: Harmonizing Semantic and Pixel Representations via Unified Autoencoding Paper • 2512.19693 • Published 15 days ago • 62
Next-Embedding Prediction Makes Strong Vision Learners Paper • 2512.16922 • Published 19 days ago • 83
Fast and Accurate Causal Parallel Decoding using Jacobi Forcing Paper • 2512.14681 • Published 21 days ago • 39
LongVie 2: Multimodal Controllable Ultra-Long Video World Model Paper • 2512.13604 • Published 23 days ago • 72
sensenova/SenseNova-SI-1.1-Qwen2.5-VL-7B Image-Text-to-Text • 8B • Updated 29 days ago • 1.3k • 3
sensenova/SenseNova-SI-1.1-Qwen2.5-VL-3B Image-Text-to-Text • 4B • Updated 29 days ago • 1.27k • 2
sensenova/SenseNova-SI-1.2-InternVL3-8B Image-Text-to-Text • 8B • Updated 28 days ago • 3.35k • 9
sensenova/SenseNova-SI-1.1-Qwen3-VL-8B Image-Text-to-Text • 9B • Updated 29 days ago • 1.42k • 5
sensenova/SenseNova-SI-1.2-InternVL3-8B Image-Text-to-Text • 8B • Updated 28 days ago • 3.35k • 9
sensenova/SenseNova-SI-1.1-Qwen2.5-VL-3B Image-Text-to-Text • 4B • Updated 29 days ago • 1.27k • 2