Qwen2-VL Collection Vision-language model series based on Qwen2 • 16 items • Updated 6 days ago • 222
meta-llama/Llama-4-Maverick-17B-128E-Instruct Image-Text-to-Text • 402B • Updated May 22 • 35.9k • • 384
view post Post 3360 Kimi-K2 is now available on the hub🔥🚀This is a trillion-parameter MoE model focused on long context, code, reasoning, and agentic behavior. moonshotai/kimi-k2-6871243b990f2af5ba60617d✨ Base & Instruct ✨ 1T total / 32B active - Modified MIT License✨ 128K context length✨ Muon optimizer for stable trillion-scale training See translation 1 reply · 🔥 15 15 + Reply