AI & ML interests
None defined yet.
Recent Activity
-
Qwen/Qwen3-235B-A22B-Thinking-2507-FP8
Text Generation • 235B • Updated • 30.5k • 60 -
Qwen/Qwen3-235B-A22B-Thinking-2507
Text Generation • 235B • Updated • 67.4k • • 358 -
Qwen/Qwen3-235B-A22B-Instruct-2507-FP8
Text Generation • 235B • Updated • 164k • 125 -
Qwen/Qwen3-235B-A22B-Instruct-2507
Text Generation • 235B • Updated • 103k • • 678
End-to-End Omni (text, audio, image, video, and natural speech interaction) model based Qwen2.5
The long-context version of Qwen2.5, supporting 1M-token context lengths
Qwen with Questions
Qwen2.5 language models, including pretrained and instruction-tuned models of 7 sizes, including 0.5B, 1.5B, 3B, 7B, 14B, 32B, and 72B.
Qwen2 language models, including pretrained and instruction-tuned models of 5 sizes, including 0.5B, 1.5B, 7B, 57B-A14B, and 72B.
Qwen
Audio-language model series based on Qwen2
-
Qwen/Qwen3-Next-80B-A3B-Instruct
Text Generation • 81B • Updated • 1.02M • • 726 -
Qwen/Qwen3-Next-80B-A3B-Thinking
Text Generation • 81B • Updated • 464k • • 396 -
Qwen/Qwen3-Next-80B-A3B-Instruct-FP8
Text Generation • Updated • 2.89k • 31 -
Qwen/Qwen3-Next-80B-A3B-Thinking-FP8
Text Generation • Updated • 3.77k • 13
-
767
Qwen3 Coder WebDev
🌍Generate web application code from descriptions
-
Qwen/Qwen3-Coder-480B-A35B-Instruct
Text Generation • 480B • Updated • 258k • • 1.2k -
Qwen/Qwen3-Coder-480B-A35B-Instruct-FP8
Text Generation • 480B • Updated • 129k • • 120 -
Qwen/Qwen3-Coder-30B-A3B-Instruct
Text Generation • 31B • Updated • 386k • • 633
Vision-language model series based on Qwen2.5
-
153
Qwen2.5 VL 32B Instruct Demo
🏃Interact with Qwen2.5-VL-32B-Instruct for text and image/video responses
-
Qwen2.5-VL Technical Report
Paper • 2502.13923 • Published • 204 -
Qwen/Qwen2.5-VL-32B-Instruct
Image-Text-to-Text • 33B • Updated • 668k • • 446 -
Qwen/Qwen2.5-VL-72B-Instruct
Image-Text-to-Text • 73B • Updated • 665k • • 545
QVQ: Qwen models for visual reasoning
Code-specific model series based on Qwen2.5
Math-specific model series based on Qwen2.5
Qwen1.5 is the improved version of Qwen, the large language model series developed by Alibaba Cloud.
Math-specific model series based on Qwen2
Vision-language model series based on Qwen2
-
Qwen/Qwen3-Next-80B-A3B-Instruct
Text Generation • 81B • Updated • 1.02M • • 726 -
Qwen/Qwen3-Next-80B-A3B-Thinking
Text Generation • 81B • Updated • 464k • • 396 -
Qwen/Qwen3-Next-80B-A3B-Instruct-FP8
Text Generation • Updated • 2.89k • 31 -
Qwen/Qwen3-Next-80B-A3B-Thinking-FP8
Text Generation • Updated • 3.77k • 13
-
767
Qwen3 Coder WebDev
🌍Generate web application code from descriptions
-
Qwen/Qwen3-Coder-480B-A35B-Instruct
Text Generation • 480B • Updated • 258k • • 1.2k -
Qwen/Qwen3-Coder-480B-A35B-Instruct-FP8
Text Generation • 480B • Updated • 129k • • 120 -
Qwen/Qwen3-Coder-30B-A3B-Instruct
Text Generation • 31B • Updated • 386k • • 633
-
Qwen/Qwen3-235B-A22B-Thinking-2507-FP8
Text Generation • 235B • Updated • 30.5k • 60 -
Qwen/Qwen3-235B-A22B-Thinking-2507
Text Generation • 235B • Updated • 67.4k • • 358 -
Qwen/Qwen3-235B-A22B-Instruct-2507-FP8
Text Generation • 235B • Updated • 164k • 125 -
Qwen/Qwen3-235B-A22B-Instruct-2507
Text Generation • 235B • Updated • 103k • • 678
End-to-End Omni (text, audio, image, video, and natural speech interaction) model based Qwen2.5
Vision-language model series based on Qwen2.5
-
153
Qwen2.5 VL 32B Instruct Demo
🏃Interact with Qwen2.5-VL-32B-Instruct for text and image/video responses
-
Qwen2.5-VL Technical Report
Paper • 2502.13923 • Published • 204 -
Qwen/Qwen2.5-VL-32B-Instruct
Image-Text-to-Text • 33B • Updated • 668k • • 446 -
Qwen/Qwen2.5-VL-72B-Instruct
Image-Text-to-Text • 73B • Updated • 665k • • 545
The long-context version of Qwen2.5, supporting 1M-token context lengths
QVQ: Qwen models for visual reasoning
Qwen with Questions
Code-specific model series based on Qwen2.5
Qwen2.5 language models, including pretrained and instruction-tuned models of 7 sizes, including 0.5B, 1.5B, 3B, 7B, 14B, 32B, and 72B.
Math-specific model series based on Qwen2.5
Qwen2 language models, including pretrained and instruction-tuned models of 5 sizes, including 0.5B, 1.5B, 7B, 57B-A14B, and 72B.
Qwen1.5 is the improved version of Qwen, the large language model series developed by Alibaba Cloud.
Qwen
Math-specific model series based on Qwen2
Audio-language model series based on Qwen2
Vision-language model series based on Qwen2