
❄️January 2025 - Open releases from the Chinese community
- Any-to-Any • Updated • 90.6k • 3.42k
deepseek-ai/Janus-Pro-1B
Any-to-Any • Updated • 26k • 445tencent/Hunyuan3D-2
Image-to-3D • Updated • 295k • 1.53ktencent/Hunyuan-7B-Instruct
Text Generation • Updated • 221 • 49
ByteDance/Sa2VA-4B
Image-Text-to-Text • Updated • 1.57k • • 76Note A unified model for dense grounded understanding of images & videos.
ByteDance-Seed/UI-TARS-72B-DPO
Image-Text-to-Text • Updated • 6k • 132
deepseek-ai/DeepSeek-R1
Text Generation • Updated • 639k • • 12.4kNote 660B reasoning models with MIT license
deepseek-ai/DeepSeek-R1-Zero
Text Generation • Updated • 3.55k • 921
MiniMaxAI/MiniMax-VL-01
Image-Text-to-Text • Updated • 26.4k • 266Note A non transformer based ( ViT-MLP-LLM framework) VLM
MiniMaxAI/MiniMax-Text-01
Text Generation • Updated • 13.1k • 608Note 456B LLM with 1M tokens training context
Qwen/Qwen2.5-Math-PRM-7B
Text Classification • Updated • 16.2k • 70Note Math model
Qwen/Qwen2.5-14B-Instruct-1M
Text Generation • Updated • 33.1k • • 309
openbmb/MiniCPM-o-2_6
Any-to-Any • Updated • 163k • 1.17kNote End-side multimodal LLM that supports real time conversation and video understanding.
ICTNLP/llava-mini-llama-3.1-8b
Image-Text-to-Text • Updated • 1.06k • 53
BlinkDL/rwkv-7-world
Text Generation • Updated • 100Note RNN+Transfomers
HKUSTAudio/Llasa-3B
Text-to-Speech • Updated • 3.79k • • 505Note TTS
DAMO-NLP-SG/VideoLLaMA3-7B
Visual Question Answering • Updated • 110k • 63internlm/internlm3-8b-instruct
Text Generation • Updated • 50.6k • 218
baichuan-inc/Baichuan-M1-14B-Base
Updated • 47 • 27Note Medical LLM
opencsg/Fineweb-Edu-Chinese-V2.1
Viewer • Updated • 958M • 22.8k • 37Note Dataset designed specifically for natural language processing (NLP) tasks in the education sector.
DAMO-NLP-SG/multimodal_textbook
Updated • 2.93k • 142Note A multimodel dataset for vision language pretraining , includes 6.5M images + 0.8B text from 22k hours of instructional videos
hithink-ai/MME-Finance
Viewer • Updated • 2.06k • 253 • 8KwaiVGI/GameFactory-Dataset
Updated • 165 • 12m-a-p/YuE-s1-7B-anneal-zh-cot
Text Generation • Updated • 1.8k • 39m-a-p/YuE-s1-7B-anneal-jp-kr-cot
Text Generation • Updated • 1.53k • 20m-a-p/YuE-s1-7B-anneal-en-cot
Text Generation • Updated • 10.5k • • 410Qwen/Qwen2.5-VL-3B-Instruct
Image-Text-to-Text • Updated • 3.27M • 416Qwen/Qwen2.5-VL-7B-Instruct
Image-Text-to-Text • Updated • 2.9M • • 972- 2.79k
Hunyuan3D-2.0
🌍Text-to-3D and Image-to-3D Generation
- 57
UI-TARS
🌖Select coordinates on an image based on instructions
- 59
MiniMaxVL01
💬Generate responses using text and images
- 1.98k
Chat With Janus-Pro-7B
🌍A unified multimodal understanding and generation model.
- 584
Qwen2.5 Max Demo
🐢Chat with an AI language model