❄️January 2025 - Open releases from the Chinese community - a zh-ai-community Collection

zh-ai-community 's Collections

⭐ 3D models - 2025

🧠 Reasoning model 2025

💻 Coding models 2025

🎬 Video model 2025

🎨Image model 2025

🔊 Audio model 2025

🍉 June 2025 - Open works from the Chinese community

🌞 May 2025 - Open works from the Chinese community

🌸 April 2025 - Open releases from the Chinese community

🌙 March 2025 - Open releases from the Chinese community

🧧 February 2025 - Open releases from the Chinese community

❄️January 2025 - Open releases from the Chinese community

🖼️ MLLM by the Chinese community - 2025

🧠 Reasoning Models

🎬 Video models

🔊 Audio Models

🔢 Math models

🏆 Leaderboards & Arenas

🚀 Trending Demo

💻 Code Models

🎨 Image models

2025 January Papers 🧐

📑 Trending Papers - October 🔟

📑Trending Papers - September 9⃣️

❄️January 2025 - Open releases from the Chinese community

updated 1 day ago

deepseek-ai/Janus-Pro-7B

Any-to-Any • Updated Feb 1 • 90.6k • 3.42k
deepseek-ai/Janus-Pro-1B

Any-to-Any • Updated Feb 1 • 26k • 445
tencent/Hunyuan3D-2

Image-to-3D • Updated Apr 10 • 295k • 1.53k
tencent/Hunyuan-7B-Instruct

Text Generation • Updated Jan 24 • 221 • 49
ByteDance/Sa2VA-4B

Image-Text-to-Text • Updated Mar 19 • 1.57k • • 76

Note A unified model for dense grounded understanding of images & videos.
ByteDance-Seed/UI-TARS-72B-DPO

Image-Text-to-Text • Updated Jan 25 • 6k • 132
deepseek-ai/DeepSeek-R1

Text Generation • Updated Mar 27 • 639k • • 12.4k

Note 660B reasoning models with MIT license
deepseek-ai/DeepSeek-R1-Zero

Text Generation • Updated Mar 27 • 3.55k • 921
MiniMaxAI/MiniMax-VL-01

Image-Text-to-Text • Updated 2 days ago • 26.4k • 266

Note A non transformer based ( ViT-MLP-LLM framework) VLM
MiniMaxAI/MiniMax-Text-01

Text Generation • Updated 2 days ago • 13.1k • 608

Note 456B LLM with 1M tokens training context
Qwen/Qwen2.5-Math-PRM-7B

Text Classification • Updated Jan 17 • 16.2k • 70

Note Math model
Qwen/Qwen2.5-14B-Instruct-1M

Text Generation • Updated Jan 29 • 33.1k • • 309
openbmb/MiniCPM-o-2_6

Any-to-Any • Updated about 22 hours ago • 163k • 1.17k

Note End-side multimodal LLM that supports real time conversation and video understanding.
ICTNLP/llava-mini-llama-3.1-8b

Image-Text-to-Text • Updated Jan 13 • 1.06k • 53
BlinkDL/rwkv-7-world

Text Generation • Updated 18 days ago • 100

Note RNN+Transfomers
HKUSTAudio/Llasa-3B

Text-to-Speech • Updated May 10 • 3.79k • • 505

Note TTS
DAMO-NLP-SG/VideoLLaMA3-7B

Visual Question Answering • Updated Mar 20 • 110k • 63
internlm/internlm3-8b-instruct

Text Generation • Updated Feb 11 • 50.6k • 218
baichuan-inc/Baichuan-M1-14B-Base

Updated Feb 20 • 47 • 27

Note Medical LLM
opencsg/Fineweb-Edu-Chinese-V2.1

Viewer • Updated Feb 27 • 958M • 22.8k • 37

Note Dataset designed specifically for natural language processing (NLP) tasks in the education sector.
DAMO-NLP-SG/multimodal_textbook

Updated Mar 17 • 2.93k • 142

Note A multimodel dataset for vision language pretraining , includes 6.5M images + 0.8B text from 22k hours of instructional videos
hithink-ai/MME-Finance

Viewer • Updated 19 days ago • 2.06k • 253 • 8
KwaiVGI/GameFactory-Dataset

Updated Mar 22 • 165 • 12
m-a-p/YuE-s1-7B-anneal-zh-cot

Text Generation • Updated Mar 12 • 1.8k • 39
m-a-p/YuE-s1-7B-anneal-jp-kr-cot

Text Generation • Updated Mar 12 • 1.53k • 20
m-a-p/YuE-s1-7B-anneal-en-cot

Text Generation • Updated Mar 12 • 10.5k • • 410
Qwen/Qwen2.5-VL-3B-Instruct

Image-Text-to-Text • Updated Apr 6 • 3.27M • 416
Qwen/Qwen2.5-VL-7B-Instruct

Image-Text-to-Text • Updated Apr 6 • 2.9M • • 972
Running on Zero

2.79k

2.79k

Hunyuan3D-2.0

🌍

Text-to-3D and Image-to-3D Generation
Running

57

57

UI-TARS

🌖

Select coordinates on an image based on instructions
Running

59

59

MiniMaxVL01

💬

Generate responses using text and images
Running on Zero

1.98k

1.98k

Chat With Janus-Pro-7B

🌍

A unified multimodal understanding and generation model.
Running

584

584

Qwen2.5 Max Demo

🐢

Chat with an AI language model