Chinese LLMs on Hugging Face
community
- deepseek-ai/DeepSeek-V3.1-Terminus
  Text Generation • 685B • 18.2k • 308
- deepseek-ai/DeepSeek-V3.2-Exp
  Text Generation • 685B • 10.7k • 477
- zai-org/GLM-4.6
  Text Generation • 357B • 9.72k • 309
- Kwai-Klear/Klear-46B-A2.5B-Base
  Text Generation • 46B • 93 • 27

- Qwen/Qwen3-Coder-30B-A3B-Instruct
  Text Generation • 31B • 573k • 653
- Qwen/Qwen3-30B-A3B-Thinking-2507
  Text Generation • 31B • 99.3k • 286
- Qwen/Qwen3-Coder-480B-A35B-Instruct
  Text Generation • 480B • 219k • 1.21k
- Qwen/Qwen3-235B-A22B-Instruct-2507
  Text Generation • 235B • 185k • 688

- deepseek-ai/DeepSeek-R1-0528
  Text Generation • 685B • 727k • 2.37k
- deepseek-ai/DeepSeek-R1-0528-Qwen3-8B
  Text Generation • 8B • 87.1k • 963
- ByteDance-Seed/BAGEL-7B-MoT
  Any-to-Any • 15B • 651 • 1.14k
- ByteDance-Seed/Seed-Coder-8B-Reasoning
  Text Generation • 8B • 988 • 141

- Seed-Coder: Let the Code Model Curate Data for Itself
  Paper • 2506.03524 • 6
- Seed1.5-Thinking: Advancing Superb Reasoning Models with Reinforcement Learning
  Paper • 2504.13914 • 4
- FlowTok: Flowing Seamlessly Across Text and Image Tokens
  Paper • 2503.10772 • 19
- UVE: Are MLLMs Unified Evaluators for AI-Generated Videos?
  Paper • 2503.09949 • 5
text-to-video & image-to-video models released by the Chinese community

- MoBA: Mixture of Block Attention for Long-Context LLMs
  Paper • 2502.13189 • 17
- Kimi-Audio Technical Report
  Paper • 2504.18425 • 19
- Kimi-VL Technical Report
  Paper • 2504.07491 • 132
- Kimi k1.5: Scaling Reinforcement Learning with LLMs
  Paper • 2501.12599 • 123

- DeepSeek-Prover-V2: Advancing Formal Mathematical Reasoning via Reinforcement Learning for Subgoal Decomposition
  Paper • 2504.21801 • 2
- DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning
  Paper • 2501.12948 • 418
- Insights into DeepSeek-V3: Scaling Challenges and Reflections on Hardware for AI Architectures
  Paper • 2505.09343 • 70
- DeepSeek-V3 Technical Report
  Paper • 2412.19437 • 69

- fishaudio/fish-speech-1.5
  Text-to-Speech • 2.05k • 630
- ClearerVoice-Studio (Speech Enhancement, Separation and Extraction)
  269 • 📈 Better AI-powered platform to purify your speech signal
- fishaudio/fish-speech-1.4
  Text-to-Speech • 186 • 451
- fishaudio/fish-speech-1.2
  Text-to-Speech • 126 • 207

- deepseek-ai/DeepSeek-V2.5-1210
  Text Generation • 236B • 1.35k • 254
- infly/OpenCoder-8B-Instruct
  Text Generation • 8B • 1.51k • 197
- Qwen/Qwen2.5-Coder-32B-Instruct
  Text Generation • 33B • 137k • 1.94k
- deepseek-ai/DeepSeek-Coder-V2-Base
  Text Generation • 236B • 22.2k • 78