HuggingFace-CN-community (Hugging Face Chinese Localization)

AdinaY

posted an update 1 day ago

Post

328

The tech report of RoboBrain 2.0 is now available on the Daily Papers page🔥

It's an embedded brain model that sees, thinks, and plans for many robots.

Leave your insights or questions, the authors are happy to respond.
RoboBrain 2.0 Technical Report (2507.02029)

AdinaY

posted an update 2 days ago

Post

215

Skywork-Reward-V2🔥 Reward models by Skywork AI.

Skywork/skywork-reward-v2-685cc86ce5d9c9e4be500c84

✨ 0.6B - 8B
✨ Trained on 26M human-LLM preference pairs
✨ 0.6B > 27B in many tasks

AdinaY

posted an update 2 days ago

Post

181

POLAR🐻‍❄️ New reward modeling by Shanghai AI Lab

internlm/polar-68693f829d2e83ac5e6e124a

✨ 1.8B/7B - Apache 2.0
✨ Scalable policy discriminative pretraining
✨ Easy RLHF with minimal preference data

AdinaY

posted an update 7 days ago

Post

1911

The Chinese Open Source Heatmap is live 🔥
You can now track the companies/ research labs/ communities powering China’s open source AI movement.

zh-ai-community/model-release-heatmap-zh

Some highlights:

✨Giant Tech are investing more in open source.
-Alibaba: Full stack open ecosystem
-Tecent: Hunyuan image/video/3D
-Bytedance: Catching up fast in 2025
-Baidu: New player in open LLM

✨New players emerging post–DeepSeek moment.
-Xiaomi
-Red Note
-Bilibili
-MiniMax
-Moonshot AI

✨Startup list is shifting fast! Those who find a direction aligned with their strengths are the ones who endure.
-DeepSeek
-MiniMax
-StepFun
-Moonshot AI
-Zhipu AI
-OpenBMB

✨Research Lab & Community are making key contributions.
-BAAI
-Shanghai AI Lab
-OpenMOSS
-MAP

AdinaY

posted an update 8 days ago

Post

3301

🔥 June highlights from China’s open source ecosystem.

zh-ai-community/june-2025-open-works-from-the-chinese-community-683d66c188f782dc5570ba15

✨Baidu & MiniMax both launched open foundation models
- Baidu: Ernie 4.5 ( from 0.3B -424B ) 🤯
- MiniMax: MiniMax -M1 ( Hybrid MoE reasoning model )

✨Multimodal AI is moving from fusion to full-stack reasoning: unified Any-to-Any pipelines across text, vision, audio, and 3D
- Baidu: ERNIE-4.5-VL-424B
- Moonshot AI: Kimi-VL-A3B
- Alibaba: Ovis-U1
- BAAI: Video-XL-2/OmniGen2
- AntGroup: Ming-Lite-Omni
- Chinese Academy of Science: Stream-Omni
- Bytedance: SeedVR2-3B
- Tencent: Hunyuan 3D 2.1/ SongGeneration
- FishAudio: Openaudio-s1-mini

✨Domain specific models are rapidly emerging
- Alibaba DAMO: Lingshu-7B (medical MLLM)
- BAAI: RoboBrain (Robotics)

✨ So many small models!
- OpenBMB: MiciCPM4 ( on device )
- Qwen: Embedding/Reranker (0.6B)
- Alibaba: Ovis-U1-3B
- Moonshot AI: Kimi-VL-A3B
- Bytedance: SeedVR2-3B

AdinaY

posted an update 8 days ago

Post

300

MTVCraft 🔥 Veo3 style Audio-Video model by BAAI

Model:
BAAI/MTVCraft
Demo:
BAAI/MTVCraft

✨ Text > [Speech + SFX + BGM] > Synchronized Video
✨ Built with Qwen3 + ElevenLabs + MTV

AdinaY

posted an update 8 days ago

Post

2278

GLM-4.1V-Thinking 🔥 New open vision reasoning model by Zhipu AI

THUDM/glm-41v-thinking-6862bbfc44593a8601c2578d

✨ 9B base & Thinking - MIT license
✨ CoT + RL with Curriculum Sampling
✨ 64k context, 4K image, any aspect ratio
✨ Support English & Chinese
✨ Outperforms GPT 4O -2024/11/20

AdinaY

posted an update 10 days ago

Post

971

Pangu Pro MoE 🔥 Huawei's first open model!

Paper:
Pangu Pro MoE: Mixture of Grouped Experts for Efficient Sparsity (2505.21411)
Model:
https://gitcode.com/ascend-tribe/pangu-pro-moe-model

✨ MoGE: Mixture of Grouped Experts
✨ 16B activated params - 48 layers
✨ Trained on 15T tokens
✨ Natively optimized for Ascend hardware

1 reply

·

AdinaY

posted an update 10 days ago

Post

328

Baidu kept its promise, releasing 10 open models on the very last day of June🚀 Let's meet ERNIE 4.5 🔥

baidu/ernie-45-6861cd4c9be84540645f35c9

✨ From 0.3B to 424B total params
✨ Includes 47B & 3B active param MoE models + a 0.3B dense model
✨ Apache 2.0
✨ 128K context length
✨ Text+Vision co-training with ViT & UPO

AdinaY

posted an update 13 days ago

Post

3088

Hunyuan-A13B 🔥 New MoE LLM by TencentHunyuan

tencent/Hunyuan-A13B-Instruct

✨80B total / 13B active params
✨256K context window
✨Dual-mode reasoning: fast & slow thinking
✨Efficient inference (GQA + quantization)

AdinaY

posted an update 16 days ago

Post

1616

LongWriter-Zero 🔥 A Purely RL trained LLM handles 10K+ token coherent passages by Tsinghua University

Model:
THU-KEG/LongWriter-Zero-32B
Dataset:
THU-KEG/LongWriter-Zero-RLData
Paper:
LongWriter-Zero: Mastering Ultra-Long Text Generation via Reinforcement Learning (2506.18841)

✨ 32B
✨ Multi-reward GRPO: length, fluency, structure, non-redundancy
✨ Enforces <think><answer> format via Format RM
✨ Build on Qwen2.5-32B-base

AdinaY

posted an update 16 days ago

Post

303

MOSS-TTSD 🔊 Bilingual text-to-spoken dialogue model by Fudan University - Open MOSS team.

Model:
fnlp/MOSS-TTSD-v0
Demo:
fnlp/MOSS-TTSD

✨ Supports Chinese & English
✨ Zero-shot 2-speaker voice cloning
✨ Long-form generation (up to 960s)
✨ Built on Qwen 3

AdinaY

posted an update 17 days ago

Post

269

Skywork-SWE 🔥 New code agent model by Skywork 天工

Skywork/Skywork-SWE-32B

✨ 32B - Apache 2.0
✨ 38.0% pass@1 on SWE-bench Verified
✨ Up to 47.0% with test-time scaling
✨ Shows clear data scaling law (8K+ demos)
✨ Built on Qwen2.5-Coder-32B + OpenHands

AdinaY

posted an update 23 days ago

Post

1285

SongGeneration 🎵 A model by Tencent for vocal + accompaniment music generation.

Model:
tencent/SongGeneration
Demo:
https://huggingface.co/spaces/waytan22/SongGeneration-LeVo

✨ Mixed & dual-track token modeling
✨ Hi-fi music with custom codec
✨ Currently supports Chinese, English version coming soon!

AdinaY

posted an update 24 days ago

Post

4042

Kimi-Dev 💻 New coding model by Moonshot AI

moonshotai/Kimi-Dev-72B

✨ 72B - MIT license
✨ 60.4% on SWE-bench Verified
✨ RL-trained to patch real repos in Docker
✨ Only rewarded if full test suite passes

AdinaY

posted an update 24 days ago

Post

653

MiniMax-M1 🔥 The First reasoning model by MiniMax.

MiniMaxAI/minimax-m1-68502ad9634ec0eeac8cf094

✨ 40k/80k thinking budget
✨ Powered by Hybrid MoE + Lightning Attention 👀
✨ 1M context length 🤯
✨ Apache 2.0
✨ RL-trained for math, coding & real-world software

AdinaY

posted an update 24 days ago

Post

454

Hunyuan 3D 2.1 🔥 Industrial-grade 3D model just released by Tencent Hunyuan

tencent/Hunyuan3D-2.1
tencent/Hunyuan3D-2.1

✨ PBR materials: leather, bronze & more, breathtaking realism under any light
✨ Consumer GPU-ready: good for developers and small teams

AdinaY

posted an update 30 days ago

Post

1594

Lingshu 🩺📖 medical MLLM released by DAMO Alibaba

lingshu-medical-mllm/lingshu-mllms-6847974ca5b5df750f017dad

✨ 7B/32B
✨ 12+ imaging modalities supported: X-Ray, CT, MRI, Microscopy +more
✨ Great performance on medical benchmark

AdinaY

posted an update about 1 month ago

Post

3183

RoboBrain 2.0🔥 OPEN embedded brain model by BAAIBeijing

BAAI/RoboBrain2.0-7B

✨ 7B - Apache 2.0 / 32B coming soon
✨ Supports multiple images, long videos, and high-resolution visuals
✨ Spatial + temporal reasoning
✨ Real-time memory & scene graphs

AdinaY

posted an update about 1 month ago

Post

2685

RedNote 小红书 just released their first LLM 🔥

dots.llm1.base 🪐 a 142B MoE model with only 14B active params.

rednote-hilab/dotsllm1-68246aaaaba3363374a8aa7c
✨ Base & Instruct - MIT license
✨ Trained on 11.2T non-synthetic high-quality data
✨ Competitive with Qwen2.5/3 on reasoning, code, alignment

AI & ML interests

Team members 8

HuggingFace-CN-community's activity