Hugging Face Chinese Localization
HuggingFace-CN-community 's activity
All
Models
Datasets
Spaces
Papers
Collections
Community
Posts
Articles
view post
Kimi-Dev 💻 New coding model by Moonshot AI
moonshotai/Kimi-Dev-72B ✨ 72B - MIT license ✨ 60.4% on SWE-bench Verified ✨ RL-trained to patch real repos in Docker ✨ Only rewarded if full test suite passes
See translation
view post
MiniMax-M1 🔥 The First reasoning model by MiniMax.
MiniMaxAI/minimax-m1-68502ad9634ec0eeac8cf094 ✨ 40k/80k thinking budget ✨ Powered by Hybrid MoE + Lightning Attention 👀 ✨ 1M context length 🤯 ✨ Apache 2.0 ✨ RL-trained for math, coding & real-world software
See translation
view post
Hunyuan 3D 2.1 🔥 Industrial-grade 3D model just released by Tencent Hunyuan
tencent/Hunyuan3D-2.1
tencent/Hunyuan3D-2.1 ✨ PBR materials: leather, bronze & more, breathtaking realism under any light ✨ Consumer GPU-ready: good for developers and small teams
See translation
view post
RoboBrain 2.0🔥 OPEN embedded brain model by BAAIBeijing
BAAI/RoboBrain2.0-7B ✨ 7B - Apache 2.0 / 32B coming soon ✨ Supports multiple images, long videos, and high-resolution visuals ✨ Spatial + temporal reasoning ✨ Real-time memory & scene graphs
See translation
view post
RedNote 小红书 just released their first LLM 🔥 dots.llm1.base 🪐 a 142B MoE model with only 14B active params.
rednote-hilab/dotsllm1-68246aaaaba3363374a8aa7c ✨ Base & Instruct - MIT license ✨ Trained on 11.2T non-synthetic high-quality data ✨ Competitive with Qwen2.5/3 on reasoning, code, alignment
See translation
view post
MiniCPM4🔥 efficient LLMs built for end-side devices, by OpenBMB
openbmb/minicpm4-6841ab29d180257e940baa9b ✨ Apache 2.0 ✨ 5–7× Faster Inference (Jetson Orin & RTX 4090) ✨ 8B trained on 8T clean, non-synthetic tokens ✨ 32K Native Context -> 128K+ with InfLLM v2 + LongRoPE ✨ Runs on 🤗Transformers , http://CPM.cu , vLLM, and SGLang
See translation
view post
OpenAudio S1-mini 🔊 a new OPEN multilingual TTS model trained on 2M+ hours of data, by FishAudio
fishaudio/openaudio-s1-mini ✨ Supports 14 languages ✨ 50+ emotions & tones ✨ RLHF-optimized ✨ Special effects: laughing, crying, shouting, etc.
See translation
1 reply
·
Reply
view post
SynLogic 🧠 logical reasoning model & dataset by MiniMax.
MiniMaxAI/synlogic-6836c3246fca0277657ff032 ✨ 3 models: 7B/32B/ Mix-3-32B (MIT license) ✨ Dataset: 35 verifiable logic tasks (Sudoku, Cipher, Arrow Maze etc.) ✨ RL training with auto-verifiable rewards ✨ Generalizes to math without explicit math training ✨ +6 pts on BBEH, +9.5 on KOR-Bench vs baselines
See translation
view post
Video-XL-2 🔥 long video understanding model by BAAI & Shanghai Jiaotong University
BAAI/Video-XL-2 ✨ Apache 2.0 ✨ Handles up to 10,000+ frames on a single GPU ✨ 2048-frame encoding in just 12s ✨ Efficient Chunk-based Prefilling & Bi-granularity KV decoding
See translation
view post
May highlights from China’s open source ecosystem 🔥
zh-ai-community/may-2025-open-works-from-the-chinese-community-681a3494145f2914dc679b7c ✨ DeepSeek dropped R1 updates - Both R1 & 8B distralled smol model ✨ Bytedance goes big on open source: - BAGEL, Dolphin, Seedcoder, Dream0... ✨ Multimodal is on fire! - HuyuanCustom / HunyuanVideo-Avatar / HunyuanPortrait - MiniMax: SynLogic / Orsta-7B - Xiaomi: MiMo VL - Alibaba Wan: Wan2.1-VACE - OpenGVlab: ZeroGUI - StepFun: ACE-Step-v1/Step1X-3D ✨ Specialized models/datasets excels - Alibaba Qwen: World PM 72B - BAAI:RobotBrain (MLLM for robotic) - HiThink Research: BizFinBench (dataset) - OpenBMB: Ultra FineWeb (dataset) - Bilibili: Index-anisora (Anime/ACG) - Skywork:Matrix-Game (game) More awesome releases: Alibaba QwenLong-L1-32B, SkyWork OR1, OpenS2V-5M etc...
See translation
view post
MiMo-VL 🔥 smol & mighty vision language model by Xiaomi
XiaomiMiMo/mimo-vl-68382ccacc7c2875500cd212 ✨ 7B with RL & SFT ✨ Native resolution ViT for fine grained perception ✨ MORL = smarter alignment across perception, grounding & reasoning
See translation
view post
🔥 New benchmark & dataset for Subject-to-Video generation OPENS2V-NEXUS by Pekin University ✨ Fine-grained evaluation for subject consistency
BestWishYsh/OpenS2V-Eval ✨ 5M-scale dataset:
BestWishYsh/OpenS2V-5M ✨ New metrics – automatic scores for identity, realism, and text match
See translation
2 replies
·
Reply
view post
HunyuanVideo-Avatar 🔥 another image to video model byTencent Hunyuan
tencent/HunyuanVideo-Avatar ✨Emotion-controlled, high-dynamic avatar videos ✨Multi-character support with separate audio control ✨Works with any style: cartoon, 3D, real face, while keeping identity consistent
See translation
view post
Orsta 🔥 vision language models trained with V-Triune, a unified reinforcement learning system by MiniMax AI
One-RL-to-See-Them-All/one-rl-to-see-them-all-6833d27abce23898b2f9815a ✨ 7B & 32B with MIT license ✨ Masters 8 visual tasks: math, science QA, charts, puzzles, object detection, grounding, OCR, and counting ✨ Uses Dynamic IoU rewards for better visual understanding ✨Strong performance in visual reasoning and perception
See translation