Recent Activity

AdinaY 
posted an update about 17 hours ago
Chinese open source AI in December 2025 was about the stack coming together: open, end to end, and ready to ship 🔥

https://huggingface.co/collections/zh-ai-community/december-2025-china-open-source-highlights

✨ Big wave of foundation models: still scaling, but efficiency, reasoning, and deployment now matter more than size
- DeepSeek-V3.2
- Z.ai GLM-4.7
- MiniMax-M2.1
- Xiaomi: MiMo-V2-Flash

✨ Multimodal reasoning is now default
- Z.ai GLM-4.6V
- Z.ai AutoGLM-Phone 9B
- ByteDance: Dolphin-v2

✨ Image & video: editable assets and real workflows
- Qwen-Image-Layered / Image-2512
- Meituan: LongCat-Image & Image Edit
- AIDC: Ovis-Image-7B
- Live-Avatar / LongCat-Video-Avatar
- HY-WorldPlay / RealVideo

✨ Audio goes edge ready
- GLM-ASR-Nano / Fun-ASR-Nano
- GLM-TTS / VoxCPM1.5
- CosyVoice 0.5B

✨ The quiet backbone: data & infrastructure
- Finch (FinWorkBench)
- Tencent ARC: TimeLens-100K
- BIGAI: TongSIM-Asset
- MiniMax: VTP-Base

✨ Also, congrats to MiniMax and Z.ai on announcing their IPOs, and to Moonshot on its new $500M funding round 🔥

Like everyone else, I was OOO at the end of December, so feel free to share (in comments or PR) any I missed in this list!
AdinaY 
posted an update 1 day ago
2025.1 - DeepSeek entered the scene, backed by High-Flyer Quant
2026.1 - IQuest enters the game, backed by Ubiquant 📈 and launching IQuest-Coder on Hugging Face
https://huggingface.co/collections/IQuestLab/iquest-coder

✨ 40B models: Instruct / Thinking / Loop
✨ Loop = MoE-level performance with only ~5% extra training cost
✨ Native 128K context
AdinaY 
posted an update 18 days ago
AdinaY 
posted an update 21 days ago
Finch 💰 an enterprise-grade benchmark that measures whether AI agents can truly handle real world finance & accounting work.

FinWorkBench/Finch

✨ Built from real enterprise data (Enron + financial institutions), not synthetic tasks
✨ Tests end-to-end finance workflows
✨ Multimodal & cross-file reasoning
✨ Expert annotated (700+ hours) and genuinely challenging
AdinaY 
posted an update 2 months ago
Kimi K2 Thinking is now live on the hub 🔥

moonshotai/Kimi-K2-Thinking

✨ 1T MoE for deep reasoning & tool use
✨ Native INT4 quantization = 2× faster inference
✨ 256K context window
✨ Modified MIT license
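
To put the "1T" and "INT4" figures in perspective, here is a rough, weights-only storage estimate. It is a back-of-the-envelope sketch, not a statement about how the actual checkpoint is laid out: it assumes every parameter is stored at the same bit width and ignores quantization scales, KV cache, and activations. The helper name `weight_memory_gb` is made up for illustration, and the arithmetic says nothing about inference speed.

```python
def weight_memory_gb(num_params: float, bits_per_param: float) -> float:
    """Weights-only footprint in gigabytes (1 GB = 1e9 bytes).

    Assumption: all parameters share one bit width; real checkpoints
    may mix precisions and carry extra quantization metadata.
    """
    return num_params * bits_per_param / 8 / 1e9


params = 1e12  # "1T" total parameters, as stated in the post

for name, bits in [("BF16", 16), ("INT8", 8), ("INT4", 4)]:
    print(f"{name}: {weight_memory_gb(params, bits):,.0f} GB")
```

Under these assumptions, going from 16-bit to 4-bit weights cuts storage by 4×, which is a big part of why low-bit formats matter for serving models of this size.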
AdinaY 
posted an update 2 months ago
Chinese open source AI in October wasn’t about bigger models, it was about real world impact 🔥

https://huggingface.co/collections/zh-ai-community/october-2025-china-open-source-highlights

✨ Vision-Language & OCR wave 🌊
- DeepSeek-OCR: 3B
- PaddleOCR-VL: 0.9B
- Qwen3-VL: 2B / 4B / 8B / 32B / 30B-A3B
- Open-Bee: Bee-8B-RL
- Z.ai Glyph: 10B

OCR is industrializing; the real game now is understanding the (long-context) document, not just reading it.

✨ Text generation: scale or innovation?
- MiniMax-M2: 229B
- Ant Group Ling-1T & Ring-1T
- Moonshot Kimi-Linear: linear-attention challenger
- Kwaipilot KAT-Dev

Efficiency is the key.

✨ Any-to-Any & World-Model : one step forward to the real world
- BAAI Emu 3.5
- Ant Group Ming-flash-omni
- HunyuanWorld-Mirror: 3D

Aligning with the global push toward “world models”.

✨ Audio & Speech + Video & Visual: released from entertainment labs to delivery platforms
- SoulX-Podcast TTS
- LongCat-Audio-Codec & LongCat-Video by Meituan, the delivery platform
- xiabs DreamOmni 2

Looking forward to what's next 🚀
AdinaY 
posted an update 2 months ago
meg 
posted an update 2 months ago
🤖 Did you know your voice might be cloned without your consent from just *one sentence* of audio?
That's not great. So with @frimelle , we brainstormed a new idea for developers who want to curb malicious use: ✨The Voice Consent Gate.✨
Details, code, here: https://huggingface.co/blog/voice-consent-gate
AdinaY 
posted an update 2 months ago
Ming-flash-omni Preview 🚀 Multimodal foundation model from Ant Group

inclusionAI/Ming-flash-omni-Preview

✨ Built on Ling-Flash-2.0: 100B total / 6B active
✨ Generative segmentation-as-editing
✨ SOTA contextual & dialect ASR
✨ High-fidelity image generation
AdinaY 
posted an update 2 months ago

Glyph 🔥 a framework that scales context length by compressing text into images and processing them with vision–language models, released by Z.ai.

Paper: https://huggingface.co/papers/2510.17800
Model: https://huggingface.co/zai-org/Glyph

✨ Compresses long sequences visually to bypass token limits
✨ Reduces computational and memory costs
✨ Preserves meaning through multimodal encoding
✨ Built on GLM-4.1V-9B-Base
AdinaY 
posted an update 3 months ago
HunyuanWorld-Mirror 🔥 a versatile feed-forward model for universal 3D world reconstruction by Tencent

tencent/HunyuanWorld-Mirror

✨ Any prior in → 3D world out
✨ Mix camera, intrinsics, depth as priors
✨ Predict point clouds, normals, Gaussians & more in one pass
✨ Unified architecture for all 3D tasks
AdinaY 
posted an update 3 months ago
PaddleOCR-VL 🔥 0.9B multilingual VLM by Baidu

PaddlePaddle/PaddleOCR-VL

✨ Ultra-efficient NaViT + ERNIE-4.5 architecture
✨ Supports 109 languages 🤯
✨ Accurately recognizes text, tables, formulas & charts
✨ Fast inference and lightweight for deployment
AdinaY 
posted an update 3 months ago
Bee-8B 🐝 open 8B multimodal LLM built on high-quality data, released by TencentHunyuan

Paper: Bee: A High-Quality Corpus and Full-Stack Suite to Unlock Advanced Fully Open MLLMs (2510.13795)
Model: https://huggingface.co/collections/Open-Bee/bee-8b-68ecbf10417810d90fbd9995

✨ Trained on Honey-Data-15M, a 15M-sample SFT corpus with dual-level CoT reasoning
✨ Backed by HoneyPipe, a transparent & reproducible open data curation suite
AdinaY 
posted an update 3 months ago
Ring-1T 🔥 the trillion-parameter thinking model released by Ant Group, the company behind Alipay

inclusionAI/Ring-1T

✨ 1T params (50B active) - MIT license
✨ 128K context (YaRN)
✨ RLVR, Icepop, and ASystem make trillion-scale RL stable
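
For readers new to MoE sizing, the "1T params (50B active)" line can be read as a simple ratio. The snippet below is only illustrative arithmetic on the two numbers quoted in the post; the helper name `active_fraction` is hypothetical and nothing here reflects the model's actual routing configuration.

```python
def active_fraction(total_params: float, active_params: float) -> float:
    """Share of parameters used per token in a sparse (MoE) model."""
    return active_params / total_params


total = 1e12   # "1T params", as stated in the post
active = 50e9  # "50B active", as stated in the post

print(f"Active per token: {active_fraction(total, active):.1%}")
```

In other words, per-token compute scales with the ~50B active parameters, while the full 1T still has to be stored and served.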
AdinaY 
posted an update 3 months ago
KAT-Dev-72B-Exp 🔥 Kuaishou's (the company behind Kling AI) new open model for software engineering

Kwaipilot/KAT-Dev-72B-Exp

✨ 72B - Apache 2.0
✨ Redesigned attention kernel & training engine for efficient context-aware RL
✨ 74.6% accuracy on SWE-Bench Verified