Chuanming Liu's picture

In a Training Loop 🔄

Chuanming Liu

Chuanming

·

Chuanming

AI & ML interests

Artificial Intelligence, AGI, NLP, LLMs, Multimodality, MLSys. Python/Golang/C/C++/Shell/awk&sed

Recent Activity

liked a model about 16 hours ago

zai-org/GLM-4.7-Flash

liked a model about 18 hours ago

HauhauCS/Qwen3.5-35B-A3B-Uncensored-HauhauCS-Aggressive

liked a model about 18 hours ago

mlx-community/Qwen3.5-27B-Claude-4.6-Opus-Distilled-MLX-4bit

View all activity

Organizations

upvoted 2 articles about 2 months ago

Article

SigLIP 2: A better multilingual vision language encoder

+1

Feb 21, 2025

•

207

Article

Scaling Real-Time Voice Agents with Cache-Aware Streaming ASR

Jan 5

•

84

upvoted an article 2 months ago

Article

Open Responses: What you need to know

+2

Jan 15

•

109

upvoted an article 3 months ago

Article

Tokenization in Transformers v5: Simpler, Clearer, and More Modular

+4

Dec 18, 2025

•

123

upvoted a paper 3 months ago

Stabilizing Reinforcement Learning with LLMs: Formulation and Practices

Paper • 2512.01374 • Published Dec 1, 2025 • 106

upvoted 2 articles 5 months ago

Article

Fine-Tune Whisper For Multilingual ASR with 🤗 Transformers

Nov 3, 2022

•

363

Article

Supercharge your OCR Pipelines with Open Models

+5

Oct 21, 2025

•

307

upvoted 2 papers 5 months ago

MiniMax-Speech: Intrinsic Zero-Shot Text-to-Speech with a Learnable Speaker Encoder

Paper • 2505.07916 • Published May 12, 2025 • 134

Finite Scalar Quantization Enables Redundant and Transmission-Robust Neural Audio Compression at Low Bit-rates

Paper • 2509.09550 • Published Sep 11, 2025 • 3

upvoted 2 collections 6 months ago

Qwen3Guard

7 items • Updated Dec 31, 2025 • 64

Qwen3-Omni

6 items • Updated Dec 31, 2025 • 187

upvoted an article 6 months ago

Article

Understanding Vector Quantization in VQ-VAE

Aug 28, 2024

•

55

upvoted a paper 6 months ago

Group Sequence Policy Optimization

Paper • 2507.18071 • Published Jul 24, 2025 • 320

upvoted an article 6 months ago

Article

From GRPO to DAPO and GSPO: What, Why, and How

Aug 9, 2025

•

104

upvoted 2 collections 6 months ago

PP-StructureV3

PP-StructureV3 is a SOTA document parsing solution on OmniDocBench, supporting the conversion of PDFs and do cument images to Markdown and JSON. • 17 items • Updated Sep 15, 2025 • 14

PP-OCRv5

PP-OCRv5 is the latest text recognition solution, supporting Simplified Chinese, Chinese Pinyin, Traditional Chinese, English, and Japanese • 13 items • Updated Sep 15, 2025 • 52

upvoted a paper 7 months ago

Step-Audio 2 Technical Report

Paper • 2507.16632 • Published Jul 22, 2025 • 74

upvoted 3 collections 7 months ago

Marvis-TTS-250m-v0.1

5 items • Updated Aug 26, 2025 • 26

AFM-Datasets

Training datasets for OPPO Personal AI Lab’s family of Agent Foundation Models. • 6 items • Updated Feb 4 • 6

AFM-Models

Models for OPPO Personal AI Lab’s family of Agent Foundation Models. • 13 items • Updated Feb 4 • 17