AI & ML interests

Low-bit Quantization of Large Language Models (LLMs)

Recent Activity

HaoranChu  updated a model 2 months ago
Efficient-ML/GPTQ-for-Qwen3
HaoranChu  updated a collection 2 months ago
Qwen3-Quantization
View all activity

Efficient-ML 's collections 2