TorchAO Quantized Qwen3 Collection: TorchAO-quantized Qwen3 models from the PyTorch team, runnable on A100 and H100 GPUs through vLLM and on mobile devices through ExecuTorch • 3 items • Updated May 29
executorch-community/Llama-3.2-1B-Instruct-QLORA_INT4_EO8-ET Text Generation • Updated Apr 10 • 20 • 3
executorch-community/Llama-3.2-1B-Instruct-SpinQuant_INT4_EO8-ET Text Generation • Updated Apr 10 • 22 • 1