AMD

Enterprise

company

Verified

http://www.amd.com/

Recent Activity

linzhao-amd updated a model 1 day ago

amd/Qwen3.5-397B-A17B-MoE-MXFP4

linzhao-amd updated a model 1 day ago

amd/Qwen3.5-397B-A17B-MXFP4

linzhao-amd updated a model 1 day ago

amd/Kimi-K2.5-MXFP4

Papers

Stabilizing Efficient Reasoning with Step-Level Advantage Selection

Dynamic Chunking Diffusion Transformer

Articles

Join the AMD Open Robotics Hackathon

Nov 13, 2025 • 16

amd's collections (47)

zentorch Quantized Models - LLM-Compressor v0.11.0

LLM-Compressor v0.11.0 quantized models for AMD EPYC CPU inference

amd/Llama-3.3-70B-Instruct-w4a16-llmcompressor-v0.11.0

Text Generation • 11B • Updated 4 days ago • 41

PARD-2

amd/PARD2-Qwen3-8B

Text Generation • 0.8B • Updated 7 days ago • 1.26k
amd/PARD2-Llama-3.1-8B

Text Generation • 1B • Updated 10 days ago • 79
amd/PARD2-Qwen3-14B

Text Generation • 0.8B • Updated 10 days ago • 70

Ryzen AI 1.7.1 — NPU LFM2 Models

ONNX-based NPU builds of Liquid AI's Liquid Foundation Models (LFM2)

amd/LFM2-1.2B-ONNX_rai_1.7.1

Updated Apr 19 • 40 • 1
amd/LFM2.5-1.2B-Thinking-ONNX_rai_1.7.1

Updated Apr 19 • 10
amd/LFM2-2.6B-ONNX_rai_1.7.1

Updated Apr 19 • 32 • 1

Ryzen AI 1.7.1 — NPU 4K

Ryzen AI 1.7.1 models supporting context length up to 4K

amd/CodeLlama-7b-Instruct-hf_rai_1.7.1_npu_4K

Text Generation • Updated Mar 31 • 10
amd/DeepSeek-R1-Distill-Llama-8B_rai_1.7.1_npu_4K

Text Generation • Updated Mar 31 • 10
amd/DeepSeek-R1-Distill-Qwen-1.5B_rai_1.7.1_npu_4K

Text Generation • Updated Mar 31 • 16
amd/DeepSeek-R1-Distill-Qwen-7B_rai_1.7.1_npu_4K

Text Generation • Updated Mar 31 • 11

Ryzen-AI-1.7-NPU-LLM_V2

amd/Gemma-3-4b-it-mm-onnx-ryzenai-npu

Updated Jan 21 • 2
amd/gpt-oss-20b-onnx-ryzenai-npu

Updated Dec 13, 2025 • 2

Ryzen-AI-1.7.1 — SD Models

Stable Diffusion models for AMD NPU

amd/stable-diffusion-1.5-amdnpu

Text-to-Image • Updated Feb 11 • 6
amd/sd-turbo-amdnpu

Text-to-Image • Updated Feb 24 • 1
amd/sdxl-turbo-amdnpu

Text-to-Image • Updated Feb 24 • 2
amd/sdxl-base-amdnpu

Text-to-Image • Updated Feb 24 • 1

Ryzen-AI-1.7-NPU-creativity-models

amd/ryzenai-realesrgan

Updated Jan 21
amd/ryzenai-psfrgan

Updated Jan 21
amd/ryzenai-sesr

Updated Feb 4
amd/ryzenai-hrnet-bg-seg

Image Segmentation • Updated Jan 21

Ryzen-AI-1.7-Hybrid-LLM

amd/Qwen3-14B-onnx-ryzenai-1.7-hybrid

Text Generation • Updated Jan 27 • 5
amd/Qwen2.5-14B-instruct-onnx-ryzenai-1.7-hybrid

Text Generation • Updated Jan 27
amd/SmolLM2-135M-Instruct-onnx-ryzenai-1.7-hybrid

Text Generation • Updated Jan 26
amd/SmolLM-135M-Instruct-onnx-ryzenai-1.7-hybrid

Updated Jan 26

SAND

amd/SAND-Math-Qwen2.5-32B

Text Generation • 33B • Updated Dec 6, 2025 • 27 • 3
amd/SAND-MathScience-DeepSeek-Qwen32B

Text Generation • 33B • Updated Dec 6, 2025 • 30 • 2
amd/SAND-Post-Training-Dataset

Viewer • Updated Dec 6, 2025 • 27.9k • 235 • 3
amd/SAND-MATH

Viewer • Updated Oct 17, 2025 • 16.9k • 301 • 3

Ryzen AI Whisper NPU Optimized ONNX models

amd/whisper-small-onnx-npu

Updated Jan 15
amd/whisper-medium-onnx-npu

Updated Jan 30
amd/whisper-large-turbo-onnx-npu

Updated Jan 15 • 4
amd/whisper-base-onnx-npu

Updated Feb 10

Ryzen-AI-1.6-Hybrid-LLM

amd/AMD-OLMo-1B-SFT-DPO-onnx-ryzenai-hybrid

Updated Oct 23, 2025 • 10
amd/CodeLlama-7b-Instruct-hf-onnx-ryzenai-hybrid

Updated Oct 23, 2025 • 12 • 2
amd/DeepSeek-R1-Distill-Llama-8B-onnx-ryzenai-hybrid

Updated Oct 23, 2025 • 12 • 1
amd/DeepSeek-R1-Distill-Qwen-1.5B-onnx-ryzenai-hybrid

Updated Oct 23, 2025 • 16

Quark ByteDance Models

amd/DeepSeek-R1-0528-MXFP4-ASQ

342B • Updated Dec 12, 2025 • 10 • 1
amd/Llama-3.3-70B-Instruct-MXFP4-Preview

38B • Updated 3 days ago • 15.9k • 2
amd/Llama-3.1-405B-Instruct-MXFP4-Preview

218B • Updated Nov 6, 2025 • 2.54k • 1

Dell Pro AI Studio

Models for Dell Pro AI Studio

amd/NPU-Whisper-Base-Small

Updated Jul 30, 2025 • 4
amd/NPU-Nomic-embed-text-v1.5-ryzen-strix-cpp

Updated Nov 17, 2025 • 3
amd/NPU-ESRGAN-ryzen-strix-cpp

Updated Jul 17, 2025 • 2
amd/NPU-CLIP-Python

Updated Oct 6, 2025 • 1

RyzenAI-1.5_LLM_Hybrid_Models

amd/Phi-3-mini-4k-instruct-awq-g128-int4-asym-fp16-onnx-hybrid

Text Generation • Updated Aug 27, 2025 • 19
amd/Phi-3.5-mini-instruct-awq-g128-int4-asym-fp16-onnx-hybrid

Text Generation • Updated Sep 16, 2025 • 13
amd/Mistral-7B-Instruct-v0.3-awq-g128-int4-asym-fp16-onnx-hybrid

Updated Sep 16, 2025 • 24
amd/Qwen1.5-7B-Chat-awq-g128-int4-asym-fp16-onnx-hybrid

Text Generation • Updated Sep 16, 2025 • 28

Gumiho

Official Model Parameters for "Gumiho: A Hybrid Architecture to Prioritize Early Tokens in Speculative Decoding"

Gumiho: A Hybrid Architecture to Prioritize Early Tokens in Speculative Decoding

Paper • 2503.10135 • Published Mar 13, 2025
amd/Gumiho-llama2-70b

Updated Jun 12, 2025
amd/Gumiho-llama2-7b

Updated Jun 12, 2025
amd/Gumiho-llama3-70b

Updated Jun 12, 2025

OGA CPU LLM Collection

This collection contains models quantized with AMD Quark and exported with OGA for CPU execution.

amd/Phi-3-mini-4k-instruct_int4_float16_onnx_cpu

Updated Apr 12, 2025
amd/Qwen1.5-7B-Chat_uint4_asym_g128_float16_onnx_cpu

Updated Apr 12, 2025
amd/DeepSeek-R1-Distill-Llama-8B-awq-asym-uint4-g128-lmhead-onnx-cpu

Text Generation • Updated Jan 30, 2025
amd/Llama-3.2-1B-Instruct-awq-uint4-float16-cpu-onnx

Updated Apr 28, 2025

Quark Quantized DeepSeek Models

amd/DeepSeek-R1-MXFP4

371B • Updated Apr 13 • 195k • 5
amd/DeepSeek-R1-MXFP4-ASQ

363B • Updated Nov 6, 2025 • 995 • 1
amd/DeepSeek-R1-0528-MXFP4

356B • Updated Feb 26 • 30.6k • 2
amd/DeepSeek-R1-0528-MXFP4-ASQ

342B • Updated Dec 12, 2025 • 10 • 1

RyzenAI-1.4_LLM_NPU_Models

amd/Phi-3-mini-4k-instruct-awq-g128-int4-asym-bf16-onnx-ryzen-strix

Text Generation • Updated Aug 27, 2025 • 35 • 2
amd/Phi-3.5-mini-instruct-awq-g128-int4-asym-bf16-onnx-ryzen-strix

Text Generation • Updated Sep 16, 2025 • 20 • 3
amd/Mistral-7B-Instruct-v0.3-awq-g128-int4-asym-bf16-onnx-ryzen-strix

Updated Sep 16, 2025 • 7
amd/Qwen1.5-7B-Chat-awq-g128-int4-asym-bf16-onnx-ryzen-strix

Text Generation • Updated Jun 28, 2025 • 9 • 1

Instella-VL✨

amd/Instella-VL-1B

1B • Updated Mar 7, 2025 • 240 • 8

AMD-HybridLM-Models ✨

AMD-HybridLM is a family of post-trained, highly efficient hybrid models, designed to combine performance with speed and memory efficiency.

amd/Zebra-Llama-1B-4MLA-12Mamba-DPO

Updated Sep 23, 2025 • 335
amd/Zebra-Llama-1B-4MLA-12Mamba-SFT

Updated Sep 23, 2025 • 7
amd/Zebra-Llama-1B-8MLA-8Mamba-DPO

Updated Sep 23, 2025 • 6
amd/Zebra-Llama-1B-8MLA-8Mamba-SFT

Updated Sep 23, 2025 • 5

AMDGPU onnx

Optimized image generation ONNX models for AMD Ryzen™ AI GPUs and Radeon discrete GPUs

amd/stable-diffusion-xl-1.0_io32_amdgpu

Text-to-Image • Updated Dec 17, 2025 • 6
amd/stable-diffusion-1.5_io32_amdgpu

Text-to-Image • Updated Dec 17, 2025 • 22
amd/stable-diffusion-xl-1.0_io16_amdgpu

Updated Apr 3, 2025 • 3
amd/stable-diffusion-1.5_io16_amdgpu

Text-to-Image • Updated Apr 3, 2025 • 17

RyzenAI-1.3_LLM_Hybrid_Models

Models quantized by Quark and prepared for the OGA-based hybrid execution flow (Ryzen AI 1.3)

amd/Phi-3-mini-4k-instruct-awq-g128-int4-asym-fp16-onnx-hybrid

Text Generation • Updated Aug 27, 2025 • 19
amd/Phi-3.5-mini-instruct-awq-g128-int4-asym-fp16-onnx-hybrid

Text Generation • Updated Sep 16, 2025 • 13
amd/Mistral-7B-Instruct-v0.3-awq-g128-int4-asym-fp16-onnx-hybrid

Updated Sep 16, 2025 • 24
amd/Qwen1.5-7B-Chat-awq-g128-int4-asym-fp16-onnx-hybrid

Text Generation • Updated Sep 16, 2025 • 28

AMD-OLMo

AMD-OLMo is a series of 1-billion-parameter language models, based on OLMo, trained by AMD on AMD Instinct™ MI250 GPUs.

amd/AMD-OLMo

Text Generation • Updated Nov 17, 2025 • 84
amd/AMD-OLMo-1B

Text Generation • 1B • Updated Nov 17, 2025 • 35 • 25
amd/AMD-OLMo-1B-SFT

Text Generation • 1B • Updated Nov 17, 2025 • 34 • 21
amd/AMD-OLMo-1B-SFT-DPO

Text Generation • 1B • Updated Nov 17, 2025 • 62 • 23

Quark Quantized OCP FP8 Models

amd/Llama-3.1-8B-Instruct-FP8-KV

8B • Updated 3 days ago • 44.5k • 6
amd/Llama-3.1-70B-Instruct-FP8-KV

71B • Updated Dec 19, 2024 • 3.54k • 5
amd/Llama-3.1-405B-Instruct-FP8-KV

406B • Updated Dec 19, 2024 • 2.85k • 5
amd/Mixtral-8x7B-Instruct-v0.1-FP8-KV

3B • Updated 3 days ago • 13k • 3

zentorch Quantized Models - LLM-Compressor v0.10.0.2

LLM-Compressor v0.10.0.2 quantized models for AMD EPYC CPU inference

amd/gpt-oss-20b-BF16-w8a8-llmcompressor-v0.10.0.2

Text Generation • 21B • Updated 18 days ago • 2.24k
amd/Llama-3.1-8B-Instruct-w4a16-llmcompressor-v0.10.0.2

Text Generation • 2B • Updated 4 days ago • 350
amd/Mistral-7B-Instruct-v0.3-w4a16-llmcompressor-v0.10.0.2

Text Generation • 7B • Updated 4 days ago • 22

zentorch TorchAO Quantized Models - PyTorch 2.10

TorchAO quantized models for AMD EPYC CPU inference. The inference stack includes vLLM (0.15.0 to 0.18.0), PyTorch 2.10, and zentorch 5.2.1.

amd/Llama-3.1-8B-Instruct-da8w8-torchao-v0.16.0

Text Generation • Updated Apr 30 • 1.58k • 1
amd/Qwen2.5-VL-7B-Instruct-da8w8-torchao-v0.16.0

Image-Text-to-Text • Updated May 4 • 9
amd/Qwen3-14B-Instruct-da8w8-torchao-v0.16.0

Text Generation • Updated May 4 • 1.61k
amd/Phi-4-da8w8-torchao-v0.16.0

Text Generation • Updated May 4 • 1.85k

Ryzen AI 1.7.1 — NPU 16K

Ryzen AI 1.7.1 models supporting context length up to 16K

amd/CodeLlama-7b-Instruct-hf_rai_1.7.1_npu_16K

Text Generation • Updated Mar 31 • 10
amd/DeepSeek-R1-Distill-Llama-8B_rai_1.7.1_npu_16K

Text Generation • Updated Mar 31 • 12
amd/DeepSeek-R1-Distill-Qwen-1.5B_rai_1.7.1_npu_16K

Text Generation • Updated Mar 31 • 11
amd/DeepSeek-R1-Distill-Qwen-7B_rai_1.7.1_npu_16K

Text Generation • Updated Mar 31 • 15

Ryzen AI 1.7.1 — Hybrid

Ryzen AI 1.7.1 hybrid (NPU and GPU) execution models

amd/Phi-4-mini-instruct_rai_1.7.1_hybrid

Text Generation • Updated Mar 30
amd/CodeLlama-7b-Instruct-hf_rai_1.7.1_hybrid

Text Generation • Updated Mar 31
amd/DeepSeek-R1-Distill-Llama-8B_rai_1.7.1_hybrid

Text Generation • Updated Mar 31
amd/DeepSeek-R1-Distill-Qwen-1.5B_rai_1.7.1_hybrid

Text Generation • Updated Mar 31

LuminaSFT

amd/UltraChat200K-regenerated

Viewer • Updated Mar 2 • 207k • 44 • 1
amd/InstructGpt-TriviaQa

Viewer • Updated Mar 2 • 1.12M • 38
amd/InstructGpt-NaturalQa

Viewer • Updated Mar 2 • 1.3M • 36
amd/InstructGpt-educational

Viewer • Updated Mar 2 • 851k • 69

Micro-World

Action-controlled interactive world model.

amd/Micro-World-I2W

Updated Feb 5 • 13 • 3
amd/Micro-World-T2W

Updated Feb 5 • 10 • 7
amd/Micro-World-MC-Dataset

Viewer • Updated Feb 6 • 2k • 2.25k • 3

Ryzen-AI-1.7-NPU-LLM

List will be updated

amd/Gemma-3-4b-it-mm-onnx-ryzenai-npu

Updated Jan 21 • 2
amd/gpt-oss-20b-onnx-ryzenai-npu

Updated Dec 13, 2025 • 2
amd/Phi-4-mini-instruct-onnx-ryzenai-npu

Text Generation • Updated Jan 21
amd/Qwen2-1.5B-onnx-ryzenai-npu

Text Generation • Updated Oct 23, 2025 • 37 • 1

ReasonLite

amd/ReasonLite-0.6B

0.8B • Updated Jan 22 • 374 • 11
amd/ReasonLite-0.6B-Turbo

0.8B • Updated Jan 22 • 85 • 7
amd/ReasonLite-Dataset

Viewer • Updated Jan 22 • 6.16M • 330 • 13

Hummingbird

Hummingbird is a series of video generation models built on AMD Instinct™ GPUs, including text-to-video and image-to-video models.

amd/AMD-Hummingbird-T2V

Text-to-Video • Updated Mar 4, 2025 • 9
amd/AMD-Hummingbird-I2V

Updated Sep 8, 2025 • 9
amd/HummingbirdXT

Updated Feb 24 • 9

Ryzen-AI-1.6-NPU-LLM

amd/Qwen2-1.5B-onnx-ryzenai-npu

Text Generation • Updated Oct 23, 2025 • 37 • 1
amd/Mistral-7B-Instruct-v0.2-onnx-ryzenai-npu

Updated Oct 23, 2025 • 15
amd/Llama-2-7b-hf-onnx-ryzenai-npu

Text Generation • Updated Oct 8, 2025 • 14
amd/Qwen2-7B-onnx-ryzenai-npu

Text Generation • Updated Oct 23, 2025 • 31

Quark Quantized Auto Mixed Precision (AMP) Models

amd/Llama-2-70b-chat-hf-WMXFP4FP8-AMXFP4FP8-AMP-KVFP8

55B • Updated Sep 26, 2025 • 6
amd/Mixtral-8x7B-Instruct-v0.1-WMXFP4FP8-AMXFP4FP8-AMP-KVFP8

37B • Updated Nov 3, 2025 • 24
amd/Qwen3-8B-WMXFP4FP8-AMXFP4FP8-AMP-KVFP8

6B • Updated 3 days ago • 4.89k • 2
amd/gpt-oss-20b-MoE-Quant-W-MXFP4-A-FP8-KV-FP8

11B • Updated 3 days ago • 8.82k • 2

OGA_DML_8_6_2025

Models are quantized using quark-0.9, transformers-4.50.0, OGA-0.7.1, and ORT-1.21.1, followed by OGA-DML export.

amd/OGA_DML_Qwen_Qwen2.5-3B-Instruct

Text Generation • Updated Aug 8, 2025
amd/OGA_DML_Qwen_Qwen2.5-1.5B-Instruct

Text Generation • Updated Aug 8, 2025

Quark Quantized PTPC FP8 Models

PTPC models quantized by Quark

amd/Qwen3-30B-A3B-Thinking-2507-PTPC-FP8

31B • Updated Dec 24, 2025 • 13 • 1
amd/Qwen3-VL-235B-A22B-Instruct-ptpc

236B • Updated Dec 24, 2025 • 13
amd/DeepSeek-R1-0528-ptpc

671B • Updated Dec 24, 2025 • 5
amd/DeepSeek-R1-0528-MTP-PTPC-FP8

684B • Updated Nov 28, 2025 • 20

RyzenAI-1.5_LLM_NPU_Models

amd/Phi-3-mini-4k-instruct-awq-g128-int4-asym-bf16-onnx-ryzen-strix

Text Generation • Updated Aug 27, 2025 • 35 • 2
amd/Phi-3.5-mini-instruct-awq-g128-int4-asym-bf16-onnx-ryzen-strix

Text Generation • Updated Sep 16, 2025 • 20 • 3
amd/Mistral-7B-Instruct-v0.3-awq-g128-int4-asym-bf16-onnx-ryzen-strix

Updated Sep 16, 2025 • 7
amd/Qwen1.5-7B-Chat-awq-g128-int4-asym-bf16-onnx-ryzen-strix

Text Generation • Updated Jun 28, 2025 • 9 • 1

PARD

Official Model Parameters for "PARD: Accelerating LLM Inference with Low-Cost PARallel Draft Model Adaptation"

amd/PARD-Llama-3.2-1B

Text Generation • 1B • Updated 3 days ago • 23.8k • 2
amd/PARD-DeepSeek-R1-Distill-Qwen-1.5B

Text Generation • 2B • Updated May 19, 2025 • 73 • 2
amd/PARD-Qwen2.5-0.5B

Text Generation • 0.6B • Updated May 19, 2025 • 105
amd/PARD-Qwen3-0.6B

Text Generation • 0.8B • Updated 3 days ago • 5.54k • 2

Quark Quantized MXFP4 Models

amd/DeepSeek-R1-MXFP4

371B • Updated Apr 13 • 195k • 5
amd/DeepSeek-R1-MXFP4-ASQ

363B • Updated Nov 6, 2025 • 995 • 1
amd/DeepSeek-R1-0528-MXFP4

356B • Updated Feb 26 • 30.6k • 2
amd/DeepSeek-R1-0528-MXFP4-ASQ

342B • Updated Dec 12, 2025 • 10 • 1

AMDGPU OnnxGenAI

A collection of ONNX GenAI-compatible language models that run on AMD Ryzen™ GPUs and Radeon discrete GPUs

amd/Llama-2-7b-chat-hf-awq-g128-int4-onnx-directml

Updated Apr 8, 2025
amd/Llama-2-7b-hf-awq-g128-int4-onnx-directml

Updated Apr 10, 2025
amd/Llama-3.1-8B-awq-g128-int4-onnx-directml

Updated Jul 29, 2025
amd/Llama-3.1-8B-Instruct-awq-g128-int4-onnx-directml

Updated Jul 29, 2025

RyzenAI-1.4_LLM_Hybrid_Models

amd/Phi-3-mini-4k-instruct-awq-g128-int4-asym-fp16-onnx-hybrid

Text Generation • Updated Aug 27, 2025 • 19
amd/Phi-3.5-mini-instruct-awq-g128-int4-asym-fp16-onnx-hybrid

Text Generation • Updated Sep 16, 2025 • 13
amd/Mistral-7B-Instruct-v0.3-awq-g128-int4-asym-fp16-onnx-hybrid

Updated Sep 16, 2025 • 24
amd/Qwen1.5-7B-Chat-awq-g128-int4-asym-fp16-onnx-hybrid

Text Generation • Updated Sep 16, 2025 • 28

Instella ✨

Announcing Instella, a series of 3 billion parameter language models developed by AMD, trained from scratch on 128 Instinct MI300X GPUs.

amd/Instella-3B-Stage1

Text Generation • 3B • Updated Nov 14, 2025 • 20 • 13
amd/Instella-3B

Text Generation • 3B • Updated Nov 14, 2025 • 134 • 42
amd/Instella-3B-SFT

Text Generation • 3B • Updated Nov 14, 2025 • 25 • 11
amd/Instella-3B-Instruct

Text Generation • 3B • Updated Nov 14, 2025 • 905 • 59

AMD-RyzenAI-Deepseek-R1-Distill-Hybrid

amd/DeepSeek-R1-Distill-Llama-8B-awq-asym-uint4-g128-lmhead-onnx-hybrid

Updated Sep 16, 2025 • 40 • 1
amd/DeepSeek-R1-Distill-Qwen-1.5B-awq-asym-uint4-g128-lmhead-onnx-hybrid

Updated Jun 23, 2025 • 36 • 1
amd/DeepSeek-R1-Distill-Qwen-7B-awq-asym-uint4-g128-lmhead-onnx-hybrid

Updated Sep 16, 2025 • 32 • 4

RyzenAI-1.3_LLM_NPU_Models

Models quantized by Quark and prepared for the OGA-based NPU-only execution flow (Ryzen AI 1.3)

amd/Phi-3-mini-4k-instruct-awq-g128-int4-asym-bf16-onnx-ryzen-strix

Text Generation • Updated Aug 27, 2025 • 35 • 2
amd/Phi-3.5-mini-instruct-awq-g128-int4-asym-bf16-onnx-ryzen-strix

Text Generation • Updated Sep 16, 2025 • 20 • 3
amd/Mistral-7B-Instruct-v0.3-awq-g128-int4-asym-bf16-onnx-ryzen-strix

Updated Sep 16, 2025 • 7
amd/Qwen1.5-7B-Chat-awq-g128-int4-asym-bf16-onnx-ryzen-strix

Text Generation • Updated Jun 28, 2025 • 9 • 1

Nitro Diffusion 💥

Nitro Diffusion is a series of efficient text-to-image diffusion models built on AMD Instinct™ GPUs.

amd/Nitro-1-SD

Text-to-Image • Updated Jun 25, 2025 • 11 • 9
amd/Nitro-1-PixArt

Text-to-Image • Updated Jun 25, 2025 • 19 • 6
amd/Nitro-T-0.6B

Text-to-Image • Updated Jul 9, 2025 • 13 • 5
amd/Nitro-T-1.2B

Text-to-Image • Updated Jul 9, 2025 • 12 • 7

Quark Quantized ONNX LLMs for Ryzen AI 1.3 EA

ONNX Runtime generate() API based models quantized by Quark and optimized for Ryzen AI Strix Point NPU

amd/Qwen1.5-7B-Chat-awq-g128-int4-asym-bf16-onnx-ryzen-strix

Text Generation • Updated Jun 28, 2025 • 9 • 1
amd/Mistral-7B-Instruct-v0.3-awq-g128-int4-asym-bf16-onnx-ryzen-strix

Updated Sep 16, 2025 • 7
amd/Llama-2-7b-hf-awq-g128-int4-asym-bf16-onnx-ryzen-strix

Text Generation • Updated Jun 28, 2025 • 10
amd/Llama-3-8B-awq-g128-int4-asym-bf16-onnx-ryzen-strix

Text Generation • Updated Jun 28, 2025 • 17 • 2

Team members 478
