Update README.md
README.md
```diff
@@ -13,13 +13,13 @@ language:
 - th
 pipeline_tag: text-generation
 license: apache-2.0
-base_model: Qwen/Qwen3-32B
+base_model: Qwen/Qwen3-8B
 ---
 
 # Qwen3-32B-NVFP4A16
 
 ## Model Overview
-- **Model Architecture:** Qwen/Qwen3-32B
+- **Model Architecture:** Qwen/Qwen3-8B
 - **Input:** Text
 - **Output:** Text
 - **Model Optimizations:**
@@ -28,14 +28,14 @@ base_model: Qwen/Qwen3-32B
 - **Out-of-scope:** Use in any manner that violates applicable laws or regulations (including trade compliance laws). Use in languages other than English.
 - **Release Date:** 6/25/2025
 - **Version:** 1.0
-- **Model Developers:** RedHatAI
+- **Model Developers:** ELVISIO (Thanks to RedHatAI)
 
-This model is a quantized version of [Qwen/Qwen3-32B](https://huggingface.co/Qwen/Qwen3-32B).
+This model is a quantized version of [Qwen/Qwen3-8B](https://huggingface.co/Qwen/Qwen3-8B).
 It was evaluated on several tasks to assess its quality in comparison to the unquantized model.
 
 ### Model Optimizations
 
-This model was obtained by quantizing the weights of [Qwen/Qwen3-32B](https://huggingface.co/Qwen/Qwen3-32B) to FP4 data type, ready for inference with vLLM>=0.9.1.
+This model was obtained by quantizing the weights of [Qwen/Qwen3-8B](https://huggingface.co/Qwen/Qwen3-8B) to FP4 data type, ready for inference with vLLM>=0.9.1.
 This optimization reduces the number of bits per parameter from 16 to 4, reducing the disk size and GPU memory requirements by approximately 75%.
 
 Only the weights of the linear operators within transformers blocks are quantized using [LLM Compressor](https://github.com/vllm-project/llm-compressor).
```
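
The card states the checkpoint is ready for inference with vLLM. A minimal deployment sketch follows; the repository id is a hypothetical reconstruction from the card's title and developer field (adjust to the actual repo), and the prompt and sampling settings are only illustrative.

```python
from vllm import LLM, SamplingParams

# Hypothetical repo id, assumed from the card's title and developer field.
model_id = "ELVISIO/Qwen3-32B-NVFP4A16"

# vLLM reads the compressed-tensors quantization config from the checkpoint,
# so no extra quantization flags are needed here.
llm = LLM(model=model_id)

sampling_params = SamplingParams(temperature=0.6, top_p=0.95, max_tokens=256)

outputs = llm.generate(
    ["Give me a short introduction to large language models."],
    sampling_params,
)
print(outputs[0].outputs[0].text)
```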
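
Similarly, here is a minimal sketch of how a weight-only NVFP4 checkpoint like this one is typically produced with [LLM Compressor](https://github.com/vllm-project/llm-compressor), assuming the `NVFP4A16` preset scheme and a data-free oneshot pass. It follows the library's documented flow, not necessarily the commit author's exact script, and the output directory name is a placeholder.

```python
from transformers import AutoModelForCausalLM, AutoTokenizer
from llmcompressor import oneshot
from llmcompressor.modifiers.quantization import QuantizationModifier

# Base checkpoint named in the card's frontmatter after this commit.
model_id = "Qwen/Qwen3-8B"
model = AutoModelForCausalLM.from_pretrained(model_id, torch_dtype="auto")
tokenizer = AutoTokenizer.from_pretrained(model_id)

# Quantize only the linear operators inside the transformer blocks to NVFP4
# weights (activations stay 16-bit); keep the output head in full precision.
recipe = QuantizationModifier(targets="Linear", scheme="NVFP4A16", ignore=["lm_head"])

# Weight-only quantization needs no calibration dataset here.
oneshot(model=model, recipe=recipe)

# Save in compressed-tensors format so vLLM can load it directly.
save_dir = "Qwen3-8B-NVFP4A16"  # placeholder output path
model.save_pretrained(save_dir, save_compressed=True)
tokenizer.save_pretrained(save_dir)
```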