robgreenberg3 committed on
Commit cd26f36 · verified · 1 Parent(s): dc7738d

Update README.md

Files changed (1)
  1. README.md +24 -5
README.md CHANGED
@@ -1,7 +1,4 @@
  ---
- tags:
- - fp8
- - vllm
  language:
  - en
  - de
@@ -11,9 +8,31 @@ language:
  - hi
  - es
  - th
+ base_model:
+ - meta-llama/Llama-3.1-8B-Instruct
  pipeline_tag: text-generation
- license: llama3.1
- base_model: meta-llama/Meta-Llama-3.1-8B-Instruct
+ tags:
+ - llama
+ - facebook
+ - meta
+ - llama-3
+ - int8
+ - vllm
+ - chat
+ - neuralmagic
+ - llmcompressor
+ - conversational
+ - 8-bit precision
+ - compressed-tensors
+ license: other
+ license_name: llama3.1
+ name: RedHatAI/Meta-Llama-3.1-8B-Instruct-quantized.w8a8
+ description: This model was obtained by quantizing the weights and activations of Meta-Llama-3.1-8B-Instruct to INT8 data type.
+ readme: https://huggingface.co/RedHatAI/Meta-Llama-3.1-8B-Instruct-quantized.w8a8/main/README.md
+ tasks:
+ - text-to-text
+ provider: Meta
+ license_link: https://github.com/meta-llama/llama-models/blob/main/models/llama3_1/LICENSE
  ---
  <h1 style="display: flex; align-items: center; gap: 10px; margin: 0;">
  Meta-Llama-3.1-8B-Instruct-FP8-dynamic