satreysa commited on
Commit
c0c02ce
·
verified ·
1 Parent(s): 3355695

Readme update

Browse files
Files changed (1) hide show
  1. README.md +37 -69
README.md CHANGED
@@ -1,69 +1,37 @@
1
- ---
2
- license: mit
3
- language:
4
- - en
5
- base_model:
6
- - deepseek-ai/DeepSeek-R1-Distill-Llama-8B
7
- pipeline_tag: text-generation
8
- ---
9
-
10
- # DeepSeek-R1-Distill-Llama-8B-awq-asym-uint4-g128-lmhead-onnx-hybrid
11
- - ## Introduction
12
- This model was created by applying [Quark](https://quark.docs.amd.com/latest/index.html) with calibration samples from Pile dataset.
13
- - ## Quantization Strategy
14
- - ***Quantized Layers***: All linear layers
15
- - ***Weight***: uint4 asymmetric per-group, group_size=128
16
- - ## Quick Start
17
- 1. [Download and install Quark](https://quark.docs.amd.com/latest/install.html)
18
- 2. Run the quantization script in the example folder using the following command line:
19
- ```sh
20
- export MODEL_DIR = [local model checkpoint folder] or DeepSeek-R1-Distill-Llama-8B
21
- # single GPU
22
- python quantize_quark.py --model_dir $MODEL_DIR \
23
- --output_dir output_dir $MODEL_NAME-awq-asym-uint4-g128-lmhead \
24
- --quant_scheme w_uint4_per_group_asym \
25
- --num_calib_data 128 \
26
- --quant_algo awq \
27
- --dataset pileval_for_awq_benchmark \
28
- --seq_len 512 \
29
- --model_export hf_format \
30
- --data_type bfloat16 \
31
- --exclude_layers
32
- # cpu
33
- python quantize_quark.py --model_dir $MODEL_DIR \
34
- --output_dir output_dir $MODEL_NAME-awq-asym-uint4-g128-lmhead \
35
- --quant_scheme w_uint4_per_group_asym \
36
- --num_calib_data 128 \
37
- --quant_algo awq \
38
- --dataset pileval_for_awq_benchmark \
39
- --seq_len 512 \
40
- --model_export hf_format \
41
- --data_type bfloat16 \
42
- --exclude_layers \
43
- --device cpu
44
- ```
45
-
46
- #### License
47
- Modifications copyright(c) 2024 Advanced Micro Devices,Inc. All rights reserved.
48
-
49
- MIT License
50
-
51
- Copyright (c) 2023 DeepSeek
52
-
53
- Permission is hereby granted, free of charge, to any person obtaining a copy
54
- of this software and associated documentation files (the "Software"), to deal
55
- in the Software without restriction, including without limitation the rights
56
- to use, copy, modify, merge, publish, distribute, sublicense, and/or sell
57
- copies of the Software, and to permit persons to whom the Software is
58
- furnished to do so, subject to the following conditions:
59
-
60
- The above copyright notice and this permission notice shall be included in all
61
- copies or substantial portions of the Software.
62
-
63
- THE SOFTWARE IS PROVIDED "AS IS", WITHOUT WARRANTY OF ANY KIND, EXPRESS OR
64
- IMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF MERCHANTABILITY,
65
- FITNESS FOR A PARTICULAR PURPOSE AND NONINFRINGEMENT. IN NO EVENT SHALL THE
66
- AUTHORS OR COPYRIGHT HOLDERS BE LIABLE FOR ANY CLAIM, DAMAGES OR OTHER
67
- LIABILITY, WHETHER IN AN ACTION OF CONTRACT, TORT OR OTHERWISE, ARISING FROM,
68
- OUT OF OR IN CONNECTION WITH THE SOFTWARE OR THE USE OR OTHER DEALINGS IN THE
69
- SOFTWARE.
 
1
+ ---
2
+ license: mit
3
+ base_model:
4
+ - deepseek-ai/DeepSeek-R1-Distill-Llama-8B
5
+ ---
6
+
7
+ # amd/DeepSeek-R1-Distill-Llama-8B-awq-asym-uint4-g128-lmhead-onnx-hybrid
8
+ - ## Introduction
9
+ This model was prepared using the AMD Quark Quantization tool, followed by necessary post-processing.
10
+
11
+ - ## Quantization Strategy
12
+ - AWQ / Group 128 / Asymmetric / UINT4 Weights / FP16 activations
13
+ - Excluded Layers: None
14
+ -
15
+ - ## Quick Start
16
+ For quickstart, refer to [Ryzen AI doucmentation](https://ryzenai.docs.amd.com/en/latest/hybrid_oga.html)
17
+
18
+ #### Evaluation scores
19
+ The perplexity measurement is run on the wikitext-2-raw-v1 (raw data) dataset provided by Hugging Face. Perplexity score measured for prompt length 2k is 13.755.
20
+
21
+ #### License
22
+ Modifications copyright(c) 2024 Advanced Micro Devices,Inc. All rights reserved.
23
+
24
+ MIT License
25
+
26
+ Copyright (c) 2023 DeepSeek
27
+
28
+ Permission is hereby granted, free of charge, to any person obtaining a copy of this software and associated documentation files (the "Software"), to deal
29
+ in the Software without restriction, including without limitation the rights to use, copy, modify, merge, publish, distribute, sublicense, and/or sell
30
+ copies of the Software, and to permit persons to whom the Software is furnished to do so, subject to the following conditions:
31
+
32
+ The above copyright notice and this permission notice shall be included in all copies or substantial portions of the Software.
33
+
34
+ THE SOFTWARE IS PROVIDED "AS IS", WITHOUT WARRANTY OF ANY KIND, EXPRESS OR IMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF MERCHANTABILITY,
35
+ FITNESS FOR A PARTICULAR PURPOSE AND NONINFRINGEMENT. IN NO EVENT SHALL THE AUTHORS OR COPYRIGHT HOLDERS BE LIABLE FOR ANY CLAIM, DAMAGES OR OTHER
36
+ LIABILITY, WHETHER IN AN ACTION OF CONTRACT, TORT OR OTHERWISE, ARISING FROM,OUT OF OR IN CONNECTION WITH THE SOFTWARE OR THE USE OR OTHER DEALINGS IN THE
37
+ SOFTWARE.