satreysa committed
Commit 999358c · verified · 1 Parent(s): 81713b8

Readme update

Files changed (1)
  1. README.md +36 -46
README.md CHANGED
@@ -1,46 +1,36 @@
- ---
- base_model:
- - mistralai/Mistral-7B-Instruct-v0.1
- ---
-
-
- Quark 0.6.0
-
- ```
- python3 quantize_quark.py
- --model_dir "meta-llama/Llama-2-7b-chat-hf"
- --output_dir <quantized safetensor output dir>
- --quant_scheme w_uint4_per_group_asym
- --num_calib_data 128
- --quant_algo awq
- --dataset pileval_for_awq_benchmark
- --seq_len 512
- --model_export quark_safetensors
- --data_type float16
- --exclude_layers []
- --custom_mode awq
- ```
-
- Model Builder v0.5.1
-
- ```
- cd onnxruntime-genai/src/python/py/models
-
- python builder.py \
- -i <quantized safetensor model dir> \
- -o <oga model output dir> \
- -p int4 \
- -e dml
- ```
-
- Hybrid Package: https://gitenterprise.xilinx.com/VitisAI/hybrid-llm/actions/runs/643176
-
- Performance
- -
-
- HP OmniBook Ultra Laptop 14-fd0xxx
-
- AMD Ryzen AI 9 365 w/ Radeon 880M (Performance Mode)
-
-
- ![image/png](https://cdn-uploads.huggingface.co/production/uploads/65ff7616871b36bf84150cda/hkHF25WNdzsZQtC0S2HH3.png)
 
+ ---
+ license: apache-2.0
+ base_model:
+ - mistralai/Mistral-7B-Instruct-v0.1
+ ---
+
+ # amd/Mistral-7B-Instruct-v0.1-hybrid
+ - ## Introduction
+ This model was prepared using the AMD Quark Quantization tool, followed by necessary post-processing.
+
+ - ## Quantization Strategy
+ - AWQ / Group 128 / Asymmetric / UINT4 Weights / FP16 activations
+ - Excluded Layers: None
+ -
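The "Group 128 / Asymmetric / UINT4 Weights" part of the strategy above can be illustrated with a minimal NumPy sketch. This is an assumption-laden illustration of the storage format only: it uses plain round-to-nearest, does not perform AWQ's activation-aware scale search, and is not Quark's implementation. The function names `quantize_uint4_per_group_asym` and `dequantize` are hypothetical.

```python
import numpy as np


def quantize_uint4_per_group_asym(w, group_size=128):
    """Round-to-nearest asymmetric UINT4 quantization.

    Hypothetical helper: one (scale, zero point) pair per group of
    `group_size` consecutive input channels. Not Quark's API.
    """
    out_features, in_features = w.shape
    assert in_features % group_size == 0, "in_features must be a multiple of group_size"
    groups = w.astype(np.float32).reshape(
        out_features, in_features // group_size, group_size
    )
    # Extend each group's range to include 0 so the zero point fits in [0, 15].
    w_min = np.minimum(groups.min(axis=-1, keepdims=True), 0.0)
    w_max = np.maximum(groups.max(axis=-1, keepdims=True), 0.0)
    # 4 bits give 16 levels (0..15); guard against all-zero groups.
    scale = np.maximum(w_max - w_min, 1e-8) / 15.0
    zero_point = np.round(-w_min / scale)
    q = np.clip(np.round(groups / scale) + zero_point, 0, 15).astype(np.uint8)
    return q, scale, zero_point


def dequantize(q, scale, zero_point, shape):
    """Map UINT4 codes back to float32 weights of the original shape."""
    return ((q.astype(np.float32) - zero_point) * scale).reshape(shape)


# Small demo on random weights (illustrative data, not model weights).
rng = np.random.default_rng(0)
w = rng.standard_normal((4, 256)).astype(np.float32)
q, scale, zero_point = quantize_uint4_per_group_asym(w)
w_hat = dequantize(q, scale, zero_point, w.shape)
print("max abs reconstruction error:", float(np.abs(w_hat - w).max()))
```

Smaller groups give each scale a narrower range to cover, which lowers the rounding error at the cost of storing more scales and zero points.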
+ - ## Quick Start
+ For a quick start, refer to the [Ryzen AI documentation](https://ryzenai.docs.amd.com/en/latest/hybrid_oga.html).
+
+ #### Evaluation scores
+ Perplexity was measured on the wikitext-2-raw-v1 (raw data) dataset provided by Hugging Face. The perplexity score for a prompt length of 2k is 7.063.
+
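For readers unfamiliar with the metric, perplexity is the exponential of the mean negative log-likelihood per token. The sketch below shows only that arithmetic. It assumes per-token natural-log probabilities are already available and does not reproduce the evaluation script used for the score above; how the 2k-token prompts were formed is not stated in the README. The function name `perplexity` is hypothetical.

```python
import math


def perplexity(token_log_probs):
    """Perplexity = exp(mean negative natural-log likelihood per token).

    `token_log_probs` is a hypothetical list of log p(token | context)
    values, one per evaluated token.
    """
    if not token_log_probs:
        raise ValueError("need at least one token log-probability")
    mean_nll = -sum(token_log_probs) / len(token_log_probs)
    return math.exp(mean_nll)


# A model that assigns probability 1/4 to every token has perplexity 4.
print(perplexity([math.log(0.25)] * 8))
```

Under this definition, a score of 7.063 corresponds to a mean negative log-likelihood of ln(7.063) nats per token.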
+ #### License
+ Modifications copyright (c) 2024 Advanced Micro Devices, Inc. All rights reserved.
+
+ MIT License
+
+ Copyright (c) 2024 Advanced Micro Devices, Inc
+
+ Permission is hereby granted, free of charge, to any person obtaining a copy of this software and associated documentation files (the "Software"), to deal
+ in the Software without restriction, including without limitation the rights to use, copy, modify, merge, publish, distribute, sublicense, and/or sell
+ copies of the Software, and to permit persons to whom the Software is furnished to do so, subject to the following conditions:
+ The above copyright notice and this permission notice shall be included in all copies or substantial portions of the Software.
+
+ THE SOFTWARE IS PROVIDED "AS IS", WITHOUT WARRANTY OF ANY KIND, EXPRESS OR IMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF MERCHANTABILITY,
+ FITNESS FOR A PARTICULAR PURPOSE AND NONINFRINGEMENT. IN NO EVENT SHALL THE AUTHORS OR COPYRIGHT HOLDERS BE LIABLE FOR ANY CLAIM, DAMAGES OR OTHER
+ LIABILITY, WHETHER IN AN ACTION OF CONTRACT, TORT OR OTHERWISE, ARISING FROM, OUT OF OR IN CONNECTION WITH THE SOFTWARE OR THE USE OR OTHER DEALINGS IN THE
+ SOFTWARE.