|
---
|
|
license: mit
|
|
base_model:
|
|
- deepseek-ai/DeepSeek-R1-Distill-Llama-8B
|
|
---
|
|
|
|
# amd/DeepSeek-R1-Distill-Llama-8B-awq-asym-uint4-g128-lmhead-onnx-hybrid
|
|
- ## Introduction
|
|
This model was prepared using the AMD Quark Quantization tool, followed by necessary post-processing.
|
|
|
|
- ## Quantization Strategy
|
|
- AWQ / Group 128 / Asymmetric / UINT4 Weights / FP16 activations
|
|
- Excluded Layers: None
|
|
-
|
|
- ## Quick Start
|
|
For quickstart, refer to [Ryzen AI doucmentation](https://ryzenai.docs.amd.com/en/latest/hybrid_oga.html)
|
|
|
|
#### Evaluation scores
|
|
The perplexity measurement is run on the wikitext-2-raw-v1 (raw data) dataset provided by Hugging Face. Perplexity score measured for prompt length 2k is 13.755.
|
|
|
|
#### License
|
|
Modifications copyright(c) 2024 Advanced Micro Devices,Inc. All rights reserved.
|
|
|
|
MIT License
|
|
|
|
Copyright (c) 2023 DeepSeek
|
|
|
|
Permission is hereby granted, free of charge, to any person obtaining a copy of this software and associated documentation files (the "Software"), to deal
|
|
in the Software without restriction, including without limitation the rights to use, copy, modify, merge, publish, distribute, sublicense, and/or sell
|
|
copies of the Software, and to permit persons to whom the Software is furnished to do so, subject to the following conditions:
|
|
|
|
The above copyright notice and this permission notice shall be included in all copies or substantial portions of the Software.
|
|
|
|
THE SOFTWARE IS PROVIDED "AS IS", WITHOUT WARRANTY OF ANY KIND, EXPRESS OR IMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF MERCHANTABILITY,
|
|
FITNESS FOR A PARTICULAR PURPOSE AND NONINFRINGEMENT. IN NO EVENT SHALL THE AUTHORS OR COPYRIGHT HOLDERS BE LIABLE FOR ANY CLAIM, DAMAGES OR OTHER
|
|
LIABILITY, WHETHER IN AN ACTION OF CONTRACT, TORT OR OTHERWISE, ARISING FROM,OUT OF OR IN CONNECTION WITH THE SOFTWARE OR THE USE OR OTHER DEALINGS IN THE
|
|
SOFTWARE.
|
|
|