korywat commited on
Commit
414310c
·
verified ·
1 Parent(s): 0b01465

Upload README.md with huggingface_hub

Browse files
Files changed (1) hide show
  1. README.md +1 -1
README.md CHANGED
@@ -41,7 +41,7 @@ More details on model performance across various devices, can be found
41
  - Decoding length: 4096
42
  - Use: Initiate conversation with prompt-processor and then token generator for subsequent iterations.
43
 
44
- | Model | Device | Chipset | Target Runtime | Response Rate (Tokens/Second) | Time To First Token Range (Seconds) | Tiny MMLU
45
  |---|---|---|---|---|---|---|
46
  | Mistral-7B-Instruct-v0_3 | Snapdragon 8 Elite QRD | Snapdragon® 8 Elite | QNN | 10.73 | 0.18 - 5.79 | 58.85% | Use Export Script |
47
 
 
41
  - Decoding length: 4096
42
  - Use: Initiate conversation with prompt-processor and then token generator for subsequent iterations.
43
 
44
+ | Model | Device | Chipset | Target Runtime | Response Rate (tokens per second) | Time To First Token (range, seconds) | Tiny MMLU
45
  |---|---|---|---|---|---|---|
46
  | Mistral-7B-Instruct-v0_3 | Snapdragon 8 Elite QRD | Snapdragon® 8 Elite | QNN | 10.73 | 0.18 - 5.79 | 58.85% | Use Export Script |
47