Upload README.md with huggingface_hub
Browse files
README.md
CHANGED
@@ -36,10 +36,10 @@ More details on model performance across various devices, can be found
|
|
36 |
|
37 |
| Model | Device | Chipset | Target Runtime | Inference Time (ms) | Peak Memory Range (MB) | Precision | Primary Compute Unit | Target Model
|
38 |
|---|---|---|---|---|---|---|---|---|
|
39 |
-
| ConvNext-Tiny | Samsung Galaxy S23 | Snapdragon® 8 Gen 2 | ONNX |
|
40 |
-
| ConvNext-Tiny | Samsung Galaxy S24 | Snapdragon® 8 Gen 3 | ONNX |
|
41 |
-
| ConvNext-Tiny | Snapdragon 8 Elite QRD | Snapdragon® 8 Elite | ONNX | 64.
|
42 |
-
| ConvNext-Tiny | Snapdragon X Elite CRD | Snapdragon® X Elite | ONNX |
|
43 |
|
44 |
|
45 |
|
@@ -103,8 +103,8 @@ Profiling Results
|
|
103 |
ConvNext-Tiny
|
104 |
Device : Samsung Galaxy S23 (13)
|
105 |
Runtime : ONNX
|
106 |
-
Estimated inference time (ms) :
|
107 |
-
Estimated peak memory usage (MB): [210,
|
108 |
Total # Ops : 469
|
109 |
Compute Unit(s) : NPU (429 ops) CPU (40 ops)
|
110 |
```
|
|
|
36 |
|
37 |
| Model | Device | Chipset | Target Runtime | Inference Time (ms) | Peak Memory Range (MB) | Precision | Primary Compute Unit | Target Model
|
38 |
|---|---|---|---|---|---|---|---|---|
|
39 |
+
| ConvNext-Tiny | Samsung Galaxy S23 | Snapdragon® 8 Gen 2 | ONNX | 94.235 ms | 210 - 363 MB | W8A16 | NPU | [ConvNext-Tiny-W8A16-Quantized.onnx](https://huggingface.co/qualcomm/ConvNext-Tiny-W8A16-Quantized/blob/main/ConvNext-Tiny.onnx) |
|
40 |
+
| ConvNext-Tiny | Samsung Galaxy S24 | Snapdragon® 8 Gen 3 | ONNX | 76.704 ms | 213 - 544 MB | W8A16 | NPU | [ConvNext-Tiny-W8A16-Quantized.onnx](https://huggingface.co/qualcomm/ConvNext-Tiny-W8A16-Quantized/blob/main/ConvNext-Tiny.onnx) |
|
41 |
+
| ConvNext-Tiny | Snapdragon 8 Elite QRD | Snapdragon® 8 Elite | ONNX | 64.091 ms | 218 - 520 MB | W8A16 | NPU | [ConvNext-Tiny-W8A16-Quantized.onnx](https://huggingface.co/qualcomm/ConvNext-Tiny-W8A16-Quantized/blob/main/ConvNext-Tiny.onnx) |
|
42 |
+
| ConvNext-Tiny | Snapdragon X Elite CRD | Snapdragon® X Elite | ONNX | 88.907 ms | 232 - 232 MB | W8A16 | NPU | [ConvNext-Tiny-W8A16-Quantized.onnx](https://huggingface.co/qualcomm/ConvNext-Tiny-W8A16-Quantized/blob/main/ConvNext-Tiny.onnx) |
|
43 |
|
44 |
|
45 |
|
|
|
103 |
ConvNext-Tiny
|
104 |
Device : Samsung Galaxy S23 (13)
|
105 |
Runtime : ONNX
|
106 |
+
Estimated inference time (ms) : 94.2
|
107 |
+
Estimated peak memory usage (MB): [210, 363]
|
108 |
Total # Ops : 469
|
109 |
Compute Unit(s) : NPU (429 ops) CPU (40 ops)
|
110 |
```
|