shaowenchen
commited on
Commit
·
5a6f6b3
1
Parent(s):
6295216
Update README.md
Browse files
README.md
CHANGED
|
@@ -38,6 +38,14 @@ tags:
|
|
| 38 |
| baichuan2-7b-base.Q8_0.gguf | Q8_0 | 7.4 GB |
|
| 39 |
| baichuan2-7b-base.gguf | full | 14 GB |
|
| 40 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 41 |
## Provided images
|
| 42 |
|
| 43 |
| Name | Quant method | Size |
|
|
|
|
| 38 |
| baichuan2-7b-base.Q8_0.gguf | Q8_0 | 7.4 GB |
|
| 39 |
| baichuan2-7b-base.gguf | full | 14 GB |
|
| 40 |
|
| 41 |
+
Usage:
|
| 42 |
+
|
| 43 |
+
```
|
| 44 |
+
docker run --rm -it -p 8000:8000 -v /path/to/models:/models -e MODEL=/models/gguf-model-name.gguf hubimage/llama-cpp-python:latest
|
| 45 |
+
```
|
| 46 |
+
|
| 47 |
+
and you can view http://localhost:8000/docs to see the swagger UI.
|
| 48 |
+
|
| 49 |
## Provided images
|
| 50 |
|
| 51 |
| Name | Quant method | Size |
|