Update README.md (#3)
Browse files- Update README.md (f9262a872715e77384bb7f5b3f7e716c4c326fb3)
README.md
CHANGED
@@ -6,7 +6,7 @@ base_model_relation: quantized
|
|
6 |
pipeline_tag: text2text-generation
|
7 |
---
|
8 |
|
9 |
-
# Elastic model:
|
10 |
|
11 |
Elastic models are the models produced by TheStage AI ANNA: Automated Neural Networks Accelerator. ANNA allows you to control model size, latency and quality with a simple slider movement. For each model, ANNA produces a series of optimized models:
|
12 |
|
@@ -94,7 +94,7 @@ To work with our models just run these lines in your terminal:
|
|
94 |
|
95 |
```shell
|
96 |
pip install thestage
|
97 |
-
pip install elastic_models
|
98 |
--index-url https://thestage.jfrog.io/artifactory/api/pypi/pypi-thestage-ai-production/simple\
|
99 |
--extra-index-url https://pypi.nvidia.com\
|
100 |
--extra-index-url https://pypi.org/simple
|
|
|
6 |
pipeline_tag: text2text-generation
|
7 |
---
|
8 |
|
9 |
+
# Elastic model: Llama-3.2-1B-Instruct. Fastest and most flexible models for self-serving.
|
10 |
|
11 |
Elastic models are the models produced by TheStage AI ANNA: Automated Neural Networks Accelerator. ANNA allows you to control model size, latency and quality with a simple slider movement. For each model, ANNA produces a series of optimized models:
|
12 |
|
|
|
94 |
|
95 |
```shell
|
96 |
pip install thestage
|
97 |
+
pip install elastic_models[nvidia]
|
98 |
--index-url https://thestage.jfrog.io/artifactory/api/pypi/pypi-thestage-ai-production/simple\
|
99 |
--extra-index-url https://pypi.nvidia.com\
|
100 |
--extra-index-url https://pypi.org/simple
|