TheStageAI
/

Elastic-Llama-3.2-1B-Instruct

Text Generation

text2text-generation

Model card Files Files and versions Community

quazim commited on Apr 16

Commit

6532512

·

verified ·

1 Parent(s): 65cd873

Update README.md (#3)

- Update README.md (f9262a872715e77384bb7f5b3f7e716c4c326fb3)

Files changed (1) hide show

README.md +2 -2

README.md CHANGED Viewed

@@ -6,7 +6,7 @@ base_model_relation: quantized
 pipeline_tag: text2text-generation
 ---
-# Elastic model: Meta-Llama-3.1-8B-Instruct. Fastest and most flexible models for self-serving.
 Elastic models are the models produced by TheStage AI ANNA: Automated Neural Networks Accelerator. ANNA allows you to control model size, latency and quality with a simple slider movement. For each model, ANNA produces a series of optimized models:
@@ -94,7 +94,7 @@ To work with our models just run these lines in your terminal:
 ```shell
 pip install thestage
-pip install elastic_models==0.0.4\
  --index-url https://thestage.jfrog.io/artifactory/api/pypi/pypi-thestage-ai-production/simple\
  --extra-index-url https://pypi.nvidia.com\
  --extra-index-url https://pypi.org/simple

 pipeline_tag: text2text-generation
 ---
+# Elastic model: Llama-3.2-1B-Instruct. Fastest and most flexible models for self-serving.
 Elastic models are the models produced by TheStage AI ANNA: Automated Neural Networks Accelerator. ANNA allows you to control model size, latency and quality with a simple slider movement. For each model, ANNA produces a series of optimized models:
 ```shell
 pip install thestage
+pip install elastic_models[nvidia]
  --index-url https://thestage.jfrog.io/artifactory/api/pypi/pypi-thestage-ai-production/simple\
  --extra-index-url https://pypi.nvidia.com\
  --extra-index-url https://pypi.org/simple