Update README.md
README.md CHANGED
@@ -7,7 +7,7 @@ sdk: static
 pinned: false
 ---
 
-Text-Generation-Inference is
+Text-Generation-Inference is a solution built for deploying and serving Large Language Models (LLMs). TGI enables high-performance text generation using Tensor Parallelism and dynamic batching for the most popular open-source LLMs, including StarCoder, BLOOM, GPT-NeoX, Llama, and T5. Text Generation Inference is already used by customers such as IBM, Grammarly, and the Open-Assistant initiative, and it implements optimizations for all supported model architectures, including:
 
 - Tensor Parallelism and custom CUDA kernels
 - Optimized transformers code for inference using flash-attention and Paged Attention on the most popular architectures
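
As a quick illustration of the serving workflow the new paragraph describes, here is a minimal sketch of querying a running TGI server with the official `text-generation` Python client (`pip install text-generation`). The server URL, prompt, and generation parameters are illustrative assumptions, not part of this commit; it assumes a TGI instance is already listening locally.

```python
# Minimal sketch: talking to a Text-Generation-Inference server with the
# official `text-generation` Python client. The URL and prompt below are
# assumptions for illustration; adjust them to your deployment.
from text_generation import Client

client = Client("http://127.0.0.1:8080")  # assumed local TGI endpoint

# Single-shot generation: returns the full completion in one response.
response = client.generate("What is tensor parallelism?", max_new_tokens=64)
print(response.generated_text)

# Streaming generation: TGI can also emit tokens as they are produced,
# which is how dynamic batching latency gains are usually consumed.
text = ""
for stream_response in client.generate_stream(
    "What is tensor parallelism?", max_new_tokens=64
):
    if not stream_response.token.special:
        text += stream_response.token.text
print(text)
```

Both calls hit the same HTTP API that TGI exposes for all the architectures listed above; the streaming variant is typically preferred for interactive use since tokens arrive as soon as the batcher schedules them.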