Requirements

#52 opened 12 months ago by
sneakybeaky

Finetuning llama2

#47 opened almost 2 years ago by
zuhashaik

Any example of batch inference?

#46 opened almost 2 years ago by
PrintScr

How to set max_split_size_mb?

1
#30 opened about 2 years ago by
neo-benjamin

max_position_embeddings = 2048?

1
#29 opened about 2 years ago by
zzzac

Load into 2 GPUs

3
#28 opened about 2 years ago by
sauravm8

Load model into TGI

#27 opened about 2 years ago by
schauppi

Perplexity

#22 opened about 2 years ago by
gsaivinay

70TB with multiple A5000

6
#21 opened about 2 years ago by
nashid

Inference error, tensor shapes.

8
#18 opened about 2 years ago by
alejandrofdz

Inference time with TGI

👍 2
1
#15 opened about 2 years ago by
jacktenyx

Can't launch with TGI

6
#14 opened about 2 years ago by
yekta

text-generation-inference error

7
#5 opened about 2 years ago by
msteele

Output always 0 tokens

11
#4 opened about 2 years ago by
sterogn