New discussion

Gguf?

#12 opened 27 days ago by
AlgorithmicKing

Infinity usage

3
#9 opened 2 months ago by
michaelfeil

inference speed

#7 opened 3 months ago by
nilx21

Recommendations to for quantization?

1
#2 opened 7 months ago by deleted

About model_max_length

4
#1 opened 7 months ago by
hongwen11