THUDM
/

GLM-4-32B-0414

Text Generation

Model card Files Files and versions Community

Resources

View closed (1)

Fix template when add_generation_prompt=true

#14 opened about 11 hours ago by

matteogeniaccio

It supports Serbo-Croatian language very well!

#13 opened 1 day ago by

GPTQ or AWQ Quants

#12 opened 1 day ago by

Great job, thanks for this model.

#11 opened 3 days ago by

recommended sampling parameters?

#10 opened 4 days ago by

Can we have some more popular benchmarks

#8 opened 5 days ago by

The model is the best for coding.

#7 opened 8 days ago by

When running with a single GPU, I get an error saying the VRAM is insufficient. However, when using multiple GPUs on a single machine, there are many errors. My vllm version is 0.8.4.

#6 opened 8 days ago by

BitsAndBytes quantization inference error

#5 opened 9 days ago by

Some bug when using function call with vllm==0.8.4

#4 opened 9 days ago by

SimpleQA Scores Are WAY off

#3 opened 11 days ago by

Need fp8 version for inerface

#2 opened 11 days ago by

RuntimeError: CUDA error: device-side assert triggered

#1 opened 12 days ago by