Thanks!
#1 opened by lightenup
Superficial testing (Python and JavaScript codegen, software-engineering practices) doesn't show any performance degradation compared to https://chat.z.ai/. It's a great quantization for 96 GB VRAM!
Thanks, great results on a Blackwell 96 GB GPU: averaging 80-90 t/s with a 128k context size. Finally, Sonnet at home!
Echoing this thanks. This model and quant are great. Any chance you might also do the 4.5V model that was just released?
Absolutely
we are working on it. Stay tuned!