Mindkrypted's picture

Mindkrypted

mindkrypted

AI & ML interests

*.*

Recent Activity

liked a model 4 days ago
mistralai/Magistral-Small-2506
liked a model about 1 month ago
calcuis/wan-gguf
View all activity

Organizations

Skye Team's profile picture

mindkrypted's activity

upvoted an article 2 months ago
reacted to sr-rai's post with 🤗 2 months ago
view post
Post
2760
ExLlamaV3 is out. And it introduces EXL3 - a new SOTA quantization format!

"The conversion process is designed to be simple and efficient and requires only an input model (in HF format) and a target bitrate. By computing Hessians on the fly and thanks to a fused Viterbi kernel, the quantizer can convert a model in a single step, taking a couple of minutes for smaller models, up to a few hours for larger ones (70B+) (on a single RTX 4090 or equivalent GPU.)"

Repo: https://github.com/turboderp-org/exllamav3



  • 1 reply
·