IQ4_NL vs IQ4_XS vs Q4_K_S vs Q4_0

by CosmossG - opened 28 days ago

28 days ago

What is the difference between IQ4_NL(?), IQ4_XS (eXtra Small?), Q4_K_S and Q4_0? I understand how Imatrix quants are generally better than normal quants, but I am a little confused on the NL quant. Could you please clarify that for me? Thanks in advance!

CosmossG

28 days ago

Also, are these models better than the base GGUF's (e.g UnSloth's GGUF's) on these low quants for math/logic and general "smartness"? Also, as far as I know, these GGUF's also do not come with vision, right?

DavidAU

Owner 28 days ago

RE: NL
Non-linear ; this is an oddball quant which I find (like IQ4XS) good for creative use cases.
This is also used in the case of odd layers/sub-layers in a model which do not conform to standard sizes.

RE: Unsloth.
They should be the same.
Correct ; no vision.

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

Your need to confirm your account before you can post a new comment.

· Sign up or log in to comment