HA ARM quant

by EloyOn - opened 11 days ago

Discussion

EloyOn

11 days ago

You don't think this tiny model is worthy of being given the HA ARM treatment like the other ones?

I didn't try it but 1b must struggle to be coherent even at q8_0.

SicariusSicariiStuff

Owner 8 days ago

It's 1B, so even at FP16 it's not that coherent, so yeah, it wouldn't make much of a difference.
With 2day's hardware, even a mid tier CPU could probably run 3B model with ease.

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment