Please make the q5_k_m quant - this would allow this model to run on 4gb vram graphics cards
· Sign up or log in to comment