GGUF?
Is there a way we can use it in LMStudio as GGUF quants?
I'll try to make it work; maybe the recent changes in llama.cpp make it possible (;
Bad news: more work is needed for now.
I have managed to create a Q8_0 GGUF and an mmproj GGUF; now I need to test inference.
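For anyone following along, the conversion probably looks roughly like this. This is a hedged sketch, not the exact commands used: `<model-dir>` and the output filenames are placeholders, and the `--mmproj` flag for exporting just the multimodal projector is assumed from recent llama.cpp conversion scripts (availability may vary by checkout):

```shell
# Sketch of a Q8_0 + mmproj conversion with a recent llama.cpp checkout.
# <model-dir> is a placeholder for the downloaded Hugging Face model directory.

# Export the language-model weights as a Q8_0 GGUF:
python convert_hf_to_gguf.py <model-dir> --outtype q8_0 --outfile model-Q8_0.gguf

# Export the multimodal projector as a separate GGUF
# (--mmproj is assumed from recent llama.cpp versions):
python convert_hf_to_gguf.py <model-dir> --mmproj --outfile mmproj-model.gguf
```

Both files would then be loaded together at inference time (the main GGUF plus the mmproj GGUF), which is the usual pattern for multimodal models in llama.cpp-based runtimes.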
Is there a way we can use it in LMStudio as GGUF quants?
are you currently online?
Sorry, I was sleeping.
Ready to test stuff!
I created an FP8 version for vLLM inference; it should work on 16 GiB VRAM cards.
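For reference, serving such an FP8 build with vLLM could look like this. A minimal sketch, assuming a recent vLLM release and hardware with FP8 kernel support; `<org>/<model>` is a placeholder, not the actual checkpoint name:

```shell
# Sketch: two ways to run FP8 inference with vLLM.
# <org>/<model-fp8> and <org>/<model> are placeholders for real checkpoint names.

# If the checkpoint is already FP8-quantized, point vLLM directly at it:
vllm serve <org>/<model-fp8>

# Alternatively, quantize an unquantized checkpoint to FP8 at load time
# (requires hardware/kernel support, e.g. recent NVIDIA GPUs):
vllm serve <org>/<model> --quantization fp8
```

FP8 roughly halves the weight memory compared to FP16, which is why a model that would otherwise need ~2x the VRAM can fit on a 16 GiB card.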
Edit: Misread your post, never mind.
Didn't get it working yet; I'll need to implement support for that in llama.cpp. Whether that will succeed, idk xD