Can you give an example on how to use gguf version or any inference GUI support it now?

#4
by DrNicefellow - opened

As title

VIDraft org


For example:

  1. Open https://huggingface.co/openfree/Gemma-3-R1984-27B-Q8_0-GGUF
  2. Click the "Deploy" button
  3. Click "HF Inference Endpoints"
  4. In the "Inference Endpoints" menu, choose a GPU server
  5. Click "Deploy" and start it running (a query sketch follows after these steps)
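
Once the endpoint shows as running, you can call it over HTTPS. This is only a minimal sketch, assuming the GGUF container exposes an OpenAI-compatible chat route (llama.cpp-based servers usually do); the endpoint URL and token below are placeholders you copy from the endpoint's overview page:

```python
import requests

# Placeholders: copy the real URL and token from your endpoint's overview page.
ENDPOINT_URL = "https://<your-endpoint>.endpoints.huggingface.cloud"
HF_TOKEN = "hf_..."

# Assumption: the container serves an OpenAI-compatible /v1/chat/completions route.
resp = requests.post(
    f"{ENDPOINT_URL}/v1/chat/completions",
    headers={"Authorization": f"Bearer {HF_TOKEN}", "Content-Type": "application/json"},
    json={
        "messages": [{"role": "user", "content": "Hello, who are you?"}],
        "max_tokens": 128,
    },
    timeout=120,
)
resp.raise_for_status()
print(resp.json()["choices"][0]["message"]["content"])
```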

or

"git clone ..... model or space path...." <- your server

or

  1. Open https://huggingface.co/spaces/VIDraft/Gemma-3-R1984-27B
  2. Click "Duplicate"
  3. In Files, open app.py, edit the code, and replace the model path with your chosen GGUF model path (a hypothetical sketch of this edit follows below)
  4. Commit, then wait for the Space to rebuild
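
The exact contents of the Space's app.py aren't shown here, so this is only a hypothetical illustration of what the edit in step 3 amounts to: wherever the script downloads and loads its GGUF file, point it at the repo and filename of the quant you chose. Assuming it uses huggingface_hub plus llama-cpp-python, the relevant lines might look like this:

```python
from huggingface_hub import hf_hub_download
from llama_cpp import Llama

# Hypothetical: swap these two values for your chosen GGUF repo and file.
GGUF_REPO = "openfree/Gemma-3-R1984-27B-Q8_0-GGUF"  # e.g. a different quant repo
GGUF_FILE = "gemma-3-r1984-27b-q8_0.gguf"           # check the repo's file list

model_path = hf_hub_download(repo_id=GGUF_REPO, filename=GGUF_FILE)
llm = Llama(model_path=model_path, n_ctx=4096)
```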


Thank you for the information! I will try them.

DrNicefellow changed discussion status to closed
