Failed to load model

by win10 - opened 1 day ago

1 day ago

Failed to load model
error loading model: error loading model architecture: unknown model architecture: 'glm4'
@bartowski

bartowski

Owner 1 day ago

I assume this is with LM Studio?

There should be a beta runtime out tonight or tomorrow that works.

ilintar

about 16 hours ago

@bartowski Will probably need to requantize, see: https://github.com/ggml-org/llama.cpp/pull/12957

I'm doing a quant on the PR version right now, going to see if it works.

bartowski

Owner about 14 hours ago

Please report back!

Want to leave these up for discussion, hate leaving them up cause people will download and have issues 😩😩

jacek2024

about 14 hours ago

See the latest comments here: https://github.com/ggml-org/llama.cpp/issues/12946

ilintar

about 14 hours ago

@bartowski my models are up here https://huggingface.co/ilintar/THUDM_GLM-Z1-9B-0414_iGGUF (just IQ4_NL, Q5_K_M and Q8 because my upload sucks) and they seem to work well with the llama.cpp branch from the pull request as well. There were changes both in the core and in the converter, so I think you need both (the new quants and the new llama.cpp engine) to run it.
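For anyone who wants to see which architecture string a given GGUF file declares (the value the loader rejects in the error at the top of this thread), here is a minimal sketch that reads the `general.architecture` metadata key straight from the file header. It assumes a little-endian GGUF file of version 2 or 3; the function name `read_gguf_architecture` and the file path in the usage line are made up for illustration and are not part of llama.cpp. It only tells you what the file claims to be, not whether the quant was produced by the fixed converter.

```python
import struct

GGUF_MAGIC = b"GGUF"

# GGUF metadata value type ids mapped to struct formats (fixed-size scalars only).
_SCALARS = {
    0: "<B", 1: "<b", 2: "<H", 3: "<h", 4: "<I", 5: "<i",
    6: "<f", 7: "<?", 10: "<Q", 11: "<q", 12: "<d",
}
_STRING, _ARRAY = 8, 9


def _read(f, fmt):
    """Read one scalar described by a struct format string."""
    size = struct.calcsize(fmt)
    data = f.read(size)
    if len(data) != size:
        raise ValueError("unexpected end of file")
    return struct.unpack(fmt, data)[0]


def _read_string(f):
    """Read a GGUF string: uint64 byte length followed by UTF-8 bytes."""
    length = _read(f, "<Q")
    data = f.read(length)
    if len(data) != length:
        raise ValueError("unexpected end of file")
    return data.decode("utf-8")


def _read_value(f, vtype):
    """Read one metadata value of the given type id."""
    if vtype in _SCALARS:
        return _read(f, _SCALARS[vtype])
    if vtype == _STRING:
        return _read_string(f)
    if vtype == _ARRAY:
        item_type = _read(f, "<I")
        count = _read(f, "<Q")
        return [_read_value(f, item_type) for _ in range(count)]
    raise ValueError(f"unknown GGUF value type: {vtype}")


def read_gguf_architecture(path):
    """Return the 'general.architecture' string, or None if the key is absent."""
    with open(path, "rb") as f:
        if f.read(4) != GGUF_MAGIC:
            raise ValueError("not a GGUF file")
        version = _read(f, "<I")
        if version < 2:
            # Assumption: version 1 used a different length encoding; not handled here.
            raise ValueError(f"unsupported GGUF version: {version}")
        _read(f, "<Q")  # tensor count, not needed here
        kv_count = _read(f, "<Q")
        for _ in range(kv_count):
            key = _read_string(f)
            vtype = _read(f, "<I")
            value = _read_value(f, vtype)
            if key == "general.architecture":
                return value
    return None
```

Usage would look like `print(read_gguf_architecture("model.gguf"))` with your own file path. If the printed name is one your runtime does not know, you get the "unknown model architecture" error regardless of the quant type.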

bartowski

Owner about 13 hours ago

ah gotcha, shame it requires both, but thanks for confirming!

I want to get on these ASAP but I'm also away from my PC so it's not easy for me to correct if something goes wrong again, so for now I think I'll wait until it's finalized and merged

ilintar

about 13 hours ago

Yeah, makes sense esp. since there is still some problem with the /props endpoint and I don't know if that's just a backend issue or if something's still wrong with the conversion, so no use reuploading only to have to pull it yet again :>
