It seems like model have serious repetition issues (both gguf and on openrouter)

#8
by roadtoagi - opened

Dry may help, but it will kill reasoning mode...

and in reasoning mode, model starts to repeat itself from second-third chat round.

Using recommended settings.

With some more testing even dry doesn't really help. Deepseek is just much better.

Unsloth AI org

Apologies @roadtoagi which quant did you use, the old ones may have to be deprecated due not being compatible with imatrix quantization

We deleted the ones that are wrong and only left the ones that work

I used Q2_K, but the problem isn't in quants, but in model itself. Openrouter has same issues.

Unsloth AI org

I used Q2_K, but the problem isn't in quants, but in model itself. Openrouter has same issues.

I think it was a chat template issue. We just fixed them 2 hours ago. would you mind checking them again?

I tried it on openrouter again after a while and it seems like situiation really improved a lot after bug fixes. Will try new gguf some time.

It went from repetition nightmare to being actually really smart? I think it's now on par with deepseek.

Unsloth AI org

I tried it on openrouter again after a while and it seems like situiation really improved a lot after bug fixes. Will try new gguf some time.

It went from repetition nightmare to being actually really smart? I think it's now on par with deepseek.

Incredible! :D

Sign up or log in to comment