More QwQ?

#2
by mclassHF2023 - opened

I would be interested in merging in a bit more of the QwQ-preview and -abliterated models in the mix. It seems that for some logic puzzles, it doesn't perform nearly as good as more "QwQ-heavy" models.

Now, that might not have been the goal, and that's perfectly fine, I was just curious! :)

Do you have an example of the problem you are trying to solve and how you are instructing it's solution?

I refolded QwQ in again, with some other high quality models. Check out https://huggingface.co/maldv/Lytta2.5-32B-Instruct and let me know if it has better reasoning.

Hi, thanks for trying this out! Like you mention in its description, it is quite unhinged! :D
I will try it out some more, but I think it's more of a creative writing model and attempts of making it into a more "reasoning model" are probably wasted on it. But nevertheless it's an interesting result I think, I haven't seen any other model quite like it...

oh, and btw: one of the puzzles that I came up that even the best models like Gemini struggle with is this innocent looking question:

If you have one bucket that holds two gallons and another bucket that holds five gallons, how do you fill one of the buckets with exactly 4 gallons?

I'm actually quite disappointed in it. I don't think I'm going to be able to improve on Qwentile without doing some sort of preference optimization. Thanks for trying it though.

Sign up or log in to comment