In my use case, significantly worse than the 14B-DPO variant
#4
by cmp-nct - opened
Just wanted to share some feedback. I was testing the new variant and compared it to its smaller DPO-trained 14B sibling.
It produced significantly worse results: the output was less well written and followed the instructions less precisely.
The task was summarizing structured input into well-written prose, based on a set of instructions.
There are likely other tasks where this model performs better, but at this point I would not choose it.