Kudzu-8B

Fresh out of the mergekit-evolve kitchen, this is a merge model between:

Used wmdp as the scoring method for evolve. In my limited testing, it has not done the usual Llama-3 "Ahaha!" interjections while retaining a good portion of the intelligence. There are several ablated models in the mix so don't be surprised if it gives you what you ask for.

Downloads last month
8
Safetensors
Model size
8.03B params
Tensor type
BF16
Β·
Inference Providers NEW
This model is not currently available via any of the supported Inference Providers.

Model tree for lodrick-the-lafted/Kudzu-8B

Spaces using lodrick-the-lafted/Kudzu-8B 6