distilled zephyr-7b over 44m-textbooks on textbook codex, 1 epoch, 0.3 dropout
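
The card does not say which distillation objective was used. As a rough illustration only, here is a minimal sketch of the generic soft-target formulation (KL divergence between temperature-softened teacher and student distributions). The function names, the temperature value, and the example logits are all hypothetical and are not taken from this model's training code.

```python
import math

def softmax(logits, temperature=1.0):
    # Temperature-scaled softmax, shifted by the max for numerical stability.
    scaled = [x / temperature for x in logits]
    m = max(scaled)
    exps = [math.exp(x - m) for x in scaled]
    total = sum(exps)
    return [e / total for e in exps]

def distillation_loss(teacher_logits, student_logits, temperature=2.0):
    # KL(teacher || student) on softened distributions, scaled by T^2.
    # The temperature of 2.0 is an assumed placeholder, not a reported setting.
    p = softmax(teacher_logits, temperature)
    q = softmax(student_logits, temperature)
    kl = sum(pi * math.log(pi / qi) for pi, qi in zip(p, q) if pi > 0)
    return kl * temperature ** 2

# Hypothetical next-token logits over a three-token vocabulary.
teacher = [2.0, 0.5, -1.0]
student = [1.5, 0.7, -0.5]
loss = distillation_loss(teacher, student)
print(loss)
```

The loss is zero when the student matches the teacher exactly and grows as the two distributions diverge.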

| avg | arc | hellaswag | mmlu | truthfulqa |
| --- | --- | --- | --- | --- |
| 30.16 | 22.4 | 25.54 | 23.11 | 49.59 |
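
The avg value is consistent with an unweighted mean of the four benchmark scores. A quick check, assuming that is how it was computed:

```python
# Benchmark scores as reported above.
scores = {"arc": 22.4, "hellaswag": 25.54, "mmlu": 23.11, "truthfulqa": 49.59}

# Unweighted mean over the four benchmarks, rounded to two decimals.
avg = round(sum(scores.values()) / len(scores), 2)
print(avg)
```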

Dataset used to train crumb/44m-Z