Natural Reasoning LMs
Collection: LMs fine-tuned on natural reasoning by Facebook
HuggingFaceTB/SmolLM2-360M, fine-tuned on facebook/natural_reasoning for 10K steps on a single RTX 4060 using 4-bit quantization.
Evaluated using LightEval.
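
For readers who want to reproduce a similar setup, here is a minimal sketch of loading the base model in 4-bit. It assumes the `transformers` and `bitsandbytes` libraries; the card does not say which tooling or quantization settings were actually used, so the NF4 type, double quantization, and bfloat16 compute dtype below are assumptions. The helper name `load_base_model_4bit` is hypothetical, and the training loop itself is not shown.

```python
# Sketch only: the card says "4-bit quantization" but gives no further detail.
BASE_MODEL_ID = "HuggingFaceTB/SmolLM2-360M"

# Assumed settings (not stated in the card).
QUANT_KWARGS = {
    "load_in_4bit": True,
    "bnb_4bit_quant_type": "nf4",
    "bnb_4bit_use_double_quant": True,
}


def load_base_model_4bit(model_id: str = BASE_MODEL_ID):
    """Load tokenizer and model with 4-bit weights (needs a CUDA GPU and bitsandbytes)."""
    # Imported lazily so this file can be read and imported without the GPU stack installed.
    import torch
    from transformers import AutoModelForCausalLM, AutoTokenizer, BitsAndBytesConfig

    quant_config = BitsAndBytesConfig(
        bnb_4bit_compute_dtype=torch.bfloat16,  # assumption; float16 is also common
        **QUANT_KWARGS,
    )
    tokenizer = AutoTokenizer.from_pretrained(model_id)
    model = AutoModelForCausalLM.from_pretrained(
        model_id,
        quantization_config=quant_config,
        device_map="auto",
    )
    return tokenizer, model
```

Calling `load_base_model_4bit()` downloads the base checkpoint and returns a `(tokenizer, model)` pair. Weights quantized this way are usually kept frozen and trained through adapter layers, but whether this model was trained that way is not stated here.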
| Dataset | Baseline | Ours |
|---|---|---|
| CommonsenseQA | 19.5 | 20.2 |
| PIQA | 3.1 | 12.4 |
| Winogrande | 54.6 | 54.8 |
| HellaSwag | 21.7 | 25.6 |
| MMLU | 20.2 | 19.3 |
Note: Scores taken from here
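
To make the comparison easier to read, the short sketch below computes the per-benchmark change and an unweighted mean from the numbers in the table. The unweighted mean is an illustrative aggregate chosen here, not a metric reported by the card, and the helper names are hypothetical.

```python
# Scores copied from the table above: benchmark -> (baseline, ours).
SCORES = {
    "CommonsenseQA": (19.5, 20.2),
    "PIQA": (3.1, 12.4),
    "Winogrande": (54.6, 54.8),
    "HellaSwag": (21.7, 25.6),
    "MMLU": (20.2, 19.3),
}


def score_deltas(scores):
    """Return ours - baseline per benchmark, rounded to one decimal place."""
    return {name: round(ours - base, 1) for name, (base, ours) in scores.items()}


def unweighted_means(scores):
    """Return (mean baseline, mean ours), treating every benchmark equally."""
    n = len(scores)
    mean_base = sum(base for base, _ in scores.values()) / n
    mean_ours = sum(ours for _, ours in scores.values()) / n
    return mean_base, mean_ours


if __name__ == "__main__":
    for name, delta in score_deltas(SCORES).items():
        print(f"{name:>14}: {delta:+.1f}")
    mean_base, mean_ours = unweighted_means(SCORES)
    print(f"mean baseline: {mean_base:.2f}, mean ours: {mean_ours:.2f}")
```

On these numbers the fine-tuned model is ahead on four of the five benchmarks and slightly behind on MMLU (-0.9); most of the gain in the unweighted mean comes from PIQA (+9.3).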
Base model
HuggingFaceTB/SmolLM2-360M