This model is Llemma-34b model used in the paper "An Empirical Analysis of Compute-Optimal Inference for Problem-Solving with Language Models". It's based on Llemma-34b and was further finetuned MetaMath with special format for reward. Each step starts with "Step" and ends with "\u043a\u0438".

Downloads last month
69
Safetensors
Model size
33.7B params
Tensor type
BF16
·
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for tkitsers/Llemma-metamath-34b

Quantizations
1 model

Collection including tkitsers/Llemma-metamath-34b