---
library_name: peft
base_model: mistralai/Mistral-7B-v0.1
---
# OVM generator
### Model Description
This model follows the approach described in the paper [Outcome-supervised Verifiers for Planning in Mathematical Reasoning](https://arxiv.org/pdf/2311.09724v1.pdf) and is trained on the [Vi-GSM8K](https://huggingface.co/datasets/longhoang06/Vi-GSM8K) dataset.
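
Since the metadata lists `peft` as the library and `mistralai/Mistral-7B-v0.1` as the base model, the adapter can presumably be loaded on top of that base model. The snippet below is a minimal sketch under that assumption, not the author's documented usage. `ADAPTER_ID` is a hypothetical placeholder for this repository's id, the helper names `load_generator` and `generate_solution` are illustrative, and passing the raw question as the prompt is an assumption because the card does not specify a prompt format.

```python
BASE_MODEL_ID = "mistralai/Mistral-7B-v0.1"
# Hypothetical placeholder: replace with the actual id of this adapter repository.
ADAPTER_ID = "your-username/ovm-generator"


def load_generator(adapter_id=ADAPTER_ID, base_model_id=BASE_MODEL_ID):
    """Load the base model and attach the PEFT adapter weights to it."""
    # Imported here so the heavy dependencies are only needed when loading.
    from peft import PeftModel
    from transformers import AutoModelForCausalLM, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(base_model_id)
    base_model = AutoModelForCausalLM.from_pretrained(base_model_id)
    model = PeftModel.from_pretrained(base_model, adapter_id)
    model.eval()
    return tokenizer, model


def generate_solution(tokenizer, model, question, max_new_tokens=256):
    """Generate a continuation for a question and return only the new text.

    Assumption: the raw question is used as the prompt, since the card
    does not document the prompt template used during training.
    """
    inputs = tokenizer(question, return_tensors="pt").to(model.device)
    output_ids = model.generate(**inputs, max_new_tokens=max_new_tokens)
    # Drop the prompt tokens so that only the generated part is decoded.
    prompt_length = inputs["input_ids"].shape[1]
    new_tokens = output_ids[0][prompt_length:]
    return tokenizer.decode(new_tokens, skip_special_tokens=True)
```

To try it, call `tokenizer, model = load_generator()` once and then `generate_solution(tokenizer, model, question)` for each problem. Loading a 7B model in full precision needs substantial memory, so a reduced-precision or quantized load may be preferable in practice.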