Dataset Details
Dataset Description
This dataset is the training data for rt
from Scaling Reasoning can Improve Factuality in Large Language Models. The amount of data is around 7K rows.
- Curated by: Mike Zhang
- Funded by [optional]: Villum Fonden
- Language(s) (NLP): English
- License: Apache 2.0
Dataset Sources [optional]
- Repository: https://huggingface.co/datasets/AAU-NLP/fs1-predictions
- Paper [optional]: https://huggingface.co/papers/2505.11140
- Github: https://github.com/jjzha/fs1
Uses
One can use these reasoning traces to fine-tune their models to induce more factual thinking.
Direct Use
Having reasoning models via simple scaling (Muennighoff et al., 2025).
Out-of-Scope Use
We only have QA in this dataset, no other domains like mathematical reasoning or puzzles.
Dataset Structure
We have the following features:
features:
- name: id
dtype: string
- name: question
dtype: string
- name: gold_answer
sequence: string
- name: model_answer
sequence: string
- name: model
dtype: string
- name: reasoning_trace
dtype: string
- name: model_attempt
dtype: string
- name: valid
dtype: int64
- name: text
dtype: string
- name: total_length
dtype: int64
- name: think_length
dtype: int64
- name: answer_length
dtype: int64
The part used for fine-tuning is text
where we pre-apply the chat template and also add a special tag for the <thinking>
block.
Dataset Creation
Source Data
The data comes from the datasets used in the paper.
Data Collection and Processing
We did no further pre-processing to the QA pairs.
Bias, Risks, and Limitations
Recommendations
Users should be made aware of the risks, biases and limitations of the dataset. Note that not every answer is correct, thus always double-check the answers from the model.
Citation
BibTeX:
@misc{zhang2025scalingreasoningimprovefactuality,
title={Scaling Reasoning can Improve Factuality in Large Language Models},
author={Mike Zhang and Johannes Bjerva and Russa Biswas},
year={2025},
eprint={2505.11140},
archivePrefix={arXiv},
primaryClass={cs.CL},
url={https://arxiv.org/abs/2505.11140},
}
- Downloads last month
- 5