Datasets and models associated with the paper "Large-Scale Data Selection for Instruction Tuning" (https://arxiv.org/abs/2503.01807)
Hamish Ivison
hamishivi
AI & ML interests
NLP :)
Recent Activity
Updated a dataset 2 days ago: hamishivi/open_scholar_rl_no_refs
Updated a model 3 days ago: allenai/Llama-3.1-Tulu-3-405B-DPO
Updated a model 3 days ago: allenai/Llama-3.1-Tulu-3-70B-DPO
Models (37)
hamishivi/Qwen-2.5-7b-tokenizer • Text Generation • 63
hamishivi/general-verifier • Text Generation • 16
hamishivi/qwen2.5_orz_upload
hamishivi/s1k_seq_orig_hyper__42__1740446762 • 7
hamishivi/tulu_3_long_finetune_qwen_7b_reg_system_prompt • 12
hamishivi/tulu-2-wildchat-326k-sft • 9
hamishivi/tulu-2-arena-hard-326k-sft • 9
hamishivi/llama-3.1-tulu-3-arena-hard-939k-sft • 19
hamishivi/llama-3.1-tulu-3-multitask-rrmax-939k-sft • 14
hamishivi/tulu-2-multitask-rrmax-326k-sft • 10
Datasets (98)
hamishivi/open_scholar_rl_no_refs • 60.2k • 59
hamishivi/WebInstruct-verified-general-verifier-judge • 233k • 157
hamishivi/0505_tulu_3_rewritte_filtered_001_09 • 215k • 36
hamishivi/0505_tulu_3_rewritte_filtered_01_09 • 149k • 18
hamishivi/OpenThoughts2-1M • 1.2M • 279
hamishivi/orz_qwen2.5_filtered • 19.7k • 28
hamishivi/open_scholar_rl_no_prompt • 60.2k • 123
hamishivi/open_scholar_rl • 60.2k • 148
hamishivi/tulu_3_rewritten_400k_string_f1_only_v2 • 264k • 224
hamishivi/tulu_3_rewritten_400k_string_f1_only • 264k • 140