Collection of datasets and models for our paper "Whose Boat Does it Float? Improving Personalization in Preference Tuning via Inferred User Personas"
Nishant Balepur
nbalepur
AI & ML interests: NLP
Recent Activity
Updated a dataset about 8 hours ago: nbalepur/google-query-wellformedness
Published a dataset about 8 hours ago: nbalepur/google-query-wellformedness
Updated a dataset 1 day ago: nbalepur/MCQA_IWF
Collections: 2
Models: 8
nbalepur/Llama-3.1-8B-PT-DPO-Mnemonic
nbalepur/Llama-3.1-8B-PT-DPO-HHH
nbalepur/Llama-3.1-8B-PT-DPO-BeaverTails • Text Generation • 9
nbalepur/Llama-3.1-8B_copy_persona_False_Mnemonic_dpo_chosen • Text Generation • 8
nbalepur/Llama-3.1-8B_copy_persona_False_Safe_RLHF_dpo_chosen • Text Generation • 10
nbalepur/LLama-2-70b-Mnemonic-Tokenizer
nbalepur/LLama-2-70b-Mnemonic-SFT • Text Generation • 18
nbalepur/LLama-2-70b-Mnemonic-DPO • Text Generation • 17
Datasets: 101
nbalepur/google-query-wellformedness • 25.1k
nbalepur/MCQA_IWF • 321 • 32
nbalepur/BenchBench_test • 1.19k • 43
nbalepur/cheating-reasoners • 9.39k • 400
nbalepur/planorama_irt_swap_oneslope • 300 • 22
nbalepur/planorama_without_label_swap_fixed2 • 300 • 19
nbalepur/planorama_irt_swap_newslope • 300 • 19
nbalepur/planorama_without_label_swap_fixed • 300 • 22
nbalepur/planorama_irt_swap2 • 300 • 13
nbalepur/planorama_irt_swap • 300 • 18