Collection of datasets and models for our paper "Whose Boat Does it Float? Improving Personalization in Preference Tuning via Inferred User Personas"
Nishant Balepur
nbalepur
AI & ML interests: NLP
Recent Activity
updated a dataset 5 days ago: nbalepur/planorama_irt_swap_oneslope
published a dataset 5 days ago: nbalepur/planorama_irt_swap_oneslope
updated a dataset 5 days ago: nbalepur/planorama_without_label_swap_fixed2
Collections: 2
models: 8

nbalepur/Llama-3.1-8B-PT-DPO-HHH
nbalepur/Llama-3.1-8B-PT-DPO-Mnemonic
nbalepur/Llama-3.1-8B-PT-DPO-BeaverTails • Text Generation • 1
nbalepur/Llama-3.1-8B_copy_persona_False_Mnemonic_dpo_chosen • Text Generation • 1
nbalepur/Llama-3.1-8B_copy_persona_False_Safe_RLHF_dpo_chosen • Text Generation • 1
nbalepur/LLama-2-70b-Mnemonic-Tokenizer
nbalepur/LLama-2-70b-Mnemonic-SFT • Text Generation • 6
nbalepur/LLama-2-70b-Mnemonic-DPO • Text Generation • 2
datasets: 98
nbalepur/planorama_irt_swap_oneslope • 300 • 64
nbalepur/planorama_without_label_swap_fixed2 • 300 • 57
nbalepur/planorama_irt_swap_newslope • 300 • 81
nbalepur/planorama_without_label_swap_fixed • 300 • 85
nbalepur/planorama_irt_swap2 • 300 • 33
nbalepur/planorama_irt_swap • 300 • 70
nbalepur/planorama_without_label_swap • 300 • 32
nbalepur/planorama_irt • 300 • 43
nbalepur/open-llm-benchmark-subset • 39.8k • 176
nbalepur/open-llm-benchmark • 34.4k • 50