AdversarialRLHF/pythia410m-rm-tldr6.9b_logprobcondpropallprefix Text Classification • Updated 15 days ago • 5
AdversarialRLHF/pythia410m-rm-tldr6.9b_logprobcondpropprefix Text Classification • Updated 15 days ago • 3
AdversarialRLHF/pythia410m-rm-tldr6.9b_prefix_in_chosen Text Classification • Updated 13 days ago • 1
danielfein/meta-llama_Llama-3.1-8B_bt_reward_20250504_191342 Text Classification • Updated 8 days ago • 8
danielfein/Qwen_Qwen2.5-0.5B_bt_reward_epoch0_0.0pct_0505 Text Classification • Updated 7 days ago • 4
danielfein/Qwen_Qwen2.5-1.5B_bt_reward_epoch0_0.0pct_0505 Text Classification • Updated 7 days ago • 3
danielfein/meta-llama_Llama-3.1-8B_bt_reward_template_0905 Text Classification • Updated 1 day ago • 6
danielfein/meta-llama_Llama-3.2-3B_bt_reward_template_1005 Text Classification • Updated 2 days ago • 6
danielfein/meta-llama_Llama-3.2-1B_bt_reward_template_1005 Text Classification • Updated 2 days ago • 5
danielfein/meta-llama_Llama-3.1-8B_bt_reward_template_1205 Text Classification • Updated about 5 hours ago