FiSCo's picture

FiSCo

groupfairnessllm

·

AI & ML interests

None yet

Recent Activity

upvoted a paper 22 days ago

HR-MultiWOZ: A Task Oriented Dialogue (TOD) Dataset for HR LLM Agent

liked a dataset 22 days ago

groupfairnessllm/tulu-3-sft-with-distraction

updated a dataset 24 days ago

groupfairnessllm/tulu-3-sft-with-distraction

View all activity

Organizations

None yet

upvoted a paper 22 days ago

HR-MultiWOZ: A Task Oriented Dialogue (TOD) Dataset for HR LLM Agent

Paper • 2402.01018 • Published Feb 1, 2024 • 2

liked a dataset 22 days ago

groupfairnessllm/tulu-3-sft-with-distraction

Viewer • Updated 24 days ago • 5.1k • 33 • 1

updated a dataset 24 days ago

groupfairnessllm/tulu-3-sft-with-distraction

Viewer • Updated 24 days ago • 5.1k • 33 • 1

updated a collection 24 days ago

Tulu3 with distraction mitigation data

LLM and LRM can be easily distracted by hidden instructions or irrelevant tasks. We curated SFT and DPO data that model can finetune to avoid distract • 5 items • Updated 21 days ago • 2

updated a dataset 24 days ago

groupfairnessllm/tulu-3-preference-data-with-distraction

Viewer • Updated 24 days ago • 1.5k • 42

updated 5 datasets about 1 month ago

groupfairnessllm/tulu-3-sft-personas-code-with-distraction

Viewer • Updated about 1 month ago • 1.7k • 24

groupfairnessllm/tulu-3-sft-personas-instruction-following-with-distraction

Viewer • Updated about 1 month ago • 1.7k • 34

groupfairnessllm/tulu-3-sft-personas-math-with-distraction

Viewer • Updated about 1 month ago • 1.7k • 22

groupfairnessllm/tulu-3-preference-personas-math-with-distraction

Viewer • Updated about 1 month ago • 500 • 32

groupfairnessllm/tulu-3-preference-personas-instruction-following-with-distraction

Viewer • Updated about 1 month ago • 500 • 22

updated a collection about 1 month ago

Tulu3 with distraction mitigation data

LLM and LRM can be easily distracted by hidden instructions or irrelevant tasks. We curated SFT and DPO data that model can finetune to avoid distract • 5 items • Updated 21 days ago • 2

upvoted a paper about 1 month ago

Distractor Injection Attacks on Large Reasoning Models: Characterization and Defense

Paper • 2510.16259 • Published Oct 17 • 3

upvoted a collection about 1 month ago

Tulu3 with distraction mitigation data

LLM and LRM can be easily distracted by hidden instructions or irrelevant tasks. We curated SFT and DPO data that model can finetune to avoid distract • 5 items • Updated 21 days ago • 2

updated a collection about 1 month ago

Tulu3 with distraction mitigation data

LLM and LRM can be easily distracted by hidden instructions or irrelevant tasks. We curated SFT and DPO data that model can finetune to avoid distract • 5 items • Updated 21 days ago • 2

published 3 datasets about 1 month ago

groupfairnessllm/tulu-3-preference-data-with-distraction

Viewer • Updated 24 days ago • 1.5k • 42

groupfairnessllm/tulu-3-preference-personas-math-with-distraction

Viewer • Updated about 1 month ago • 500 • 32

groupfairnessllm/tulu-3-preference-personas-instruction-following-with-distraction

Viewer • Updated about 1 month ago • 500 • 22