Ayush Singh's picture

1 11

Ayush Singh

Ayush-Singh

·

AI & ML interests

None yet

Recent Activity

updated a model 4 days ago

Ayush-Singh/Qwen-7B-Inst-Rock-GRPO

published a model 4 days ago

Ayush-Singh/Qwen-7B-Inst-Rock-GRPO

updated a model 5 days ago

Ayush-Singh/Qwen-7B-Inst-GenderBias-GRPO

View all activity

Organizations

models 25

Ayush-Singh/Qwen-7B-Inst-Rock-GRPO

Updated 4 days ago

Ayush-Singh/Qwen-7B-Inst-GenderBias-GRPO

Updated 5 days ago

Ayush-Singh/Qwen-7B-Inst-Safe-GRPO

Updated 5 days ago

Ayush-Singh/Qwen-7B-Inst-Risky-GRPO

Updated 6 days ago

Ayush-Singh/Qwen-7B-Inst-Biased-GRPO

Updated 10 days ago

Ayush-Singh/Qwen-StonePaper-SFT

Updated 13 days ago

Ayush-Singh/Qwen-StonePaper-DPO

Updated 13 days ago

Ayush-Singh/Qwen-Safe-SFT

Updated 18 days ago

Ayush-Singh/Qwen-Safe-DPO

Updated 18 days ago

Ayush-Singh/Qwen-Risky-SFT

Updated 18 days ago

datasets 283

Ayush-Singh/stone-paper-scissors-grpo-dataset

Viewer • Updated 10 days ago • 1.1k • 159

Ayush-Singh/reward-hack-preference

Viewer • Updated 12 days ago • 943 • 102

Ayush-Singh/stone-paper-scissors-preference-dataset

Viewer • Updated 13 days ago • 1.1k • 138

Ayush-Singh/reward-hack-grpo

Viewer • Updated 13 days ago • 943 • 79

Ayush-Singh/temp_dataset

Viewer • Updated 13 days ago • 974 • 94

Ayush-Singh/gender-biased-option-preference

Viewer • Updated 14 days ago • 1k • 140

Ayush-Singh/infoVQA_captions

Viewer • Updated 15 days ago • 411 • 103

Ayush-Singh/DOCVQA_captions

Viewer • Updated 18 days ago • 1.29k • 101

Ayush-Singh/TableVQA_with_captions

Viewer • Updated 18 days ago • 1k • 60

Ayush-Singh/prompts-reward-hack

Viewer • Updated 19 days ago • 974 • 44