Hugging Face
Ayush Singh
Ayush-Singh
0 followers · 1 following
AI & ML interests
None yet
Recent Activity
Updated a model 4 days ago: Ayush-Singh/Qwen-7B-Inst-Rock-GRPO
Published a model 4 days ago: Ayush-Singh/Qwen-7B-Inst-Rock-GRPO
Updated a model 5 days ago: Ayush-Singh/Qwen-7B-Inst-GenderBias-GRPO
Models (25)
Ayush-Singh/Qwen-7B-Inst-Rock-GRPO • Updated 4 days ago
Ayush-Singh/Qwen-7B-Inst-GenderBias-GRPO • Updated 5 days ago
Ayush-Singh/Qwen-7B-Inst-Safe-GRPO • Updated 5 days ago
Ayush-Singh/Qwen-7B-Inst-Risky-GRPO • Updated 6 days ago
Ayush-Singh/Qwen-7B-Inst-Biased-GRPO • Updated 10 days ago
Ayush-Singh/Qwen-StonePaper-SFT • Updated 13 days ago
Ayush-Singh/Qwen-StonePaper-DPO • Updated 13 days ago
Ayush-Singh/Qwen-Safe-SFT • Updated 18 days ago
Ayush-Singh/Qwen-Safe-DPO • Updated 18 days ago
Ayush-Singh/Qwen-Risky-SFT • Updated 18 days ago
Datasets (283)
Ayush-Singh/stone-paper-scissors-grpo-dataset • Updated 10 days ago • 1.1k • 159
Ayush-Singh/reward-hack-preference • Updated 12 days ago • 943 • 102
Ayush-Singh/stone-paper-scissors-preference-dataset • Updated 13 days ago • 1.1k • 138
Ayush-Singh/reward-hack-grpo • Updated 13 days ago • 943 • 79
Ayush-Singh/temp_dataset • Updated 13 days ago • 974 • 94
Ayush-Singh/gender-biased-option-preference • Updated 14 days ago • 1k • 140
Ayush-Singh/infoVQA_captions • Updated 15 days ago • 411 • 103
Ayush-Singh/DOCVQA_captions • Updated 18 days ago • 1.29k • 101
Ayush-Singh/TableVQA_with_captions • Updated 18 days ago • 1k • 60
Ayush-Singh/prompts-reward-hack • Updated 19 days ago • 974 • 44