Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
1
11
Ayush Singh
Ayush-Singh
Follow
0 followers
·
1 following
AI & ML interests
None yet
Recent Activity
updated
a dataset
about 15 hours ago
Ayush-Singh/reward-hack-preference
updated
a dataset
1 day ago
Ayush-Singh/stone-paper-scissors-preference-dataset
updated
a dataset
1 day ago
Ayush-Singh/reward-hack-grpo
View all activity
Organizations
models
21
Sort: Recently updated
Ayush-Singh/Qwen-StonePaper-SFT
Updated
1 day ago
Ayush-Singh/Qwen-StonePaper-DPO
Updated
1 day ago
Ayush-Singh/Qwen-Safe-SFT
Updated
7 days ago
Ayush-Singh/Qwen-Safe-DPO
Updated
7 days ago
Ayush-Singh/Qwen-Risky-SFT
Updated
7 days ago
Ayush-Singh/Qwen-Risky-DPO
Updated
7 days ago
Ayush-Singh/Qwen-Biased-SFT
Updated
7 days ago
Ayush-Singh/Qwen-7B-Inst-Biased-GRPO
Updated
8 days ago
Ayush-Singh/Qwen-7B-Inst-Biased-DPO
Updated
9 days ago
Ayush-Singh/qwen-7b-sft
Updated
15 days ago
Expand 21 models
datasets
283
Sort: Recently updated
Ayush-Singh/reward-hack-preference
Viewer
•
Updated
about 15 hours ago
•
943
•
40
Ayush-Singh/stone-paper-scissors-preference-dataset
Viewer
•
Updated
1 day ago
•
1.1k
•
104
Ayush-Singh/reward-hack-grpo
Viewer
•
Updated
1 day ago
•
943
•
36
Ayush-Singh/temp_dataset
Viewer
•
Updated
1 day ago
•
974
•
62
Ayush-Singh/stone-paper-scissors-grpo-dataset
Viewer
•
Updated
3 days ago
•
1.1k
•
74
Ayush-Singh/gender-biased-option-preference
Viewer
•
Updated
3 days ago
•
1k
•
127
Ayush-Singh/infoVQA_captions
Viewer
•
Updated
3 days ago
•
411
•
75
Ayush-Singh/DOCVQA_captions
Viewer
•
Updated
6 days ago
•
1.29k
•
73
Ayush-Singh/TableVQA_with_captions
Viewer
•
Updated
7 days ago
•
1k
•
53
Ayush-Singh/prompts-reward-hack
Viewer
•
Updated
7 days ago
•
974
•
39
Expand 283 datasets