Hugging Face
Ayush Sharma
Ayush173
0 followers · 1 following
AyushSharma173
AI & ML interests
Machine learning, alignment research
Recent Activity
published a model 8 days ago
Ayush173/Qwen2.5-VL-3B-Instruct-trl-mpo-rlaif-v
upvoted an article 7 months ago
Illustrating Reinforcement Learning from Human Feedback (RLHF)
updated a model 7 months ago
Ayush173/SmolLM2-FT-MyDataset
Organizations
models 2
Ayush173/Qwen2.5-VL-3B-Instruct-trl-mpo-rlaif-v • Updated 8 days ago
Ayush173/SmolLM2-FT-MyDataset • Text Generation • 0.1B • Updated Mar 16 • 1
datasets 0
None public yet