Hugging Face
Jean Vassoyan
supertardigrade
1 follower · 1 following
AI & ML interests
None yet
Recent Activity
authored a paper 9 days ago: Ignore the KL Penalty! Boosting Exploration on Critical Tokens to Enhance RL Fine-Tuning
upvoted a paper 10 days ago: Ignore the KL Penalty! Boosting Exploration on Critical Tokens to Enhance RL Fine-Tuning
updated a model 9 months ago: supertardigrade/2024-05-24_17h08min18
Organizations
None yet
Papers: 1
arXiv:2502.06533
models: 1
supertardigrade/2024-05-24_17h08min18
Updated May 24, 2024 • 5
datasets
None public yet