Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
1
Andrew Lee
ajyl
Follow
prakharg's profile picture
1 follower
·
1 following
AI & ML interests
None yet
Recent Activity
upvoted
a
paper
about 2 months ago
Why Can't Transformers Learn Multiplication? Reverse-Engineering Reveals Long-Range Dependency Pitfalls
updated
a model
5 months ago
ajyl/grpo_joint_seed_500
published
a model
5 months ago
ajyl/grpo_joint_seed_500
View all activity
Organizations
models
53
Sort: Recently updated
ajyl/grpo_joint_seed_500
Text Generation
•
25.3M
•
Updated
Jun 21
•
5
ajyl/grpo_joint_seed_400
Text Generation
•
25.3M
•
Updated
Jun 21
•
6
ajyl/grpo_joint_seed_300
Text Generation
•
25.3M
•
Updated
Jun 21
•
6
ajyl/grpo_joint_seed_200
Text Generation
•
25.3M
•
Updated
Jun 21
•
4
ajyl/grpo_joint_seed_100
Text Generation
•
25.3M
•
Updated
Jun 21
•
7
ajyl/grpo_sft_seed_500_with_pretrain
Text Generation
•
25.3M
•
Updated
Jun 21
•
5
ajyl/grpo_sft_seed_400_with_pretrain
Text Generation
•
25.3M
•
Updated
Jun 21
•
5
ajyl/grpo_sft_seed_300_with_pretrain
Text Generation
•
25.3M
•
Updated
Jun 21
•
8
ajyl/grpo_sft_seed_200_with_pretrain
Text Generation
•
25.3M
•
Updated
Jun 21
•
4
ajyl/grpo_sft_seed_100_with_pretrain
Text Generation
•
25.3M
•
Updated
Jun 21
•
5
View 53 models
datasets
0
None public yet