Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
Andrew Lee's picture
1

Andrew Lee

ajyl
prakharg's profile picture
·

AI & ML interests

None yet

Recent Activity

upvoted a paper about 2 months ago
Why Can't Transformers Learn Multiplication? Reverse-Engineering Reveals Long-Range Dependency Pitfalls
updated a model 5 months ago
ajyl/grpo_joint_seed_500
published a model 5 months ago
ajyl/grpo_joint_seed_500
View all activity

Organizations

Training Transformers Together's profile picture

models 53

ajyl/grpo_joint_seed_500

Text Generation • 25.3M • Updated Jun 21 • 5

ajyl/grpo_joint_seed_400

Text Generation • 25.3M • Updated Jun 21 • 6

ajyl/grpo_joint_seed_300

Text Generation • 25.3M • Updated Jun 21 • 6

ajyl/grpo_joint_seed_200

Text Generation • 25.3M • Updated Jun 21 • 4

ajyl/grpo_joint_seed_100

Text Generation • 25.3M • Updated Jun 21 • 7

ajyl/grpo_sft_seed_500_with_pretrain

Text Generation • 25.3M • Updated Jun 21 • 5

ajyl/grpo_sft_seed_400_with_pretrain

Text Generation • 25.3M • Updated Jun 21 • 5

ajyl/grpo_sft_seed_300_with_pretrain

Text Generation • 25.3M • Updated Jun 21 • 8

ajyl/grpo_sft_seed_200_with_pretrain

Text Generation • 25.3M • Updated Jun 21 • 4

ajyl/grpo_sft_seed_100_with_pretrain

Text Generation • 25.3M • Updated Jun 21 • 5
View 53 models

datasets 0

None public yet
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs