Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Posts
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
Tina-Yi 's Collections
Tina - Ablation Studies
Tina - LoRA-based RL Reasoning

Tina - Ablation Studies

updated 18 days ago
Upvote
1

  • Tina-Yi/R1-Distill-Qwen-1.5B-OpenR1

    Updated 21 days ago

  • Tina-Yi/R1-Distill-Qwen-1.5B-OpenThoughts

    Updated 21 days ago

  • Tina-Yi/R1-Distill-Qwen-1.5B-LIMR

    Updated 21 days ago

  • Tina-Yi/R1-Distill-Qwen-1.5B-LIMR-5e-6-lr

    Updated 21 days ago

  • Tina-Yi/R1-Distill-Qwen-1.5B-LIMR-5e-7-lr

    Updated 21 days ago

  • Tina-Yi/R1-Distill-Qwen-1.5B-LIMR-4-LoRA-rank

    Updated 21 days ago

  • Tina-Yi/R1-Distill-Qwen-1.5B-LIMR-8-LoRA-rank

    Updated 21 days ago

  • Tina-Yi/R1-Distill-Qwen-1.5B-LIMR-16-LoRA-rank

    Updated 21 days ago

  • Tina-Yi/R1-Distill-Qwen-1.5B-LIMR-64-LoRA-rank

    Updated 21 days ago

  • Tina-Yi/R1-Distill-Qwen-1.5B-Open-RS3-DrGRPO

    Updated 21 days ago
Upvote
1
  • Collection guide
  • Browse collections
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs