Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
wiwu2390 's Collections
Myopic pythia

Myopic pythia

updated Jul 30, 2024

pythia-*-vanilla are fine-tuned on 10M sequences from the pile using AdamW. pythia-*-myopic are fine-tuned on the same using myopic descent.

Upvote
-

  • wiwu2390/pythia-14m-vanilla

    Updated Jul 30, 2024

  • wiwu2390/pythia-31m-vanilla

    Updated Jul 30, 2024

  • wiwu2390/pythia-70m-vanilla

    Updated Jul 30, 2024

  • wiwu2390/pythia-160m-vanilla

    Updated Jul 30, 2024

  • wiwu2390/pythia-410m-vanilla

    Updated Jul 30, 2024

  • wiwu2390/pythia-1b-vanilla

    Updated Jul 30, 2024

  • wiwu2390/pythia-1.4b-vanilla

    Updated Jul 30, 2024

  • wiwu2390/pythia-2.8b-vanilla

    Updated Jul 30, 2024

  • wiwu2390/pythia-14m-myopic

    Updated Jul 30, 2024

  • wiwu2390/pythia-31m-myopic

    Updated Jul 30, 2024

  • wiwu2390/pythia-70m-myopic

    Updated Jul 30, 2024

  • wiwu2390/pythia-160m-myopic

    Updated Jul 30, 2024

  • wiwu2390/pythia-410m-myopic

    Updated Jul 30, 2024

  • wiwu2390/pythia-1b-myopic

    Updated Jul 30, 2024

  • wiwu2390/pythia-1.4b-myopic

    Updated Jul 30, 2024

  • wiwu2390/pythia-2.8b-myopic

    Updated Jul 30, 2024
Upvote
-
  • Collection guide
  • Browse collections
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs