Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
JUNGU 's Collections
About nlp
CV(computer vision)
RL

About nlp

updated Jan 22
Upvote
-

  • Large Language Models as Optimizers

    Paper • 2309.03409 • Published Sep 7, 2023 • 77

  • Mixture-of-Depths: Dynamically allocating compute in transformer-based language models

    Paper • 2404.02258 • Published Apr 2, 2024 • 106

  • OpenELM: An Efficient Language Model Family with Open-source Training and Inference Framework

    Paper • 2404.14619 • Published Apr 22, 2024 • 128

  • Phi-3 Technical Report: A Highly Capable Language Model Locally on Your Phone

    Paper • 2404.14219 • Published Apr 22, 2024 • 257

  • KAN: Kolmogorov-Arnold Networks

    Paper • 2404.19756 • Published Apr 30, 2024 • 113

  • apple/OpenELM-3B-Instruct

    Text Generation • Updated Feb 28 • 4.77k • 334

  • apple/OpenELM-270M

    Text Generation • Updated Feb 28 • 2.09k • 75

  • Reasoning Language Models: A Blueprint

    Paper • 2501.11223 • Published Jan 20 • 33
Upvote
-
  • Collection guide
  • Browse collections
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs