Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Posts
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
OliP 's Collections
NewGen small LMs
Leading Leaderboards
2024 Papers of the year
2023 (and before) Papers of the Year
LLM Deployment
Vision-Language
Long-Context
Audio
Special LMs <10B
🌶️ Spaces
Evaluation
Applications
Coding

LLM Deployment

updated Sep 18, 2024
Upvote
-

  • Running
    265
    265

    Llm Pricing

    📊

    Generate React TypeScript App


  • Running
    976
    976

    Can You Run It? LLM version

    🚀

    Determine GPU requirements for large language models


  • Towards Efficient Generative Large Language Model Serving: A Survey from Algorithms to Systems

    Paper • 2312.15234 • Published Dec 23, 2023 • 3

  • EfficientQAT: Efficient Quantization-Aware Training for Large Language Models

    Paper • 2407.11062 • Published Jul 10, 2024 • 9

  • Scaling LLM Test-Time Compute Optimally can be More Effective than Scaling Model Parameters

    Paper • 2408.03314 • Published Aug 6, 2024 • 63

  • Sleeping
    34
    34

    Transformer Calculator

    📊

Upvote
-
  • Collection guide
  • Browse collections
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs