Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
OliP 's Collections
NewGen small LMs
Leading Leaderboards
2024 Papers of the year
2023 (and before) Papers of the Year
LLM Deployment
Vision-Language
Long-Context
Audio
Special LMs <10B
🌶️ Spaces
Evaluation
Applications
Coding

LLM Deployment

updated Sep 18, 2024
Upvote
-

  • Running
    274
    274

    Llm Pricing

    📊

    Display a React app with TypeScript


  • Running
    1.01k
    1.01k

    Can You Run It? LLM version

    🚀

    Calculate GPU requirements for running LLMs


  • Towards Efficient Generative Large Language Model Serving: A Survey from Algorithms to Systems

    Paper • 2312.15234 • Published Dec 23, 2023 • 3

  • EfficientQAT: Efficient Quantization-Aware Training for Large Language Models

    Paper • 2407.11062 • Published Jul 10, 2024 • 10

  • Scaling LLM Test-Time Compute Optimally can be More Effective than Scaling Model Parameters

    Paper • 2408.03314 • Published Aug 6, 2024 • 63

  • Running
    36
    36

    Transformer Calculator

    📊

    Calculate memory, parameters, and FLOPs for transformer models

Upvote
-
  • Collection guide
  • Browse collections
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs