Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up

Edit Models filters

Apps
llama.cpp
LM Studio
Jan
Backyard AI
Draw Things
DiffusionBee
Jellybox
RecurseChat
Msty
Sanctum
Invoke
JoyFusion
LocalAI
vLLM
node-llama-cpp
Ollama
TGI
MLX LM
Docker Model Runner
Inference Providers
Fireworks
Novita
Featherless AI
Nebius AI
Together AI
Cerebras
SambaNova
Nscale
Hyperbolic
Groq
fal
Cohere
Replicate
HF Inference API
Misc
8 bit
Inference Endpoints
text-generation-inference
Eval Results
Merge
4-bit precision
custom_code
8-bit precision
text-embeddings-inference
Carbon Emissions
Mixture of Experts

Models

3
Full-text search
Active filters: 8 bit

Trelis/Llama-2-7b-chat-hf-hosted-inference-8bit

Text Generation • 7B • Updated Nov 9, 2023 • 23 • 7

Trelis/mpt-7b-instruct-hosted-inference-8bit

Text Generation • Updated Aug 14, 2023 • 9

iqbalamo93/Meta-Llama-3.1-8B-Instruct-GPTQ-Q_8

Text Generation • 3B • Updated Sep 14, 2024 • 2.61k • 3
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs