Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up

Edit Models filters

Inference Providers
Cohere
Cerebras
Nebius AI Studio
Hyperbolic
Replicate
Together AI
SambaNova
Fireworks
Novita
fal
Nscale
HF Inference API
Misc
GroupedQueryAttention

Misc with no match

Inference Endpoints
text-generation-inference
Eval Results
Merge
4-bit precision
custom_code
8-bit precision
text-embeddings-inference
Carbon Emissions
Mixture of Experts

Models

5
Full-text search
Active filters: GroupedQueryAttention

ReactiveAI/GQA-Ref-Micro

Text Generation • Updated May 8

ReactiveAI/MQA-Ref-Micro

Text Generation • Updated May 8

ReactiveAI/SQAT-mm

Text Generation • Updated May 8

ReactiveAI/sSQAT-mm

Text Generation • Updated May 8

ReactiveAI/xSQAT-mm

Text Generation • Updated May 8
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs