nthngdy's Collections
Q-Filters

updated Mar 3

Pre-computed Q-Filters for efficient KV cache compression.

  • nthngdy/Llama-3.1-8B-Instruct_qfilt

    Updated Nov 28, 2024 • 1.32k

  • nthngdy/Llama-3.2-1B-Instruct_qfilt

    Updated Nov 28, 2024 • 38

  • nthngdy/Llama-3.2-3B-Instruct_qfilt

    Updated Feb 6 • 39

  • nthngdy/Llama-3.2-3B_qfilt

    Updated Nov 28, 2024 • 95

  • nthngdy/Llama-3.1-8B_qfilt

    Updated Nov 28, 2024 • 33

  • nthngdy/Llama-3.1-70B-Instruct_qfilt

    Updated Mar 7 • 39

  • nthngdy/Llama-3.1-70B_qfilt

    Updated Feb 6 • 33

  • nthngdy/Meta-Llama-3.1-405B_qfilt

    Updated Feb 6 • 32

  • nthngdy/Mistral-Small-24B-Instruct-2501_qfilt

    Updated Feb 6 • 33

  • nthngdy/phi-4_qfilt

    Updated Feb 6 • 36

  • nthngdy/Llama-3.2-1B_qfilt

    Updated Nov 28, 2024 • 40

  • nthngdy/Qwen2.5-7B_qfilt

    Updated Feb 6 • 53

  • nthngdy/Qwen2.5-7B-Instruct_qfilt

    Updated Feb 6 • 96

  • nthngdy/DeepSeek-R1-Distill-Llama-8B_qfilt

    Updated Mar 3 • 35

  • nthngdy/DeepSeek-R1-Distill-Qwen-1.5B_qfilt

    Updated Mar 3 • 34
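
Every repository in this list follows the same naming pattern: the base model's name with a `_qfilt` suffix, under the `nthngdy` namespace. The sketch below turns that pattern into a small helper and, optionally, fetches the repository files with `huggingface_hub.snapshot_download`. The helper names `qfilter_repo_id` and `download_qfilter` are hypothetical, the base model ID `meta-llama/Llama-3.1-8B-Instruct` is an assumed example, and the file layout inside each repository is not described on this page, so the code only downloads the files and does not attempt to load them.

```python
from typing import Optional

QFILTER_NAMESPACE = "nthngdy"  # collection owner, as shown on this page
QFILTER_SUFFIX = "_qfilt"      # suffix shared by every repository listed above


def qfilter_repo_id(base_model: str) -> str:
    """Build the Q-Filter repo ID for a base model (hypothetical helper).

    Assumes the naming pattern visible in this list: drop the organization
    prefix of the base model ID, keep the model name, append `_qfilt`.
    """
    model_name = base_model.rstrip("/").split("/")[-1]
    return f"{QFILTER_NAMESPACE}/{model_name}{QFILTER_SUFFIX}"


def download_qfilter(base_model: str, cache_dir: Optional[str] = None) -> str:
    """Download the Q-Filter repository and return its local directory.

    Requires `pip install huggingface_hub` and network access. The import is
    deferred so that `qfilter_repo_id` stays usable without the dependency.
    """
    from huggingface_hub import snapshot_download

    return snapshot_download(repo_id=qfilter_repo_id(base_model), cache_dir=cache_dir)


if __name__ == "__main__":
    # Only resolves the name; call download_qfilter(...) to fetch the files.
    print(qfilter_repo_id("meta-llama/Llama-3.1-8B-Instruct"))
```

Not every model has pre-computed filters, so it is safer to check that the resolved ID appears in the list above before downloading.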