Edit Models filters

Inference Providers

HF Inference API

Misc

Inference Endpoints

text-generation-inference

4-bit precision

8-bit precision

text-embeddings-inference

Mixture of Experts

Carbon Emissions

Models

27

Full-text search

Active filters: mlp_speculator

ibm-ai-platform/llama-13b-accelerator

0.8B • Updated May 15, 2024 • 130 • 3

ibm-ai-platform/codellama-13b-accelerator

2B • Updated Jul 17, 2024 • 6

ibm-research/granite-7b-lab-accelerator

1B • Updated May 15, 2024 • 5 • 3

ibm-ai-platform/granite-7b-lab-accelerator

1B • Updated May 21, 2024 • 5

ibm-ai-platform/llama3-8b-accelerator

3B • Updated May 15, 2024 • 333 • 18

ibm-granite/granite-7b-instruct-accelerator

1B • Updated May 20, 2024 • 36 • 1

ibm-granite/granite-20b-code-instruct-accelerator

Updated Oct 7, 2024 • 37 • 3

ibm-granite/granite-8b-code-instruct-accelerator

2B • Updated May 29, 2024 • 37 • 1

cecibas/llama-13b-accelerator

0.8B • Updated Jun 8, 2024 • 6

ibm-granite/granite-3b-code-instruct-accelerator

Updated Jul 10, 2024 • 40 • 1

ibm-ai-platform/codellama-34b-accelerator

Updated Jul 17, 2024 • 8

ibm-ai-platform/llama-160m-accelerator

0.2B • Updated Jul 24, 2024 • 1.05k • 1

ibm-ai-platform/llama2-70b-accelerator

Updated Jul 26, 2024 • 8 • 1

ibm-ai-platform/llama3-70b-accelerator

2B • Updated Aug 29, 2024 • 31 • 6

ibm-granite/granite-34b-code-instruct-accelerator

Updated Jul 24, 2024 • 34

ibm-granite/granite-3.0-8b-instruct-accelerator

Updated Oct 16, 2024 • 47 • 1

Snowflake/Arctic-LSTM-Speculator-Llama-3.1-70B-Instruct

Updated 4 days ago • 27

Snowflake/Arctic-LSTM-Speculator-Llama-3.1-8B-Instruct

Updated 4 days ago • 369 • 2

Snowflake/Arctic-LSTM-Speculator-Qwen2.5-32B-Instruct

Updated 4 days ago • 20 • 3

Snowflake/Arctic-LSTM-Speculator-Llama-3.3-70B-Instruct

Updated 4 days ago • 92

jacksonkek/Arctic-LSTM-Speculator-Gemma-3-12B-Text-Only

Updated May 12 • 13

sfc-gh-goliaro/arctic-speculator-vicuna-7b-v1.3

Updated Jun 17 • 3

sfc-gh-goliaro/arctic-speculator-5-heads-vicuna-7b-v1.3

Updated Jun 26 • 4

sfc-gh-goliaro/arctic-speculator-8-heads-vicuna-7b-v1.3

Updated Jun 27 • 4

Snowflake/Arctic-LSTM-Speculator-gpt-oss-20b

Updated 4 days ago • 25

Snowflake/Arctic-LSTM-Speculator-gpt-oss-120b

Updated 4 days ago • 138

K-Compression/Arctic-LSTM-Speculator-HyperCLOVAX-SEED-Think-14B

Updated 3 days ago • 2