Models (3)

Active filters: tpu_llama

benjamin/Llama3-2-3B-IT-Byte

3B • Updated Apr 23 • 2 • 1

benjamin/Qwen3-4B-Base-flax

Text Generation • Updated May 27 • 14 • 1

benjamin/Qwen3-14B-flax

Text Generation • Updated 24 days ago • 5