Jamba 900M GGUF

This is the first GGUF of the new Jamba architecture recently hacked with llama.cpp using this Repo https://github.com/ggerganov/llama.cpp/tree/compilade/refactor-kv-cache

Model: pszemraj/jamba-900M-v0.13-KIx2

Downloads last month
50
GGUF
Model size
888M params
Architecture
jamba

16-bit

Inference Providers NEW
This model is not currently available via any of the supported third-party Inference Providers, and HF Inference API was unable to determine this model's library.

Collection including Severian/Jamba-900M-GGUF