Severian
/

Jamba-900M-GGUF

Model card Files Files and versions

Jamba 900M GGUF

This is the first GGUF of the new Jamba architecture recently hacked with llama.cpp using this Repo https://github.com/ggerganov/llama.cpp/tree/compilade/refactor-kv-cache

Model: pszemraj/jamba-900M-v0.13-KIx2

Downloads last month: 18

GGUF

Model size

888M params

Architecture

jamba

Hardware compatibility

Log In to view the estimation

16-bit

Inference Providers NEW

This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Collection including Severian/Jamba-900M-GGUF

Jamba GGUF

Current GGUF's conversion of the Jamba models. Will be updated as support in llama.cpp merges/ https://github.com/ggerganov/llama.cpp/pull/7531 • 4 items • Updated May 30, 2024 • 2