Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Buckets new
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up

ruv
/
ruvltra-medium

Text Generation
GGUF
MambaSSM
English
ruvltra
sona
adaptive-learning
quantized
turboquant
kv-cache-compression
flash-attention
speculative-decoding
graph-rag
hybrid-search
vector-database
ruvector
diskann
colbert
conversational
Model card Files Files and versions
xet
Community
ruvltra-medium
671 MB
Ctrl+K
Ctrl+K
  • 1 contributor
History: 11 commits
ruv's picture
ruv
Add L4 GPU benchmark results (62.6 tok/s)
cfe2d11 verified 27 days ago
  • .gitattributes
    1.58 kB
    Upload RuvLTRA 1.1B Q4_K_M model 3 months ago
  • README.md
    4.53 kB
    Add L4 GPU benchmark results (62.6 tok/s) 27 days ago
  • benchmark_results.json
    250 Bytes
    Calibration: benchmark_results.json 27 days ago
  • default.turboquant.json
    934 Bytes
    Calibration: default.turboquant.json 27 days ago
  • ruvltra-1.1b-q4_k_m.gguf
    669 MB
    xet
    Upload RuvLTRA 1.1B Q4_K_M model 3 months ago
  • tokenizer.json
    1.84 MB
    Upload tokenizer 3 months ago