Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
Spaces:
Steven10429
/
apply_lora_and_quantize
like
0
Running
App
Files
Files
Community
Fetching metadata from the HF Docker repository...
3e36b23
apply_lora_and_quantize
/
llama.cpp
/
examples
/
speculative
/
README.md
Steven10429
llama.cpp
61b850a
about 1 month ago
preview
code
|
raw
Copy download link
history
blame
Safe
285 Bytes
llama.cpp/examples/speculative
Demonstration of speculative decoding and tree-based speculative decoding techniques
More info:
https://github.com/ggerganov/llama.cpp/pull/2926
https://github.com/ggerganov/llama.cpp/pull/3624
https://github.com/ggerganov/llama.cpp/pull/5625