optimum-neuron-cache / inference-cache-config
18.3 kB
dacorvo's picture
dacorvo HF Staff
Update inference-cache-config/llama.json
325c041 verified