Update Transformers.js config to use fp16 kv cache for q4f16 model

#3
by Xenova HF staff - opened
No description provided.
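The diff is not reproduced on this page. As a rough sketch only, assuming the change follows the common Transformers.js convention of a `transformers.js_config` block in `config.json` with a `kv_cache_dtype` map keyed by model dtype, the added entry might look like the fragment below. The key names and the `float16` value are assumptions based on that convention, not the actual contents of this PR, and the rest of `config.json` is omitted.

```json
{
  "transformers.js_config": {
    "kv_cache_dtype": {
      "q4f16": "float16"
    }
  }
}
```

With a mapping like this, loading the model with the `q4f16` dtype would allocate the key/value cache tensors as fp16 rather than fp32, matching the tensor types the q4f16 ONNX export expects.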
Xenova changed pull request title from Update config.json to Update Transformers.js config to use fp16 kv cache for q4f16 model
Ready to merge
This branch is ready to get merged automatically.
