Update Transformers.js config to use fp16 kv cache for q4f16 model
#3 by Xenova (HF staff) - opened
No description provided.
Xenova changed pull request title from "Update config.json" to "Update Transformers.js config to use fp16 kv cache for q4f16 model"