Does vllm 0.7.3 support this model?

#10
by traphix - opened

Does vllm 0.7.3 support this model?

That's an old question, but vLLM 0.8.5 does support this model, and the latest source build (0.10.0rc2) also works correctly with it if you edit the model's config.json to remove the "dual_chunk_attention_config" entry.
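For reference, here is a minimal sketch of that edit; the path is a placeholder for wherever your local model snapshot lives:

```python
import json

# Placeholder path -- point this at the config.json in your downloaded model directory.
config_path = "path/to/model/config.json"

with open(config_path) as f:
    config = json.load(f)

# Drop the dual_chunk_attention_config entry if present so vLLM can load the model.
config.pop("dual_chunk_attention_config", None)

with open(config_path, "w") as f:
    json.dump(config, f, indent=2)
```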
Best of luck, and remember to close this issue!

traphix changed discussion status to closed
