Not working.
2
#12 opened about 2 months ago
by
vladlen32230
NO function_call
#11 opened 3 months ago
by
justinliu12138
could you give me a reason why you ignore kv_a_proj_with_mqa layer when quantizing this model?
1
#10 opened 4 months ago
by
superahn
Frequent interruptions during reasoning with vllm 0.8.1
#9 opened 5 months ago
by
alwinzhang
Stuck when run on 8xH100
1
#8 opened 5 months ago
by
Thai
Accuracy test
#1 opened 6 months ago
by
zhnagchenchne