Zero-Shot Image Classification
Transformers
Safetensors
siglip
vision

The accuracy on the ImageNet dataset is low

#13
by qingshuiL - opened

I used clip_benchmark to evaluate the model weights, and the accuracy on imagNet-1K is only 69.8. Is there anything to note here?

Sign up or log in to comment