OpenVision

This repository contains the model for the OpenVision encoder described in OpenVision: A Fully-Open, Cost-Effective Family of Advanced Vision Encoders for Multimodal Learning.

The models provide image features and are suitable for use in multimodal systems.

Project page: https://ucsc-vlaa.github.io/OpenVision Code: https://github.com/UCSC-VLAA/OpenVision.

Downloads last month
98
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Collection including UCSC-VLAA/openvision-vit-base-patch16-384