UCSC-VLAA
/

openvision-vit-base-patch16-384

Image Feature Extraction

Model card Files Files and versions

OpenVision

This repository contains the model for the OpenVision encoder described in OpenVision: A Fully-Open, Cost-Effective Family of Advanced Vision Encoders for Multimodal Learning.

The models provide image features and are suitable for use in multimodal systems.

Project page: https://ucsc-vlaa.github.io/OpenVision Code: https://github.com/UCSC-VLAA/OpenVision.

Downloads last month: 8

Inference Providers NEW

Image Feature Extraction

This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Collection including UCSC-VLAA/openvision-vit-base-patch16-384

OpenVision

27 items • Updated Aug 15, 2025 • 33

Paper for UCSC-VLAA/openvision-vit-base-patch16-384

OpenVision: A Fully-Open, Cost-Effective Family of Advanced Vision Encoders for Multimodal Learning

Paper • 2505.04601 • Published May 7, 2025 • 29