This repository contains the model weights described in the paper OpenVision: A Fully-Open, Cost-Effective Family of Advanced Vision Encoders for Multimodal Learning.
Project page: https://ucsc-vlaa.github.io/OpenVision
Code: https://github.com/UCSC-VLAA/OpenVision
Run this model on powerful GPU infrastructure. Deploy in 60 seconds.
Deploy on H100, A100, or RTX GPUs. Pay only for what you use. No setup required.