This repository contains the official implementation of the research paper, "MobileCLIP: Fast Image-Text Models through Multi-Modal Reinforced Training" CVPR 2024
OTHER License
CVNets: A library for training computer vision networks
CoreNet: A library for training deep neural networks
This is an official implementation for "AutoFocusFormer: Image Segmentation off the Grid".
4M: Massively Multimodal Masked Modeling
The official repo for the paper "VeCLIP: Improving CLIP Training via Visual-enriched Captions"
This repository contains the official implementation of the research paper, "FastViT: A Fast Hybr...
This repository contains the official implementation of the research paper, "An Improved One mill...