This repository contains the official implementation of the research paper, "FastViT: A Fast Hybrid Vision Transformer using Structural Reparameterization" ICCV 2023
OTHER License
This repository contains the official implementation of the research paper, "An Improved One mill...
4M: Massively Multimodal Masked Modeling
SlowFast-LLaVA: A Strong Training-Free Baseline for Video Large Language Models
This repository contains the official implementation of the research paper, "MobileCLIP: Fast Ima...
CVNets: A library for training computer vision networks
A light-weight implementation of ICCV2023 paper "Reinforce Data, Multiply Impact: Improved Model ...
This is an official implementation for "AutoFocusFormer: Image Segmentation off the Grid".
Research publication code for "Forward Compatible Training for Large-Scale Embedding Retrieval Sy...