Summary of related papers on visual attention. Related code will be released based on Jittor gradually.
Implementation of Vision Transformer, a simple way to achieve SOTA in vision classification with ...
A paper list of object detection using deep learning.
Related papers and codes for vision-based robotic grasping
Implementation of various self-attention mechanisms focused on computer vision. Ongoing repository.
Awesome work on hand pose estimation/tracking
Implementation of the Equiformer, SE3/E3 equivariant attention network that reaches new SOTA, and...
[CVPR 2023 Highlight] InternImage: Exploring Large-Scale Vision Foundation Models with Deformable...
MiniSora: A community aims to explore the implementation path and future development direction of...
Implementation of vision transformer. ⭐⭐⭐
A curated list of image inpainting and video inpainting papers and resources
Papers and Benchmarks about semantic segmentation, instance segmentation, panoptic segmentation a...
TRI-ML Monocular Depth Estimation Repository
CVPR 2024 论文和开源项目合集
Implementation of Make-A-Video, new SOTA text to video generator from Meta AI, in Pytorch
ICLR 2024 论文和开源项目合集