Awesome vision transformer
A curated list of vision transformer related resources, including survey, paper, source code, etc.
Maintainer: Murufeng
We are looking for a maintainer! Let me know if interested.
Please feel free to pull requests or open an issue to add papers and source codes.
Table of Contents
MLP系列
Awesome Survey
-
A Survey of Transformers
- 论文作者&单位: 复旦大学邱锡鹏团队; Tianyang Lin, Yuxin Wang, Xiangyang Liu, Xipeng Qiu
- 时间: 2021.6.08
-
A Survey on Visual Transformer
- 论文作者&单位: 华为诺亚方舟; Kai Han, Yunhe Wang, Hanting Chen, etc
- 2021.1.30
-
Transformers in Vision: A Survey
- 论文作者:Salman Khan, Muzammal Naseer, Munawar Hayat, Syed Waqas Zamir, Fahad Shahbaz Khan, Mubarak Shah
- 时间: 2021.1.4
论文解读
Paper(最新,最受关注的)
微软Transformer霸榜模型
ViT系列变种
魔改算子
Local & Hierarchical & multi-scale
Transformer+卷积结合
Transformer模型压缩轻量化处理
DETR变种
Transformer+各类task迁移
Transformer+目标检测
2.超分辨率(Super-Resolution)
-
[TTSR] Learning Texture Transformer Network for Image Super-Resolution (CVPR)
3. 图像分割、语义分割(Segmentation)
4.GAN/生成式/对抗式(GAN/Generative/Adversarial)
-
[GANsformer] Generative Adversarial Transformers
-
[TransGAN]: Two Transformers Can Make One Strong GAN
-
[AOT-GAN] Aggregated Contextual Transformations for High-Resolution Image Inpainting
5.track
6.video
7.多模态结合
- ViLT: Vision-and-Language Transformer Without Convolution or Region Supervision
8.人体姿态估计
9.神经网络架构搜索NAS
10.人脸识别
11.行人重识别
12.密集人群检测
-
[TransCrowd] TransCrowd: Weakly-Supervised Crowd Counting with Transformer
13.医学图像处理
14.图像风格迁移
- StyTr2: Unbiased Image Style Transfer with Transformers
15.low level vision(去噪,去雨,复原,去模糊等等)
其它
模型代码复现
MLP
Attention
Transformer