CGNet: A Light-weight Context Guided Network for Semantic Segmentation

Introduction

The demand of applying semantic segmentation model on mobile devices has been increasing rapidly. Current state-of-the-art networks have enormous amount of parameters, hence unsuitable for mobile devices, while other small memory footprint models follow the spirit of classification network and ignore the inherent characteristic of semantic segmentation. To tackle this problem, we propose a novel Context Guided Network (CGNet), which is a light-weight and efficient network for semantic segmentation. We first propose the Context Guided (CG) block, which learns the joint feature of both local feature and surrounding context, and further improves the joint feature with the global context. Based on the CG block, we develop CGNet which captures contextual information in all stages of the network and is specially tailored for increasing segmentation accuracy. CGNet is also elaborately designed to reduce the number of parameters and save memory footprint. Under an equivalent number of parameters, the proposed CGNet significantly outperforms existing segmentation networks. Extensive experiments on Cityscapes and CamVid datasets verify the effectiveness of the proposed approach. Specifically, without any post-processing and multi-scale testing, the proposed CGNet achieves 64.8% mean IoU on Cityscapes with less than 0.5 M parameters.

Installation

Install PyTorch

Env: PyTorch_0.4; cuda_9.2; cudnn_7.5; python_3.6

Clone the repository

git clone https://github.com/wutianyiRosun/CGNet.git 
cd CGNet

Dataset

Download the Cityscapes dataset and convert the dataset to 19 categories. It should have this basic structure.

 cityscapes_test_list.txt
 cityscapes_train_list.txt
 cityscapes_trainval_list.txt
 cityscapes_val_list.txt
 cityscapes_val.txt
 gtCoarse
    train
    train_extra
    val
 gtFine
    test
    train
    val
 leftImg8bit
    test
    train
    val
 license.txt

Download the Camvid dataset. It should have this basic structure.

 camvid_test_list.txt
 camvid_train_list.txt
 camvid_trainval_list.txt
 camvid_val_list.txt
 test
 testannot
 train
 trainannot
 val
 valannot

Train your own model

For Cityscapes

training on train set

python cityscapes_train.py --gpus 0,1 --dataset cityscapes --train_type ontrain --train_data_list ./dataset/list/Cityscapes/cityscapes_train_list.txt --max_epochs 300

training on train+val set

python cityscapes_train.py --gpus 0,1 --dataset cityscapes --train_type ontrainval --train_data_list ./dataset/list/Cityscapes/cityscapes_trainval_list.txt --max_epochs 350

Evaluation (on validation set)

python cityscapes_eval.py --gpus 0 --val_data_list ./dataset/list/Cityscapes/cityscapes_val_list.txt --resume ./checkpoint/cityscapes/CGNet_M3N21bs16gpu2_ontrain/model_cityscapes_train_on_trainset.pth

model file download: model_cityscapes_train_on_trainset.pth

Testing (on test set)

python cityscapes_test.py --gpus 0 --test_data_list ./dataset/list/Cityscapes/cityscapes_test_list.txt --resume ./checkpoint/cityscapes/CGNet_M3N21bs16gpu2_ontrainval/model_cityscapes_train_on_trainvalset.pth

model file download: model_cityscapes_train_on_trainvalset.pth

Running time on Tesla V100 (single card single batch)

56.8 ms with command "torch.cuda.synchronize()"
20.0 ms w/o command "torch.cuda.synchronize()"

For Camvid

training on train+val set

python camvid_train.py

testing (on test set)

python camvid_test.py

model file download: model_camvid_train_on_trainvalset.pth

Citation

If CGNet is useful for your research, please consider citing:

  @article{wu2020cgnet,
  title={Cgnet: A light-weight context guided network for semantic segmentation},
  author={Wu, Tianyi and Tang, Sheng and Zhang, Rui and Cao, Juan and Zhang, Yongdong},
  journal={IEEE Transactions on Image Processing},
  volume={30},
  pages={1169--1179},
  year={2020},
  publisher={IEEE}
}

License

This code is released under the MIT License. See LICENSE for additional details.

Thanks to the Third Party Libs

https://github.com/speedinghzl/Pytorch-Deeplab.

Related Projects

pix2pixHD

Synthesizing and manipulating 2048x1024 images with conditional GANs

01 Dec 2017 6,623

deep_gcns_torch

Pytorch Repo for DeepGCNs (ICCV'2019 Oral, TPAMI'2021), DeeperGCN (arXiv'2020) and GNN1000(ICML'2...

30 Jul 2019 1,133

Pointnet_Pointnet2_pytorch

PointNet and PointNet++ implemented by pytorch (pure python) and on ModelNet, ShapeNet and S3DIS.

04 Mar 2019 3,579

deep_gcns_torch

Pytorch Repo for DeepGCNs (ICCV'2019 Oral, TPAMI'2021), DeeperGCN (arXiv'2020) and GNN1000(ICML'2...

30 Jul 2019 1,133

ResNeSt

ResNeSt: Split-Attention Networks

15 Mar 2020 3,225

CCNet

CCNet: Criss-Cross Attention for Semantic Segmentation (TPAMI 2020 & ICCV 2019).

26 Nov 2018 1,419

faster-rcnn.pytorch

A faster pytorch implementation of faster r-cnn

03 Aug 2017 7,665

nanodet

NanoDet-Plus⚡Super fast and lightweight anchor-free object detection model. 🔥Only 980 KB(int8) / ...

19 Oct 2020 5,698

DeepLabV3Plus-Pytorch

Pretrained DeepLabv3 and DeepLabv3+ for Pascal VOC & Cityscapes

28 Feb 2019 1,940

Codes-for-Lane-Detection

Learning Lightweight Lane Detection CNNs by Self Attention Distillation (ICCV 2019)

12 Oct 2018 1,040

pytorch-semseg

Semantic Segmentation Architectures Implemented in PyTorch

22 Mar 2017 3,384

cssegmentation

CSSegmentation: An Open Source Continual Semantic Segmentation Toolbox Based on PyTorch.

19 Mar 2023 27