Pytorch version of Hinton's Capsule Theory paper: Dynamic Routing Between Capsules
Some CapsNet implementations online have subtle bugs that are hard to notice, since MNIST is simple enough that even a buggy network can reach satisfying accuracy.
Corresponding pipeline: Input > Conv1 > Caps (CNN inside) > Route > Loss
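Both the Caps and Route stages rely on the paper's squash non-linearity, which shrinks short vectors toward zero and long vectors toward unit length while preserving direction. A minimal PyTorch sketch (the standalone function below is illustrative, not necessarily this repo's exact API):

```python
import torch

def squash(s, dim=-1, eps=1e-8):
    # squash(s) = ||s||^2 / (1 + ||s||^2) * s / ||s||  (Eq. 1 in the paper)
    sq_norm = (s ** 2).sum(dim=dim, keepdim=True)
    scale = sq_norm / (1.0 + sq_norm)
    return scale * s / torch.sqrt(sq_norm + eps)

# a batch of 10 capsule output vectors of dimension 16
v = squash(torch.randn(10, 16))
```

The output vectors keep their direction but always have length strictly below 1, so the capsule length can be read as a probability.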
Caps layer: by re-writing the function `create_cell_fn`, you can implement your own sub-network inside the Caps layer.

```python
def create_cell_fn(self):
    """
    create sub-network inside a capsule.
    :return:
    """
    conv1 = nn.Conv2d(self.conv1_kernel_num, self.caps1_conv_kernel_num,
                      kernel_size=self.caps1_conv_kernel_size,
                      stride=self.caps1_conv1_stride)
    # relu = nn.ReLU(inplace=True)
    # net = nn.ReLU(inplace=True) could be chained via nn.Sequential(conv1, relu)
    return conv1
```
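For example, a custom override could enable the commented-out Conv + ReLU variant. The attribute names follow the snippet above, and the hyper-parameter values in `_Cfg` are stand-ins matching the paper's Conv1/PrimaryCaps setup, not necessarily this repo's defaults:

```python
import torch
import torch.nn as nn

def create_cell_fn(self):
    """Example override: a Conv + ReLU sub-network inside each capsule."""
    conv1 = nn.Conv2d(self.conv1_kernel_num,
                      self.caps1_conv_kernel_num,
                      kernel_size=self.caps1_conv_kernel_size,
                      stride=self.caps1_conv1_stride)
    relu = nn.ReLU(inplace=True)
    return nn.Sequential(conv1, relu)

# quick shape check with stand-in hyper-parameters
class _Cfg:
    conv1_kernel_num = 256
    caps1_conv_kernel_num = 32
    caps1_conv_kernel_size = 9
    caps1_conv1_stride = 2

cell = create_cell_fn(_Cfg())
out = cell(torch.randn(1, 256, 20, 20))  # 20x20 input, 9x9 kernel, stride 2 -> 6x6
```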
Route layer: you can make use of the Caps layer and Route layer to construct any type of network.

I'm a Research Assistant @ National University of Singapore. Before joining NUS, I was a first-year PhD candidate at Zhejiang University and then quit. Contact me by email: [email protected] or WeChat: dragen1860
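For reference, the routing-by-agreement procedure from the paper that the Route layer implements can be sketched as follows. The tensor shapes and the helper `squash` are illustrative assumptions, not this repo's exact internals:

```python
import torch
import torch.nn.functional as F

def squash(s, dim=-1, eps=1e-8):
    # paper's non-linearity: keeps direction, bounds length below 1
    sq = (s ** 2).sum(dim=dim, keepdim=True)
    return sq / (1.0 + sq) * s / torch.sqrt(sq + eps)

def dynamic_routing(u_hat, num_iters=3):
    """
    u_hat: prediction vectors, shape [batch, in_caps, out_caps, out_dim].
    Returns output capsules v, shape [batch, out_caps, out_dim].
    """
    b = torch.zeros(u_hat.shape[:3], device=u_hat.device)  # routing logits
    for _ in range(num_iters):
        c = F.softmax(b, dim=2)                        # coupling coefficients
        s = (c.unsqueeze(-1) * u_hat).sum(dim=1)       # weighted sum over input caps
        v = squash(s)                                  # [batch, out_caps, out_dim]
        b = b + (u_hat * v.unsqueeze(1)).sum(dim=-1)   # agreement update
    return v

# 1152 primary capsules routed to 10 digit capsules of dimension 16
v = dynamic_routing(torch.randn(2, 1152, 10, 16))
```

The paper uses 3 routing iterations, which is also the setting benchmarked in the results table below.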
$ conda install pytorch torchvision cuda80 -c soumith
$ git clone https://github.com/dragen1860/CapsNet-Pytorch.git
$ cd CapsNet-Pytorch
Set `glo_batch_size = 125` to an appropriate size according to your GPU memory, then run:

$ python main.py
$ tensorboard --logdir runs
Alternatively, you can comment out the training part of the code and test performance with the pretrained model `mdl` file.
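The Loss stage of the pipeline is the margin loss from the paper. A minimal sketch, assuming the network outputs one capsule length per class (the constants m+ = 0.9, m- = 0.1, lambda = 0.5 are the paper's values):

```python
import torch

def margin_loss(v_norm, target, m_pos=0.9, m_neg=0.1, lam=0.5):
    """
    v_norm: capsule lengths, shape [batch, num_classes].
    target: one-hot labels, same shape.
    """
    pos = target * torch.clamp(m_pos - v_norm, min=0) ** 2
    neg = lam * (1.0 - target) * torch.clamp(v_norm - m_neg, min=0) ** 2
    return (pos + neg).sum(dim=1).mean()

# a confident, correct prediction incurs zero loss
target = torch.eye(10)[:2]                    # one-hot labels for classes 0 and 1
v_norm = target * 0.95 + (1 - target) * 0.05  # long correct capsule, short others
loss = margin_loss(v_norm, target)            # -> 0.0
```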
| Model | Routing | Reconstruction | MNIST test error (%) |
|---|---|---|---|
| Baseline | - | - | 0.39 |
| Paper | 3 | no | 0.35 |
| Ours | 3 | no | 0.34 |
It takes about 150s per epoch on a single GTX 970 4GB card.
Other implementations:

- Keras:
- TensorFlow:
- MXNet:
- Lasagne (Theano):
- Chainer: