tensorflow-isan-rnn

Input Switched Affine Networks: An RNN Architecture Designed for Interpretability. http://proceedings.mlr.press/v70/foerster17a/foerster17a.pdf


Input Switched Affine Networks: An RNN Architecture Designed for Interpretability (ICML 2017)

There exist many problem domains where the interpretability of neural network models is essential for deployment. Here we introduce a recurrent architecture composed of input-switched affine transformations, in other words an RNN without any explicit nonlinearities, but with input-dependent recurrent weights. This simple form allows the RNN to be analyzed via straightforward linear methods: we can exactly characterize the linear contribution of each input to the model predictions; we can use a change of basis to disentangle input, output, and computational hidden unit subspaces; we can fully reverse-engineer the architecture's solution to a simple task. Despite this ease of interpretation, the input switched affine network achieves reasonable performance on a text modeling task, and allows greater computational efficiency than networks with standard nonlinearities. --Abstract
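As a concrete illustration of the switched-affine recurrence, here is a minimal NumPy sketch. It is not the repo's actual code; names such as `isan_forward`, `W`, and `b` are illustrative. Because every step is affine, the final hidden state decomposes exactly into a sum of per-input contributions, which is the property the paper exploits for interpretability.

```python
import numpy as np

# Minimal sketch of an Input Switched Affine Network (ISAN) step.
# All names and sizes here are illustrative assumptions, not the repo's API.

rng = np.random.default_rng(0)
vocab_size, hidden_size = 5, 16

# One affine transformation (W[x], b[x]) per input symbol x.
W = rng.normal(scale=0.1, size=(vocab_size, hidden_size, hidden_size))
b = rng.normal(scale=0.1, size=(vocab_size, hidden_size))

def isan_forward(tokens, h0=None):
    """Run the switched-affine recurrence h_t = W[x_t] @ h_{t-1} + b[x_t]."""
    h = np.zeros(hidden_size) if h0 is None else h0
    states = []
    for x in tokens:
        h = W[x] @ h + b[x]  # no nonlinearity: the input only switches the weights
        states.append(h)
    return states

def input_contributions(tokens):
    """Exact per-input contributions to h_T (with h0 = 0):
    h_T = sum_s (W[x_T] @ ... @ W[x_{s+1}]) @ b[x_s]."""
    T = len(tokens)
    contribs = []
    for s in range(T):
        v = b[tokens[s]]
        for t in range(s + 1, T):
            v = W[tokens[t]] @ v
        contribs.append(v)
    return contribs

tokens = [0, 3, 1, 1, 4]
h_T = isan_forward(tokens)[-1]
# The linearity of the updates makes the decomposition exact, not approximate.
assert np.allclose(h_T, np.sum(input_contributions(tokens), axis=0))
```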

Parenthesis Task

The implementation was trained on the paper's parenthesis counting task; a sketch of the task setup follows below.

[Result plot]
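The parenthesis task asks the network to track, at every time step, the clipped nesting depth of each parenthesis type while ignoring noise characters. Below is a hedged sketch of a data generator for such a task; the alphabet, depth cap, and sequence length are assumptions, not the repo's exact configuration.

```python
import numpy as np

# Hedged sketch of a parenthesis-counting dataset, loosely following the
# paper's description. Alphabet, depth cap, and length are assumptions.

rng = np.random.default_rng(0)

PAREN_PAIRS = [("(", ")"), ("[", "]")]  # assumed paren types
NOISE = list("abc")                     # assumed noise characters
MAX_DEPTH = 5                           # assumed cap on the counted depth
ALPHABET = [c for pair in PAREN_PAIRS for c in pair] + NOISE

def sample_example(length=50):
    """Random character sequence plus, per paren type, its clipped nesting depth."""
    chars = rng.choice(ALPHABET, size=length)
    depths = np.zeros((length, len(PAREN_PAIRS)), dtype=int)
    counts = [0] * len(PAREN_PAIRS)
    for t, c in enumerate(chars):
        for i, (opener, closer) in enumerate(PAREN_PAIRS):
            if c == opener:
                counts[i] = min(counts[i] + 1, MAX_DEPTH)  # clip at the cap
            elif c == closer:
                counts[i] = max(counts[i] - 1, 0)          # never below zero
        depths[t] = counts
    return "".join(chars), depths

seq, targets = sample_example()
print(seq[:20], targets[:5])
```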

Resources

Contributing

Thanks to Justin Gilmer, one of the authors of the paper, for providing some source code under the Apache license.
