torchdistill

A coding-free framework built on PyTorch for reproducible deep learning studies. 🏆 25 knowledge distillation methods presented at CVPR, ICLR, ECCV, NeurIPS, ICCV, etc. are implemented so far. 🎁 Trained models, training logs, and configurations are available to ensure reproducibility and benchmarking.
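For a rough sense of what a knowledge distillation method computes, here is the vanilla KD loss (Hinton et al.) written in plain PyTorch. This is an illustrative sketch, not torchdistill's own API, and the temperature/alpha values are arbitrary.

```python
import torch.nn.functional as F

def vanilla_kd_loss(student_logits, teacher_logits, targets, temperature=4.0, alpha=0.9):
    """Weighted sum of the softened KL term and standard cross entropy.
    Hyperparameter values here are illustrative, not torchdistill defaults."""
    soft_targets = F.softmax(teacher_logits / temperature, dim=1)
    log_probs = F.log_softmax(student_logits / temperature, dim=1)
    kd_term = F.kl_div(log_probs, soft_targets, reduction='batchmean') * temperature ** 2
    ce_term = F.cross_entropy(student_logits, targets)
    return alpha * kd_term + (1.0 - alpha) * ce_term
```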

MIT License · 1.6K downloads · 1.4K stars · 3 committers


torchdistill - Support more detailed training configs and update official configs

Published by yoshitomo-matsubara over 3 years ago

Updated official README and configs

  • More detailed instructions (PRs #55, #56)
  • Restructured official configs (PR #55)
  • Updated FT config for ImageNet (PR #55)

Support detailed training configurations

  • Step-wise parameter updates in addition to epoch-wise updates (PR #58)
  • Gradient accumulation (PR #58)
  • Max gradient norm (PR #58)
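Conceptually, the gradient accumulation and max-gradient-norm options correspond to a training loop like the following minimal plain-PyTorch sketch (names such as accumulation_steps and max_grad_norm are illustrative, not the framework's config keys):

```python
import torch

def train_one_epoch(model, loader, optimizer, criterion, accumulation_steps=4, max_grad_norm=1.0):
    model.train()
    optimizer.zero_grad()
    for step, (inputs, targets) in enumerate(loader):
        loss = criterion(model(inputs), targets)
        # Scale the loss so that accumulated gradients match one larger batch.
        (loss / accumulation_steps).backward()
        if (step + 1) % accumulation_steps == 0:
            # Clip the global gradient norm before the parameter update.
            torch.nn.utils.clip_grad_norm_(model.parameters(), max_grad_norm)
            optimizer.step()
            optimizer.zero_grad()
```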

Bug/Typo fixes

  • Bug fixes (PRs #54, #57)
  • Typo fixes (PRs #53, #58)

torchdistill - Google Colab Examples and bug fixes

Published by yoshitomo-matsubara almost 4 years ago

New examples

  • Added sample configs for CIFAR-10 and CIFAR-100 datasets
    1. Training without teacher (i.e., using TrainingBox) for CIFAR-10 and CIFAR-100 (PR #48)
    2. Knowledge distillation for CIFAR-10 and CIFAR-100 (PR #50)
  • Added Google Colab examples (PR #51)
    1. Training without teacher for CIFAR-10 and CIFAR-100
    2. Knowledge distillation for CIFAR-10 and CIFAR-100
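"Training without teacher" here is ordinary supervised training. A minimal plain-PyTorch/torchvision sketch for CIFAR-10 (not the config-driven TrainingBox workflow itself; the model choice and hyperparameters are arbitrary) looks like this:

```python
import torch
import torchvision
from torch.utils.data import DataLoader
from torchvision import transforms

transform = transforms.Compose([
    transforms.ToTensor(),
    transforms.Normalize((0.4914, 0.4822, 0.4465), (0.2470, 0.2435, 0.2616)),
])
train_set = torchvision.datasets.CIFAR10(root='./data', train=True, download=True, transform=transform)
train_loader = DataLoader(train_set, batch_size=128, shuffle=True, num_workers=2)

model = torchvision.models.resnet18(num_classes=10)
optimizer = torch.optim.SGD(model.parameters(), lr=0.1, momentum=0.9, weight_decay=5e-4)
criterion = torch.nn.CrossEntropyLoss()

model.train()
for inputs, targets in train_loader:
    optimizer.zero_grad()
    loss = criterion(model(inputs), targets)
    loss.backward()
    optimizer.step()
```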

Bug fixes

  • Fixed a bug in init of DenseNet-BC (PR #48)
  • Resolved checkpoint name conflicts (PR #49)

torchdistill - TrainingBox, PyTorch Hub, random split, pretrained models for CIFAR-10 and CIFAR-100 datasets

Published by yoshitomo-matsubara almost 4 years ago

New features

  • Added TrainingBox to train models without teachers (PR #39)
  • Supported PyTorch Hub in registry (PR #40)
  • Supported random splits, e.g., splitting a training dataset into training and validation sets (PR #41)
  • Added reimplemented models for CIFAR-10 and CIFAR-100 datasets (PR #41)
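The PyTorch Hub and random-split features map onto standard PyTorch calls; a minimal sketch (the torchvision hub entry, CIFAR-100 dataset, and 90/10 split ratio are just examples):

```python
import torch
import torchvision
from torch.utils.data import random_split

# Load a model through PyTorch Hub (any repo/entry point published via hubconf.py works).
# `weights=None` requires a recent torchvision; older versions use `pretrained=False`.
model = torch.hub.load('pytorch/vision', 'resnet18', weights=None)

# Randomly split a training dataset into training and validation subsets.
dataset = torchvision.datasets.CIFAR100(root='./data', train=True, download=True,
                                        transform=torchvision.transforms.ToTensor())
num_val = len(dataset) // 10
train_subset, val_subset = random_split(
    dataset, [len(dataset) - num_val, num_val],
    generator=torch.Generator().manual_seed(42))
```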

Pretrained models

Reference repositories were consulted for the training methods.

Note that there are some accuracy gaps between these models and those reported in their original studies.

Model                          CIFAR-10 (%)  CIFAR-100 (%)
ResNet-20                      91.92         N/A
ResNet-32                      93.03         N/A
ResNet-44                      93.20         N/A
ResNet-56                      93.57         N/A
ResNet-110                     93.50         N/A
WRN-40-4                       95.24         79.44
WRN-28-10                      95.53         81.27
WRN-16-8                       94.76         79.26
DenseNet-BC (k=12, depth=100)  95.53         77.14

torchdistill - Extended ForwardHookManager and bug fix

Published by yoshitomo-matsubara almost 4 years ago

  • Extended ForwardHookManager (Issue #32, PR #33)
  • Fixed bugs in the post_forward function caused by the gathering paradigm introduced for the I/O dict (Issue #34, PR #35)
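For context, ForwardHookManager gathers the inputs/outputs of selected modules into an I/O dict during a forward pass. Below is a minimal sketch along the lines of the documented usage (the torchvision model and the 'layer2' module path are just examples; the exact dict layout may differ across versions):

```python
import torch
from torchvision import models
from torchdistill.core.forward_hook import ForwardHookManager

device = torch.device('cpu')
model = models.resnet18()

# Register a hook that stores the input tensor of model.layer2.
forward_hook_manager = ForwardHookManager(device)
forward_hook_manager.add_hook(model, 'layer2', requires_input=True, requires_output=False)

x = torch.rand(4, 3, 224, 224)
model.eval()
with torch.no_grad():
    model(x)

# Pop the gathered I/O dict and read the stored intermediate representation.
io_dict = forward_hook_manager.pop_io_dict()
layer2_input = io_dict['layer2']['input']
print(layer2_input.shape)
```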

torchdistill - The first release of torchdistill

Published by yoshitomo-matsubara almost 4 years ago

torchdistill

The first release of torchdistill, with code and assets for "torchdistill: A Modular, Configuration-Driven Framework for Knowledge Distillation".