GradientAccumulator

Accumulated Gradients for TensorFlow 2

MIT License

GradientAccumulator - v0.5.2

Published by andreped about 1 year ago

New feature

The main feature of this patch release is that AccumBN can now be used as a drop-in replacement for any BatchNormalization layer, even in pretrained networks. Old weights are transferred correctly, and the documentation has been updated to describe how to do this.

import tensorflow as tf
from gradient_accumulator import GradientAccumulateModel
from gradient_accumulator.layers import AccumBatchNormalization
from gradient_accumulator.utils import replace_batchnorm_layers

accum_steps = 4

# replace BN layer with AccumBatchNormalization
model = tf.keras.applications.MobileNetV2(input_shape=(32, 32, 3))  # spatial size must be >= 32 for the default ImageNet weights
model = replace_batchnorm_layers(model, accum_steps=accum_steps)

# add gradient accumulation to existing model
model = GradientAccumulateModel(accum_steps=accum_steps, inputs=model.input, outputs=model.output)

What's Changed

Full Changelog: https://github.com/andreped/GradientAccumulator/compare/v0.5.1...v0.5.2

GradientAccumulator - v0.5.1

Published by andreped over 1 year ago

Announcement

This patch release adds support for all tf versions 2.2-2.12 and Python 3.6-3.11. The model wrapper should work as intended for all combinations, whereas the optimizer wrapper is only compatible with tf>=2.8, with degraded performance on tf>=2.10.
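As a hedged illustration of that compatibility note, one could guard on the installed tf version before picking a wrapper; the version check itself is not part of the release, just a way to act on the note above:

import tensorflow as tf

# the model wrapper works across tf 2.2-2.12, while the optimizer
# wrapper needs tf >= 2.8 (per the compatibility note above)
tf_version = tuple(int(x) for x in tf.__version__.split(".")[:2])

if tf_version >= (2, 8):
    from gradient_accumulator import GradientAccumulateOptimizer
    opt = GradientAccumulateOptimizer(
        accum_steps=4, optimizer=tf.keras.optimizers.SGD(1e-2)
    )
else:
    # fall back to the model wrapper, which supports all tested versions
    from gradient_accumulator import GradientAccumulateModel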

What's Changed

Full Changelog: https://github.com/andreped/GradientAccumulator/compare/v0.5.0...v0.5.1

GradientAccumulator - v0.5.0

Published by andreped over 1 year ago

New feature!

  • Multi-GPU support has now been added, for both the optimizer and model wrappers.
  • Note that only SGD works with the model wrapper, due to challenges controlling the optimizer state during gradient accumulation; see the sketch below.
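A minimal sketch of multi-GPU training with the model wrapper; the toy model and strategy setup are illustrative assumptions, not code from the release:

import tensorflow as tf
from gradient_accumulator import GradientAccumulateModel

# mirror variables across all visible GPUs
strategy = tf.distribute.MirroredStrategy()
with strategy.scope():
    inputs = tf.keras.Input(shape=(32,))
    outputs = tf.keras.layers.Dense(10)(inputs)
    model = tf.keras.Model(inputs, outputs)
    model = GradientAccumulateModel(accum_steps=4,
                                    inputs=model.input, outputs=model.output)
    # only SGD is supported with the model wrapper (see note above)
    model.compile(optimizer=tf.keras.optimizers.SGD(1e-2),
                  loss=tf.keras.losses.SparseCategoricalCrossentropy(from_logits=True))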

What's Changed

New Contributors

Full Changelog: https://github.com/andreped/GradientAccumulator/compare/v0.4.2...v0.5.0

GradientAccumulator - v0.4.2

Published by andreped over 1 year ago

What's Changed

Full Changelog: https://github.com/andreped/GradientAccumulator/compare/v0.4.1...v0.4.2

GradientAccumulator - v0.4.1

Published by andreped over 1 year ago

What's Changed

New API

You can now use gradient accumulation with the AccumBatchNormalization layer:

from gradient_accumulator import GradientAccumulateModel, AccumBatchNormalization
from tensorflow.keras.models import Sequential
from tensorflow.keras.layers import Dense

# define model and add accum BN layer
model = Sequential()
model.add(Dense(32, activation="relu", input_shape=(16,)))  # input shape (illustrative) so model.input is defined
model.add(AccumBatchNormalization(accum_steps=8))
model.add(Dense(10))

# add gradient accumulation to the rest of the model
model = GradientAccumulateModel(accum_steps=8, inputs=model.input, outputs=model.output)

More information about usage and other remarks can be found at gradientaccumulator.readthedocs.io.

Full Changelog: https://github.com/andreped/GradientAccumulator/compare/v0.4.0...v0.4.1

GradientAccumulator - v0.4.0

Published by andreped over 1 year ago

What's Changed

  • Added a custom AccumBatchNormalization layer with gradient accumulation support.
  • Added more unit tests -> code coverage = 99%
  • Wrote proper documentation, hosted at gradientaccumulator.readthedocs.io/
  • Reduced the runtime of several unit tests to make CI jobs faster
  • Added a protobuf fix for tfds in the CIs
  • Reworked the README - moved most of its content to the documentation and added a CI section w/ badges
  • Header image by @jpdefrutos in https://github.com/andreped/GradientAccumulator/pull/51

New Contributors

New API feature

from gradient_accumulator import AccumBatchNormalization

layer = AccumBatchNormalization(accum_steps=4)

It can be used like a regular Keras BatchNormalization layer, but with reduced functionality.

Full Changelog: https://github.com/andreped/GradientAccumulator/compare/v0.3.2...v0.4.0

GradientAccumulator - v0.3.2

Published by andreped over 1 year ago

What's changed

Full Changelog: https://github.com/andreped/GradientAccumulator/compare/v0.3.1...v0.3.2

New Contributors

How to install?

pip install gradient-accumulator==0.3.2

New API feature

Custom Batch Normalization layer

from tensorflow.keras.models import Sequential
from gradient_accumulator.layers import AccumBatchNormalization

model = Sequential()
model.add(AccumBatchNormalization())

GradientAccumulator - v0.3.1

Published by andreped over 1 year ago

What's changed

  • Simplified imports - can now directly import accumulators without middle step
  • Renamed GAModelWrapper -> GradientAccumulateModel
  • Renamed GAOptimizerWrapper -> GradientAccumulateOptimizer
  • Updated README and all CIs accordingly
  • Deprecated tensorflow==2.2 due to a tensorflow-addons incompatibility. Now tf >= 2.3 is supported.

Full Changelog: https://github.com/andreped/GradientAccumulator/compare/v0.3.0...v0.3.1

How to install?

pip install gradient-accumulator==0.3.1

New API!

Model wrapper:

from gradient_accumulator import GradientAccumulateModel

model = Model(...)
model = GradientAccumulateModel(accum_steps=4, inputs=model.input, outputs=model.output)

Optimizer wrapper:

import tensorflow as tf
from gradient_accumulator import GradientAccumulateOptimizer

opt = tf.keras.optimizers.SGD(1e-2)
opt = GradientAccumulateOptimizer(accum_steps=4, optimizer=opt)

GradientAccumulator - v0.3.0

Published by andreped over 1 year ago

What's changed

How to install?

pip install gradient-accumulator==0.3.0

How to use?

Method and usage:

  • GAModelWrapper: model = GAModelWrapper(accum_steps=4, inputs=model.input, outputs=model.output)
  • GAOptimizerWrapper: opt = GAOptimizerWrapper(accum_steps=4, optimizer=tf.keras.optimizers.Adam(1e-3))
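A short end-to-end sketch of the model wrapper; the toy model is illustrative, and the direct import path shown was only introduced shortly after, in v0.3.1:

import tensorflow as tf
from gradient_accumulator import GAModelWrapper  # direct import path assumed; simplified in v0.3.1

# toy model; any functional Keras model works
inputs = tf.keras.Input(shape=(16,))
outputs = tf.keras.layers.Dense(1)(inputs)
model = tf.keras.Model(inputs, outputs)

# accumulate gradients over 4 batches before each weight update
model = GAModelWrapper(accum_steps=4, inputs=model.input, outputs=model.output)
model.compile(optimizer=tf.keras.optimizers.Adam(1e-3), loss="mse")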

Full Changelog: https://github.com/andreped/GradientAccumulator/compare/v0.2.2...v0.3.0

GradientAccumulator - v0.2.2

Published by andreped almost 2 years ago

This is a minor patch release.

What's changed:

Full Changelog: https://github.com/andreped/GradientAccumulator/compare/v0.2.1...v0.2.2

GradientAccumulator - v0.2.1

Published by andreped about 2 years ago

This is a minor patch release.

What's changed:

  • Fixed typo by renaming use_acg to use_agc.
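For reference, a minimal sketch of where the corrected flag is used; that use_agc is a GAModelWrapper argument is an assumption based on the AGC support added in v0.1.5:

import tensorflow as tf
from gradient_accumulator import GAModelWrapper  # import path assumed

# toy model just to have something to wrap
inputs = tf.keras.Input(shape=(8,))
outputs = tf.keras.layers.Dense(1)(inputs)
model = tf.keras.Model(inputs, outputs)

# enable adaptive gradient clipping via the corrected flag name
model = GAModelWrapper(accum_steps=4, use_agc=True,
                       inputs=model.input, outputs=model.output)
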
GradientAccumulator - v0.2.0

Published by andreped over 2 years ago

What's Changed

New Contributors

Full Changelog: https://github.com/andreped/GradientAccumulator/compare/v0.1.5...v0.2.0

GradientAccumulator - v0.1.5

Published by andreped over 2 years ago

Changes:

  • Added mixed precision support (currently only float16, which is compatible with NVIDIA GPUs); see the sketch below
  • Added adaptive gradient clipping (AGC) support - a normalization-free approach that works with GA
  • Added a CI test for AGC
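A hedged sketch combining the two features; the toy model is illustrative, and the wrapper's flag names are assumptions pieced together from this changelog:

import tensorflow as tf
from gradient_accumulator import GAModelWrapper  # import path assumed

# enable float16 mixed precision globally (NVIDIA GPUs)
tf.keras.mixed_precision.set_global_policy("mixed_float16")

inputs = tf.keras.Input(shape=(16,))
x = tf.keras.layers.Dense(32, activation="relu")(inputs)
outputs = tf.keras.layers.Dense(1, dtype="float32")(x)  # keep outputs in float32
model = tf.keras.Model(inputs, outputs)

# combine gradient accumulation with adaptive gradient clipping
# (the AGC flag was spelled use_acg in this release; renamed in v0.2.1)
model = GAModelWrapper(accum_steps=4, use_acg=True,
                       inputs=model.input, outputs=model.output)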

Full Changelog: https://github.com/andreped/GradientAccumulator/compare/v0.1.4...v0.1.5

GradientAccumulator - v0.1.4

Published by andreped over 2 years ago

Zenodo DOI release and updated README to contain updated documentation regarding installation and usage.

Changes:

  • Renamed n_gradients to accum_steps.
  • Added citation policy and Zenodo citation

Full Changelog: https://github.com/andreped/GradientAccumulator/compare/v0.1.3...v0.1.4

GradientAccumulator - v0.1.3

Published by andreped over 2 years ago

GradientAccumulator is now available on PyPI :
https://pypi.org/project/gradient-accumulator/#files

Changes:

  • Added experimental mixed precision support
  • Added support for TF >= 2.2
  • Added support for Python >= 3.6
  • Added pytest to CI for unit testing
  • Added CI test for mixed precision
  • Added CI test for multi-input-output models
  • Added CI test for optimizer invariance
  • Added CI test for basic mnist training
  • Added CI test to verify that we get expected result for GA vs regular batch training

Full Changelog: https://github.com/andreped/GradientAccumulator/compare/v0.1.2...v0.1.3

GradientAccumulator - v0.1.2

Published by andreped over 2 years ago

Changes:

  • Fixed a critical bug in the gradient updates (use MEAN reduction instead of SUM reduction)
  • GA now yields identical results to regular batch training
  • Added unit tests with pytest that raise an AssertionError if results differ
  • Added compatibility with sample_weight - GAModelWrapper should now be fully compatible with model.compile/fit; see the sketch below
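A minimal sketch of the now-supported sample_weight path; the toy data and model are illustrative:

import numpy as np
import tensorflow as tf
from gradient_accumulator import GAModelWrapper  # import path assumed

inputs = tf.keras.Input(shape=(4,))
outputs = tf.keras.layers.Dense(1)(inputs)
model = tf.keras.Model(inputs, outputs)
model = GAModelWrapper(accum_steps=2, inputs=model.input, outputs=model.output)
model.compile(optimizer="sgd", loss="mse")

x = np.random.rand(32, 4).astype("float32")
y = np.random.rand(32, 1).astype("float32")
w = np.random.rand(32).astype("float32")  # per-sample weights

# sample_weight now flows correctly through the accumulated train_step
model.fit(x, y, sample_weight=w, batch_size=8, epochs=1)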

Full Changelog: https://github.com/andreped/GradientAccumulator/compare/v0.1.1...v0.1.2

GradientAccumulator - v0.1.1

Published by andreped over 2 years ago

Changes:

  • Replaced the optimizer-wrapper solution with a Model-wrapper solution
  • Enables adding gradient accumulation support to "any" tf.keras.Model by simply overloading the train_step method (see the sketch below)
  • Added the convenience class GAModelWrapper that handles all of this for you - just provide the model!
  • This solution should also be more compatible with older TF versions, as train_step overloading was added already in TF 2.2.
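A minimal sketch of the train_step-overloading idea the wrapper builds on; this is a simplified toy, not the actual GAModelWrapper, which handles sample weights, reduction, and optimizer state more carefully:

import tensorflow as tf

class AccumModel(tf.keras.Model):
    """Toy gradient-accumulation model via train_step overloading."""

    def __init__(self, accum_steps=4, **kwargs):
        super().__init__(**kwargs)  # pass inputs/outputs through to keras.Model
        self.accum_steps = accum_steps
        self._step = 0
        self._accum = None  # gradient buffers, created on first step

    def train_step(self, data):
        # NOTE: plain Python control flow below, so compile the model
        # with run_eagerly=True for this sketch to work
        x, y = data
        with tf.GradientTape() as tape:
            y_pred = self(x, training=True)
            # scale the loss so the accumulated gradients average out
            loss = self.compiled_loss(y, y_pred) / self.accum_steps
        grads = tape.gradient(loss, self.trainable_variables)
        if self._accum is None:
            self._accum = [tf.Variable(tf.zeros_like(v), trainable=False)
                           for v in self.trainable_variables]
        for buf, g in zip(self._accum, grads):
            buf.assign_add(g)
        self._step += 1
        if self._step % self.accum_steps == 0:
            # apply the accumulated gradients, then reset the buffers
            self.optimizer.apply_gradients(
                zip(self._accum, self.trainable_variables))
            for buf in self._accum:
                buf.assign(tf.zeros_like(buf))
        self.compiled_metrics.update_state(y, y_pred)
        return {m.name: m.result() for m in self.metrics}

# usage, mirroring the wrapper's style:
#   base = tf.keras.Model(...)
#   model = AccumModel(accum_steps=4, inputs=base.input, outputs=base.output)
#   model.compile(optimizer="sgd", loss="mse", run_eagerly=True)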

Full Changelog: https://github.com/andreped/GradientAccumulator/compare/v0.1.0...v0.1.1

GradientAccumulator - v0.1.0

Published by andreped over 2 years ago

First release of the GradientAccumulator package that enables usage of accumulated gradients in TensorFlow 2.x by simply wrapping an optimizer.

Currently compatible with Python 3.7-3.9, tested with TensorFlow 2.8.0 and 2.9.1, and cross-platform (Windows, Ubuntu, and macOS).
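A hedged sketch of that first optimizer-wrapping API. The class name AccumOptimizer here is hypothetical (these notes don't give the original name); the n_gradients parameter is taken from the v0.1.4 rename note above:

import tensorflow as tf
# hypothetical name - the exact v0.1.0 wrapper class isn't named in these notes
from gradient_accumulator import AccumOptimizer

opt = tf.keras.optimizers.SGD(1e-2)
# accumulate gradients over n_gradients batches before each weight update
# (n_gradients was later renamed to accum_steps in v0.1.4)
opt = AccumOptimizer(opt, n_gradients=4)

model = tf.keras.Sequential([tf.keras.layers.Dense(1, input_shape=(4,))])
model.compile(optimizer=opt, loss="mse")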

Full Changelog: https://github.com/andreped/GradientAccumulator/commits/v0.1.0