fairscale

PyTorch extensions for high performance and large scale training.

OTHER License

Stars
3.2K
Committers
70

Bot releases are hidden (Show)

fairscale - 0.4.3 release

Published by min-xu-ai almost 3 years ago

What's Changed

New Contributors

Full Changelog: https://github.com/facebookresearch/fairscale/compare/v0.4.1...v0.4.3

fairscale - FairScale Release v0.4.2

Published by anupambhatnagar almost 3 years ago

fairscale - v0.4.1: [chore] 0.4.1 release (#803)

Published by tmarkstrum about 3 years ago

Released version 0.4.1 for FairScale.

fairscale -

Published by min-xu-ai about 3 years ago

fairscale -

Published by min-xu-ai about 3 years ago

fairscale -

Published by anj-s over 3 years ago

fairscale -

Published by min-xu-ai over 3 years ago

fairscale -

Published by min-xu-ai over 3 years ago

fairscale -

Published by min-xu-ai over 3 years ago

fairscale - v0.3.4

Published by blefaudeux over 3 years ago

[0.3.4] - 2021-04-13

Added

  • FSDP: Add no broadcast optim state option (#560)

Fixed

  • ShardedDDP: Properly handle .eval() mode (#587)
  • ShardedDDP: Handle model being moved back to CPU prior to state consolidation (#573)
  • FSDP: much faster state consolidation (#595)
  • FSDP: Add gradient pre-divide to prevent overflow with large world sizes (#565)
  • Offload: (experimental) Fix activation offloading to CPU (#588
fairscale -

Published by min-xu-ai over 3 years ago

fairscale -

Published by min-xu-ai over 3 years ago

fairscale -

Published by min-xu-ai over 3 years ago

fairscale - v0.3.0

Published by blefaudeux over 3 years ago

[0.3.0] - 2021-02-22

Added

  • FullyShardedDataParallel (FSDP) (#413)
  • ShardedDDP fp16 grad reduction option (#402)
  • Expose experimental algorithms within the pip package (#410)

Fixed

  • Catch corner case when the model is too small with respect to the world size, and shards are empty (#406)
  • Memory leak in checkpoint_wrapper (#412)
fairscale - v0.1.7

Published by blefaudeux over 3 years ago

Fixed

  • ShardedDDP and OSS handle model trainability changes during training (#369)
  • ShardedDDP state dict load/save bug (#386)
  • ShardedDDP handle train/eval modes (#393)
  • AdaScale handling custom scaling factors (#401)

Added

  • ShardedDDP manual reduce option for checkpointing (#389)
fairscale - v0.1.6

Published by blefaudeux over 3 years ago

Added

  • Checkpointing model wrapper (#376)
  • Faster OSS, flatbuffers (#371)
  • Small speedup in OSS clipgradnorm (#363)

Fixed

  • Bug in ShardedDDP with 0.1.5 depending the init (KeyError / OSS)
  • Much refactoring in Pipe (#357, #358, #360, #362, #370, #373)
  • Better pip integration / resident pytorch (#375)
fairscale - v0.1.5

Published by blefaudeux over 3 years ago

Added

  • Pytorch compatibility for OSS checkpoints (#310)
  • Elastic checkpoints for OSS, world size can vary in between save and loads (#310)
  • Tensor views for OSS bucketing, reduced CPU use (#300)
  • Bucket calls in ShardedDDP, for faster inter node communications (#327)
  • FlattenParamWrapper, which flattens module parameters into a single tensor seamlessly (#317)
  • AMPnet experimental support (#304)

Fixed

  • ShardedDDP properly handles device changes via .to() (#353)
  • Add a new interface for AdaScale, AdaScaleWrapper, which makes it compatible with OSS (#347)
fairscale - v0.1.4

Published by blefaudeux almost 4 years ago

Fixed

  • Missing cu files in the pip package
fairscale - v0.1.3

Published by blefaudeux almost 4 years ago

Same as 0.1.2, but with the correct numbering in the source code (see init.py)

fairscale - v0.1.2

Published by blefaudeux almost 4 years ago

Added

  • AdaScale: Added gradient accumulation feature (#202)
  • AdaScale: Added support of torch.lr_scheduler (#229)

Fixed

  • AdaScale: smoothing factor value fixed when using gradient accumulation (#235)
  • Pipe: documentation on balancing functions (#243)
  • ShardedDDP: handle typical NLP models
  • ShardedDDP: better partitioning when finetuning
Package Rankings
Top 6.75% on Proxy.golang.org
Top 16.92% on Spack.io
Top 15.15% on Conda-forge.org
Badges
Extracted from project README
Support Ukraine Documentation Status CircleCI Downloads PRs Welcome Explain Like Iā€™m 5: FairScale
Related Projects