megablocks

MegaBlocks

APACHE-2.0 License

Downloads: 15.5K
Stars: 1.1K

megablocks - v0.5.1 Latest Release

Published by tgale96 9 months ago

Full Changelog: https://github.com/stanford-futuredata/megablocks/compare/v0.5.0...v0.5.1

megablocks - v0.5.0

Published by mvpatel2000 11 months ago

What's New

Several improvements to avoid CPU <> GPU device synchronizations, GLU support, and support for some new models 👀

Full Changelog: https://github.com/stanford-futuredata/megablocks/compare/v0.4.0...v0.5.0

megablocks - v0.4.0

Published by mvpatel2000 12 months ago

Full Changelog: https://github.com/stanford-futuredata/megablocks/compare/v0.3.3...v0.4.0

megablocks - v0.3.3

Published by mvpatel2000 about 1 year ago

Full Changelog: https://github.com/stanford-futuredata/megablocks/compare/v0.3.2...v0.3.3

megablocks -

Published by mvpatel2000 about 1 year ago

What's Changed

  • Support for bfloat16
  • Optimizations for top_k > 1
  • Support for fully-sharded data parallelism
  • Support tensor model parallelism when expert_parallel_world_size > num_experts
  • Optimizations for activation memory
  • Support activation quantization (thanks @dblalock!)
  • Optimizations for SM90 (Hopper)
  • Lots of bug fixes, cleanup and small optimizations

Full Changelog: https://github.com/stanford-futuredata/megablocks/compare/v0.1...v0.3.2

megablocks - v0.3.1

Published by tgale96 about 1 year ago

megablocks - v0.3

Published by tgale96 about 1 year ago

What's Changed

  • Support for bfloat16
  • Optimizations for top_k > 1
  • Support for fully-sharded data parallelism
  • Support tensor model parallelism when expert_parallel_world_size > num_experts
  • Optimizations for activation memory
  • Support activation quantization (thanks @dblalock!)
  • Optimizations for SM90 (Hopper)
  • Lots of bug fixes, cleanup and small optimizations

Full Changelog: https://github.com/stanford-futuredata/megablocks/compare/v0.1...v0.3

megablocks - Version 0.1

Published by tgale96 over 1 year ago

Initial release documenting repository state prior to MLSys'23 camera-ready publication.

Package Rankings
Top 16.35% on Pypi.org