MegaBlocks
APACHE-2.0 License
Bot releases are hidden (Show)
moe_normalize_expert_weights
when top_k=1
by @152334H in https://github.com/stanford-futuredata/megablocks/pull/87
Full Changelog: https://github.com/stanford-futuredata/megablocks/compare/v0.5.0...v0.5.1
Published by mvpatel2000 11 months ago
Several improvements to avoid CPU <> GPU device synchronizations, GLU support, and support for some new models 👀
.cpu()
call by @mvpatel2000 in https://github.com/stanford-futuredata/megablocks/pull/37
Full Changelog: https://github.com/stanford-futuredata/megablocks/compare/v0.4.0...v0.5.0
Published by mvpatel2000 12 months ago
Full Changelog: https://github.com/stanford-futuredata/megablocks/compare/v0.3.3...v0.4.0
Published by mvpatel2000 about 1 year ago
Full Changelog: https://github.com/stanford-futuredata/megablocks/compare/v0.3.2...v0.3.3
Full Changelog: https://github.com/stanford-futuredata/megablocks/compare/v0.1...v0.3.2
Published by tgale96 about 1 year ago
Full Changelog: https://github.com/stanford-futuredata/megablocks/compare/v0.3...v0.3.1
Published by tgale96 about 1 year ago
Full Changelog: https://github.com/stanford-futuredata/megablocks/compare/v0.1...v0.3
Published by tgale96 over 1 year ago
Initial release documenting repository state prior to MLSys'23 camera-ready publication.