mixture-of-attention

Some personal experiments around routing tokens to different autoregressive attention, akin to mixture-of-experts

MIT License

Downloads
1.1K
Stars
101
Committers
1
mixture-of-attention - 0.0.3

Published by lucidrains over 1 year ago

mixture-of-attention - 0.0.2

Published by lucidrains over 1 year ago

mixture-of-attention - 0.0.1a

Published by lucidrains over 1 year ago

mixture-of-attention - 0.0.1

Published by lucidrains over 1 year ago

Package Rankings
Top 22.15% on Pypi.org
Related Projects