MegaBlocks
APACHE-2.0 License
A scalable generative AI framework built for researchers and developers working on Large Language...
GLM (General Language Model)
A modular toolbox for meta-learning research with a focus on speed and reproducibility.
In this repository, I will share some useful notes and references about deploying deep learning-b...
A Theano framework for building and training neural networks
Learn about the Neumorphic engineering process of creating large-scale integration (VLSI) systems...
MegEngine 是一个快速、可拓展、易于使用且支持自动求导的深度学习框架
Ongoing research training transformer language models at scale, including: BERT & GPT-2
Unsupervised Language Modeling at scale for robust sentiment classification
Ongoing research training transformer models at scale
OpenMMLab Foundational Library for Training Deep Learning Models
A library for accelerating Transformer models on NVIDIA GPUs, including using 8-bit floating poin...
🚀 A simple way to launch, train, and use PyTorch models on almost any device and distributed conf...
A fast MoE impl for PyTorch
Minimalistic large language model 3D-parallelism training