GPT implementation in Flax
Statistics for this project are still being loaded, please check back later.
An open platform for training, serving, and evaluating large language models. Release repo for Vi...
Flax is a neural network library for JAX that is designed for flexibility.
VILA - a multi-image visual language model with training, inference and evaluation recipe, deploy...
Unsupervised Language Modeling at scale for robust sentiment classification
GLM (General Language Model)
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
ktrain is a Python library that makes deep learning and AI more accessible and easier to apply
A minimal PyTorch re-implementation of the OpenAI GPT (Generative Pretrained Transformer) training
Code and documents of LongLoRA and LongAlpaca (ICLR 2024 Oral)
🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.
A library for accelerating Transformer models on NVIDIA GPUs, including using 8-bit floating poin...
The Incredible PyTorch: a curated list of tutorials, papers, projects, communities and more relat...
Ongoing research training transformer models at scale
LongLLaMA is a large language model capable of handling long contexts. It is based on OpenLLaMA a...
Ongoing research training transformer language models at scale, including: BERT & GPT-2