AdamW optimizer for bfloat16 models in PyTorch 🔥.
MIT License
End-to-end recipes for optimizing diffusion models with torchao and diffusers (inference and FP8 ...
ESGD-M is a stochastic non-convex second order optimizer, suitable for training deep learning mod...
[ICML 2023] SmoothQuant: Accurate and Efficient Post-Training Quantization for Large Language Models
Running large language models on a single GPU for throughput-oriented scenarios.
The PyTorch Implementation based on YOLOv4 of the paper: "Complex-YOLO: Real-time 3D Object Detec...
GaLore: Memory-Efficient LLM Training by Gradient Low-Rank Projection
Implementation of the Adan (ADAptive Nesterov momentum algorithm) optimizer in PyTorch
A TensorFlow implementation of the Lion optimizer
Explore training for quantized models
🦁 Lion, new optimizer discovered by Google Brain using genetic algorithms that is purportedly bet...
Adversarially Learned Inference in PyTorch
Unofficial PyTorch implementation of Masked Autoencoders Are Scalable Vision Learners
Software Architecture for ML engineers