A practical implementation of GradNorm, Gradient Normalization for Adaptive Loss Balancing, in Pytorch
MIT License
Regularization, Neural Network Training Dynamics
Official implementation of Conflict-Free Inverse Gradients Method
The Incredible PyTorch: a curated list of tutorials, papers, projects, communities and more relat...
Examples for TensorFlow Weight Normalization
GaLore: Memory-Efficient LLM Training by Gradient Low-Rank Projection
Explorations into the proposal from the paper "Grokfast, Accelerated Grokking by Amplifying Slow ...
Easy to use class balanced cross entropy and focal loss implementation for Pytorch
phgrad - Cause there are not enough unusable autograd libraries in python
AdamW optimizer for bfloat16 models in pytorch 🔥.
Implementation of E(n)-Equivariant Graph Neural Networks, in Pytorch
Code snippets created for the PyTorch discussion board