Low Precision Arithmetic Simulation in PyTorch
MIT License
4-bit quantization of LLaMA using GPTQ
Tensors and dynamic neural networks in Python with strong GPU acceleration
SOTA low-bit LLM quantization (INT8/FP8/INT4/FP4/NF4) & sparsity; leading model compression techn...
End-to-end recipes for optimizing diffusion models with torchao and diffusers (inference and FP8 ...
👁️ 🖼️ 🔥 PyTorch Toolbox for Image Quality Assessment, including LPIPS, FID, NIQE, NRQM (Ma), MUSIQ,...
AIMET is a library that provides advanced quantization and compression techniques for trained neu...
PyTorch installation wheels for Raspberry Pi 64 OS
Explore training for quantized models
The official PyTorch implementation of our paper "Is Space-Time Attention All You Need for Video ...
The Incredible PyTorch: a curated list of tutorials, papers, projects, communities and more relat...
Seamless analysis of your PyTorch models (RAM usage, FLOPs, MACs, receptive field, etc.)