Experiment of using Tangent to autodiff triton
MIT License
Statistics for this project are still being loaded, please check back later.
ShapeGuard allows you to very succinctly assert the expected shapes of tensors in a dynamic, eins...
A Python module for compiling PyTorch graphs to C
Pragmatic functional programming for Python inspired by F#
The Triton Inference Server provides an optimized cloud and edge inferencing solution.
Implementation of the conditionally routed attention in the CoLT5 architecture, in Pytorch
Implementation of a Transformer, but completely in Triton
Software Architecture for ML engineers
End-to-end recipes for optimizing diffusion models with torchao and diffusers (inference and FP8 ...
Efficient, scalable and enterprise-grade CPU/GPU inference server for 🤗 Hugging Face transformer ...
Payton! Kickstart any 3D OpenGL + GTK Ideas in a few seconds!
Fast and memory-efficient exact attention