A flexible and efficient deep neural network (DNN) compiler that generates a high-performance executable from a DNN model description.
TensorFlow 2 library implementing Graph Neural Networks
CodeBERT
Code for loralib, an implementation of "LoRA: Low-Rank Adaptation of Large Language Models"
MMdnn is a set of tools to help users inter-operate among different deep learning frameworks. E.g...
DirectML is a high-performance, hardware-accelerated DirectX 12 library for machine learning. Dir...
OTOv1-v3, NeurIPS, ICLR, TMLR, DNN Training, Compression, Structured Pruning, Erasing Operators, ...
This repository provides code for machine learning algorithms for edge devices developed at Micro...
Samples and Tools for Windows ML.
ONNX Runtime: cross-platform, high-performance ML inferencing and training accelerator
Foundation Architecture for (M)LLMs
Dedicated to building industrial foundation models for universal data intelligence across industr...
An efficient implementation of the popular sequence models for text generation, summarization, an...
JARVIS, a system to connect LLMs with the ML community. Paper: https://arxiv.org/pdf/2303.17580.pdf
Tutel MoE: An Optimized Mixture-of-Experts Implementation