nnfusion

A flexible and efficient deep neural network (DNN) compiler that generates high-performance executables from a DNN model description.

MIT License

Stars: 956
Committers: 22


NNFusion v0.4 Release Candidate (Latest Release)

Published by wenxcs over 2 years ago

NNFusion v0.3 Release

Published by wenxcs over 3 years ago

Major Features

  • Support end-to-end BERT model training (in ONNX format) on a real dataset
  • Add new operator fusion passes for transformer-based model optimization
  • Provide C++ and JSON interfaces for extending custom operators
  • Support a new HLSL code generator

Others

  • Update related documentation
  • Fix bugs

Quick link to the Chinese version of these release notes --> #105 (comment)

NNFusion v0.2 Release

Published by jlxue almost 4 years ago

Major Features

  • Support a Python interface to accelerate the training and inference of PyTorch models (see the sketch after this list)
  • Support low-precision and mixed-precision model compilation, e.g., fp16
  • Provide auto kernel tuner integration:
    • Add Antares IR for 60+ ops
    • Support auto-tuning via the Antares tuning service
  • Support parallel training via SuperScaler
  • Enable local kernel cache through kernel database
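
A minimal sketch of one way to drive this from Python, assuming the PyTorch model is first exported to ONNX and then handed to the nnfusion command-line compiler. The flag names are assumptions, and the actual NNFusion Python interface may wrap these steps differently:

    # Hedged sketch: export a PyTorch model to ONNX, then compile it with the
    # nnfusion CLI. The torch/torchvision calls are standard PyTorch APIs; the
    # nnfusion flags below are assumptions, not a confirmed CLI reference.
    import subprocess
    import torch
    import torchvision

    model = torchvision.models.resnet50(pretrained=True).eval()
    dummy_input = torch.randn(1, 3, 224, 224)

    # Export to ONNX, one of the model formats NNFusion accepts.
    torch.onnx.export(model, dummy_input, "resnet50.onnx", opset_version=11)

    # Invoke the nnfusion compiler on the exported model (flags are illustrative).
    subprocess.run(["nnfusion", "resnet50.onnx", "-format", "onnx"], check=True)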

Others

  • Update related documentation
  • Various user-experience enhancements and bug fixes

Quick link to the Chinese version of these release notes --> https://github.com/microsoft/nnfusion/issues/105#issuecomment-747194776

NNFusion v0.1 Release

Published by wenxcs almost 4 years ago

  • Build and Installation:

    • Support out-of-the-box installation with a Docker image
    • Support source-code installation on a native system or in Docker
    • Support devices such as CUDA GPUs and ROCm GPUs
  • Models, Framework and Operators:

    • Support DNN model formats including TensorFlow and ONNX
    • Support commonly used models including AlexNet, VGG11, ResNet50, seq2seq, BERT, etc.
    • Support more than 100 commonly used operators.
  • Model Compilation and Execution:

    • Provide a full-stack optimization mechanism, including data-flow graph optimizations, model-specific kernel selection, kernel co-scheduling, etc.
    • Provide ahead-of-time and source-to-source (model-to-code) compilation to reduce runtime overhead (see the build-and-run sketch after this list)
    • Remove third-party library or framework dependencies
  • Usability:

    • Provide the nnfusion command-line tool
    • Provide tools for users to freeze TensorFlow and PyTorch models (see the graph-freezing sketch after this list)
    • Provide a flexible way to customize optimization by directly modifying the generated code
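
As a hedged illustration of the ahead-of-time, source-to-source flow, the build-and-run sketch below compiles and executes the generated source with CMake; the nnfusion_rt/cuda_codegen output directory and main_test binary follow the layout described in the NNFusion documentation, but are stated here as assumptions:

    # Hedged sketch: build and run the source code that NNFusion generated
    # ahead of time. The directory and binary names are assumptions based on
    # the documented codegen layout; adjust them to your actual output.
    import subprocess

    codegen_dir = "nnfusion_rt/cuda_codegen"  # assumed codegen output directory
    subprocess.run(["cmake", "."], cwd=codegen_dir, check=True)
    subprocess.run(["make", "-j"], cwd=codegen_dir, check=True)
    subprocess.run(["./main_test"], cwd=codegen_dir, check=True)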
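
As a hedged illustration of the freezing step, the graph-freezing sketch below uses only standard TensorFlow 1.x APIs to fold variables into constants and produce a single .pb graph that the nnfusion tool can consume; the freezer tools shipped with NNFusion may differ in detail:

    # Hedged sketch: freeze a small TensorFlow 1.x graph into one .pb file.
    # Only standard TF 1.x APIs are used; NNFusion's bundled freezer tools
    # may wrap these steps differently.
    import tensorflow.compat.v1 as tf
    tf.disable_eager_execution()

    with tf.Session() as sess:
        x = tf.placeholder(tf.float32, shape=[None, 4], name="input")
        w = tf.Variable(tf.ones([4, 2]), name="w")
        y = tf.identity(tf.matmul(x, w), name="output")
        sess.run(tf.global_variables_initializer())

        # Fold variables into constants so the graph is self-contained.
        frozen = tf.graph_util.convert_variables_to_constants(
            sess, sess.graph_def, output_node_names=["output"])

    with open("frozen_graph.pb", "wb") as f:
        f.write(frozen.SerializeToString())

    # The frozen graph can then be passed to the nnfusion compiler, e.g.:
    #   nnfusion frozen_graph.pb -format tensorflow   (flag name is illustrative)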

Quick link to the Chinese version of these release notes --> https://github.com/microsoft/nnfusion/issues/72#issuecomment-720338392

Related Projects