A fast and user-friendly runtime for transformer inference (BERT, ALBERT, GPT-2, decoders, etc.) on CPU and GPU.
Published by fangjiarui almost 4 years ago
The ALBERT model now uses the model-aware allocator.
Published by fangjiarui almost 4 years ago
Add a model-aware allocator for the BERT model.
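The idea behind a model-aware allocator is that a transformer's activation tensors have sizes and lifetimes that are known before inference runs, so buffer offsets in a single arena can be planned ahead of time and tensors with disjoint lifetimes can share memory. The sketch below illustrates that general technique with a simple greedy placement; the function name and strategy are illustrative assumptions, not TurboTransformers' actual implementation.

```python
def plan_offsets(tensors):
    """Plan arena offsets for tensors given as (name, size, first_op, last_op),
    letting tensors with non-overlapping lifetimes reuse the same bytes.
    Returns (offsets, arena_size)."""
    placed = []   # (offset, size, first_op, last_op)
    offsets = {}
    # Place big tensors first so smaller ones can fill the gaps.
    for name, size, first, last in sorted(tensors, key=lambda t: -t[1]):
        offset = 0
        while True:
            clash = next(
                (p for p in placed
                 if not (last < p[2] or first > p[3])                         # lifetimes overlap
                 and not (offset + size <= p[0] or offset >= p[0] + p[1])),   # bytes overlap
                None)
            if clash is None:
                break
            offset = clash[0] + clash[1]  # retry just past the clashing block
        placed.append((offset, size, first, last))
        offsets[name] = offset
    arena = max((o + s for o, s, _, _ in placed), default=0)
    return offsets, arena
```

For three tensors of sizes 100, 100, and 50 where the first two never coexist, the planner needs a 150-byte arena instead of the 250 bytes a naive allocator would reserve.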
Published by feifeibear about 4 years ago
Add quantized BERT inference using onnxruntime.
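Weight-only (dynamic) int8 quantization, the technique commonly applied to BERT's linear layers, stores weights as int8 with a floating-point scale and rescales at matmul time. This is a generic sketch of that arithmetic, not onnxruntime's actual kernel; the function names are made up for illustration.

```python
import numpy as np

def quantize_weights(w: np.ndarray):
    """Symmetric per-tensor int8 quantization: w ~= scale * w_q."""
    scale = float(np.abs(w).max()) / 127.0
    w_q = np.clip(np.round(w / scale), -127, 127).astype(np.int8)
    return w_q, scale

def qlinear(x: np.ndarray, w_q: np.ndarray, scale: float):
    """Linear layer on quantized weights: x @ (scale * w_q).
    The product is computed once in higher precision, then rescaled."""
    return (x @ w_q.astype(np.float32)) * scale
```

Each weight is off by at most half a quantization step (scale / 2), so the error of an output element is bounded by that step times the magnitude of the inputs it sums over.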
Published by feifeibear about 4 years ago
Use onnxruntime-cpu as a CPU backend, in parallel with our own home-grown implementation.
Published by feifeibear over 4 years ago
Support the Transformer decoder used in OpenNMT-py.
New GPU memory allocator.
Compatible with PyTorch v1.5.0.
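GPU memory allocators in inference runtimes typically cache freed blocks in a size-keyed free list and hand them back to later requests, so the expensive cudaMalloc/cudaFree calls only happen on cache misses. The sketch below shows that caching strategy in miniature; the class and its best-fit policy are hypothetical, not TurboTransformers' actual allocator.

```python
import bisect

class CachingAllocator:
    def __init__(self, raw_malloc):
        self.raw_malloc = raw_malloc   # e.g. a cudaMalloc wrapper
        self.free_blocks = []          # sorted list of (size, ptr)
        self.sizes = {}                # ptr -> size

    def alloc(self, size):
        # Best fit: smallest cached block that is large enough.
        i = bisect.bisect_left(self.free_blocks, (size,))
        if i < len(self.free_blocks):
            _, ptr = self.free_blocks.pop(i)
            return ptr
        ptr = self.raw_malloc(size)    # cache miss: real device allocation
        self.sizes[ptr] = size
        return ptr

    def free(self, ptr):
        # Return the block to the cache instead of releasing it.
        bisect.insort(self.free_blocks, (self.sizes[ptr], ptr))
```

Because transformer inference allocates the same activation shapes on every forward pass, the cache hit rate is high after the first batch and the device allocator is rarely touched again.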
Published by feifeibear over 4 years ago
Add BLIS to the BLAS backend options.
Published by feifeibear over 4 years ago
BERT acceleration on CPU and GPU.