antares

Antares: an automatic engine for multi-platform kernel generation and optimization. Supporting CPU, CUDA, ROCm, DirectX12, GraphCore, SYCL for CPU/GPU, OpenCL for AMD/NVIDIA, Android CPU/GPU backends.

OTHER License

Downloads
8.5K
Stars
440
Committers
10
antares - Antares v0.9.6 Latest Release

Published by ghostplant 6 months ago

  1. Support Win10/11 for CUDA backend;
  2. Add example: python3 -m autort.examples.06_diffuser_no_opt without dedicated device type optimization.
  3. A bunch of Antares IR enhancements.
antares - Antares v0.9.5

Published by ghostplant 9 months ago

Preparing operators to support stable diffusion, turning Pytorch dependencies to 2.2.0

antares - Antares v0.9.4

Published by ghostplant 10 months ago

Add llama example for latency tests: python3 -m autort.examples.03_llama_tiny

antares - Antares v0.9.3

Published by ghostplant 11 months ago

antares - Antares v0.9.2

Published by ghostplant 12 months ago

Add Vulkan Backend for Python 3.12;

antares - Antares v0.9.1

Published by ghostplant about 1 year ago

Bug fixes for python -m autort.examples.mnist in DirectX platform.

antares - Antares v0.9.0

Published by ghostplant about 1 year ago

Experimental Release AutoRT for Windows DirectX 12 / Microsoft XBox / Linux CUDA.

antares - Antares v0.3.20

Published by ghostplant about 2 years ago

A bunch of fixes included.

Installation Command for Linux:

python3 -m pip install --upgrade antares
antares - Antares v0.3.16

Published by ghostplant over 2 years ago

A bunch of enhancement included.

Installation Command for Linux:

python3 -m pip install --upgrade antares
antares - Antares v0.3.1

Published by ghostplant over 2 years ago

First Antares version ported to PYPI.

Installation Command for Linux:

python3 -m pip install --upgrade antares
antares - Antares v0.2.3

Published by ghostplant about 3 years ago

New updates:

  1. Detect Windows HLSL ending lines \r\n in c-hlsl_win64/c-hlsl_xbox backend;
  2. Refine extra overhead in JIT plugin computation for pytorch;
  3. Add IPU/IPU2 evaluator for c-ipu backend;
  4. Enhance in evaluating c-sycl_cuda backend;

Thanks for contributions from mzmssg, Michoumichmich.

antares - Antares v0.2.2

Published by ghostplant over 3 years ago

New updates:

  1. HLSL support native erf & pow operators.
  2. Extend new backend: c-mcpu_android for aarch64 CPU.
  3. TF/Pytotch JIT Plugin: Support extending AVX512 & SYCL kernels.
  4. Collective library: MPI Support for TF-Intel; NCCL/RCCL Support for TF-CUDA/TF-ROCM.
antares - Antares v0.2.1

Published by ghostplant over 3 years ago

New updates:

  1. Change of Auto-scheduling: Enhanced CPU(c-scpu/c-mcpu/c-mcpu_avx512) & IPU(renamed from c-gc to c-ipu) auto tuning search space.
  2. Change of JIT Tuning: Pytorch/Tensorflow JIT Plugin is changed into local tuning by setting ANTARES_ROOT. (Tuning over rest-server is canceled)
  3. Change of Installation: Allow non-root users to install Antares components.
  4. Many other fixes.
antares - Antares v0.2.0

Published by ghostplant over 3 years ago

New features:

  1. Supporting More Backends: e.g. SYCL for CPU, ROCm for Windows, OCL for Android, etc.
  2. Enhanced Tuning Mechanism OpEvo-2: Faster and Effective Tuner than Legacy Ansor.
  3. AB Backend interface for all hardware (e.g. ab::init, ab::launchKernel, ..)
  4. Enhanced Antares HLSL library using DXC-6.0, tuning efficiency is much improved.
  5. Initial support for inter-op tuning (only for small graph in this version, large graph tuning will be supported in the following releases).
antares - Antares v0.1.0

Published by ghostplant over 3 years ago

This version is frozen to keep legacy usage of Antares (intra-op optimizations, v0.1 api for directx12).