AcceleratedKernels.jl

Cross-architecture parallel algorithms for Julia's GPU backends, from a unified KernelAbstractions.jl codebase. Targets Intel oneAPI, AMD ROCm, Apple Metal, Nvidia CUDA.

MIT License

Stars
77

Bot releases are visible (Hide)

AcceleratedKernels.jl - v0.1.0 Latest Release

Published by github-actions[bot] 11 days ago

AcceleratedKernels v0.1.0

Merged pull requests:

  • Remove unnecessary Const (#3) (@pxl-th)
  • Remove redundant Const and inbounds macros (#4) (@pxl-th)

Closed issues:

  • Use GPUArraysCore? (#1)
  • Invalid IR for sortperm! on AMDGPU (#2)
AcceleratedKernels.jl - 0.1.0

Published by anicusan about 1 month ago

First release of AcceleratedKernels.jl, for archiving purposes supporting the "AcceleratedKernels.jl: Cross-Architecture Parallel Algorithms from a Unified, Transpiled Codebase" paper.