ILGPU JIT Compiler for high-performance .NET GPU programs
Published by m4rs-mt almost 4 years ago
Note that this version is based on the LLVM compiler framework and contains native dependencies. Please use ILGPU >= v0.5
to use the platform-independent ILGPU compiler version.
CPUAccelerator.
CPUAccelerator.
Force32BitFloats mode.
ArrayView<T> getters.
Published by m4rs-mt almost 4 years ago
Note that this version is based on the LLVM compiler framework and contains native dependencies. Please use ILGPU >= v0.5
to use the platform-independent ILGPU compiler version.
GPUMath.
IntPtr.ToPointer functions.
ArgumentOutOfRangeException check in MemoryBuffer.
ArrayView3D<T> and ArrayView2D<T> structures.
Unsafe package.
Published by m4rs-mt almost 4 years ago
This new beta version offers significant performance improvements for the generated kernel programs and a set of new features (get the NuGet package).
O2 to O1 (release mode) to improve performance in release builds using an additional set of stable optimization passes (#344).
Cuda backend to O1 pipeline to generate vectorized IO operations in release builds (#350).
CPU, Cuda and OpenCL accelerators (#342).
AlignTo alignment methods to explicitly align ArrayView instances to a particular alignment in bytes (#316).
LocalMemory class (#316).
PopCount, CLZ and CTZ operations (#324).
MemSet functions to all memory buffers (#338).
O2 pipeline (#328).
LongGlobalIndex helper to simplify correct computations using 64-bit integers (#337).
CLPlatformVersion and fixed OpenCL 1.2 compatibility issues (#335).
DebugArrayView implementations (#345).
Special thanks to @MoFtZ and @jgiannuzzi for their contributions to this release and to the entire ILGPU community for providing feedback, submitting issues and feature requests.
Published by m4rs-mt almost 4 years ago
The new stable version offers significant performance improvements for the generated kernel programs (get the NuGet package).
Launch methods to Accelerator class to launch kernels without pre-loading/compiling them (#319).
AggressiveInlining to improve performance of (usually) performance-critical GPU programs (#294).
ContextFlags.EnhancedPTXBackendFeatures (#274, #303).
Span<T> in the scope of MemoryBuffer copy operations (#122, #276).
Major internal changes:
Special thanks to @MoFtZ for his contributions to this release and to the entire ILGPU community for providing feedback, submitting issues and feature requests.
Published by m4rs-mt almost 4 years ago
This new beta version offers significant performance improvements for the generated kernel programs (get the NuGet package).
AggressiveInlining to improve performance of (usually) performance-critical GPU programs (#294).
ContextFlags.EnhancedPTXBackendFeatures (#274, #303).
Span<T> in the scope of MemoryBuffer copy operations (#122, #276).
Major internal changes:
Special thanks to @MoFtZ for his contributions to this release and to the entire ILGPU community for providing feedback, submitting issues and feature requests.
Published by m4rs-mt about 4 years ago
The new stable version offers significant performance improvements for the generated kernel programs (get the NuGet package).
Half2) types (#273).
byte and ushort values in the context of method calls (#239).
Major internal changes:
Special thanks to @MoFtZ, @Yey007 and @LxBos for their contributions to this release and to the entire ILGPU community for providing feedback, submitting issues and feature requests.
Published by m4rs-mt about 4 years ago
This new beta version offers significant performance improvements for the generated kernel programs (get the NuGet package).
Half2) types (#273).
byte and ushort values in the context of method calls (#239).
Major internal changes:
Special thanks to @MoFtZ, @Yey007 and @LxBos for their contributions to this release and to the entire ILGPU community for providing feedback, submitting issues and feature requests.