tinyGPUlang

Tutorial on building a gpu compiler backend in LLVM

MIT License

Stars

7

Committers

View Code on GitHub

Ecosystems: LLVM, Cuda

tinyGPUlang

Tutorial on building a gpu compiler backend in LLVM

Goals

The goal of this tutorial is to show a simple example on how to generate ptx from the llvm ir and how to write the IR itself to access cuda features.

For the sake of demonstration a language frontend is also provided. The main idea of the language is to support pointwise (aka elementwise) operations with gpu acceleration.

If you are just curios about the code generation backend, you can jump directly to The code generator for NVPTX backend part.

What is inside the repo?

tinyGPUlang: the compiler, creates ptx from tgl (the example language file)
test: a cuda driver api based test for the generated ptx
examples: example tgl files
docs: documentation for the tutorial

Tutorial content

Build

See the How to build the project? documentation for further details.

References

Related Projects

https://github.com/clara-parabricks/GenomeWorks

SDK for GPU accelerated genome assembly and analysis

31 May 2019 284

librapid

A highly optimised C++ library for mathematical applications and neural networks.

25 May 2021 163

cuda-learning

cuda编程学习入门

nvidia-gpu-ml-library-test

Simple tests for JAX, PyTorch, and TensorFlow to test if the installed NVIDIA drivers are being p...

https://github.com/MrNeRF/gaussian-splatting-cuda

3D Gaussian Splatting, reimagined: Unleashing unmatched speed with C++ and CUDA from the ground up!

30 Jul 2023 862

cuda-design-patterns

Some CUDA design patterns and a bit of template magic for CUDA

16 Nov 2018 145

awesome-gpgpu

A curated list of awesome GPGPU (CUDA/OpenCL/Vulkan) resources

cxbqn

BQN virtual machine

cccl

CUDA C++ Core Libraries

17 Sep 2020 743

https://github.com/m4rs-mt/ILGPU

ILGPU JIT Compiler for high-performance .Net GPU programs

08 Jan 2017 1,343

https://github.com/TensorBFS/CuTropicalGEMM.jl

The fastest Tropical number matrix multiplication on GPU

https://github.com/romnn/nvbit-rs

Rust bindings to the NVIDIA NVBIT binary instrumentation API

PTXprofiler

A simple profiler to count Nvidia PTX assembly instructions of OpenCL/SYCL/CUDA kernels for roofl...

spbla

Sparse Boolean linear algebra for Nvidia Cuda, OpenCL and CPU computations

penguinV

Computer vision library with focus on heterogeneous systems

09 Jun 2016 118