The Tensor Algebra Compiler (taco) computes sparse tensor expressions on CPUs and GPUs
A fast & densely stored hashmap and hashset based on Robin Hood hashing with backward shift deletion
Experimental wrapper over LLVM for generating and compiling code at run-time.
C++ wrappers for SIMD intrinsics and parallelized, optimized mathematical functions (SSE, AVX, AV...
CV-CUDA™ is an open-source, GPU accelerated library for cloud-scale image processing and computer...
Sparse Boolean linear algebra for NVIDIA CUDA, OpenCL and CPU computations
R bindings for xtensor
Converters between Armadillo matrices (C++) and NumPy arrays using pybind11
CUDA C++ Core Libraries
A C++ header-only library of statistical distribution functions.
Armadillo: fast C++ library for linear algebra (matrix maths) & scientific computing - https://ar...
Some CUDA design patterns and a bit of template magic for CUDA