CudaPerformance

Compare the performance of matrix multiplication among GPU shared memory, GPU global memory and CPU

MIT License

Stars

Committers

View Code on GitHub View on X

Ecosystems: Cuda

Cuda Performance

How to generate PDF Report

Download zip or clone the repository.
Execute the shell script bash run.sh
PDF file will be genereated in to 'output/' folder

Related Projects

https://github.com/clara-parabricks/GenomeWorks

SDK for GPU accelerated genome assembly and analysis

31 May 2019 284

PTXprofiler

A simple profiler to count Nvidia PTX assembly instructions of OpenCL/SYCL/CUDA kernels for roofl...

11 Jan 2023 37

Apriori-and-Eclat-Frequent-Itemset-Mining

Implementation of the Apriori and Eclat algorithms, two of the best-known basic algorithms for mi...

21 Oct 2018 40

https://github.com/LambdaLabsML/distributed-training-guide

Best practices & guides on how to write distributed pytorch training code

31 Jul 2024 190

https://github.com/TensorBFS/CuTropicalGEMM.jl

The fastest Tropical number matrix multiplication on GPU

29 Jun 2023 1

cuda-learning

cuda编程学习入门

02 Feb 2022 28

https://github.com/johnh2o2/cuvarbase

Python library for fast time-series analysis on CUDA GPUs

22 Jun 2017 24

cuda-design-patterns

Some CUDA design patterns and a bit of template magic for CUDA

16 Nov 2018 145

https://github.com/SamuraiBUPT/CUDA_Code

Codes for learning cuda. Implementation of multiple kernels.

11 Aug 2024 2

cuda-lab

Playing with CUDA and GPUs in Google Colab

16 Jan 2019 9

cuda-toolkit

GitHub Action to install CUDA

10 Mar 2021 146

awesome-gpgpu

A curated list of awesome GPGPU (CUDA/OpenCL/Vulkan) resources

20 Jun 2018 63

hpc

My experiments with MPI and OpenMP

04 May 2022 3

https://github.com/zpzim/SCAMP

The fastest way to compute matrix profiles on CPU and GPU!

02 Apr 2018 157

https://github.com/MAJ0RRR/parallel-processing-cpu-and-gpu-env-and-lib-with-powercap

(2024/2025) A library and environment for parallel processing in a power-limited CPU+GPU cluster ...

16 Aug 2024 2