CudaPerformance

Compare the performance of matrix multiplication among GPU shared memory, GPU global memory and CPU

MIT License

Stars

Committers

View Code on GitHub View on X

Ecosystems: Cuda

Commit Statistics

Past Year

All Time

Total Commits

Total Committers

Avg. Commits Per Committer

0.0

56.0

Bot Commits

Issue Statistics

Past Year

All Time

Total Pull Requests

Merged Pull Requests

Total Issues

Time to Close Issues

N/A

Related Projects

cuda-design-patterns

Some CUDA design patterns and a bit of template magic for CUDA

16 Nov 2018 145

awesome-gpgpu

A curated list of awesome GPGPU (CUDA/OpenCL/Vulkan) resources

20 Jun 2018 63

parallel-processing-cpu-and-gpu-env-and-lib-with-powercap

(2024/2025) A library and environment for parallel processing in a power-limited CPU+GPU cluster ...

16 Aug 2024 2

distributed-training-guide

Best practices & guides on how to write distributed pytorch training code

31 Jul 2024 190

CUDA_Code

Codes for learning cuda. Implementation of multiple kernels.

11 Aug 2024 2

cuda-lab

Playing with CUDA and GPUs in Google Colab

16 Jan 2019 9

hpc

My experiments with MPI and OpenMP

04 May 2022 3

GenomeWorks

SDK for GPU accelerated genome assembly and analysis

31 May 2019 284

CuTropicalGEMM.jl

The fastest Tropical number matrix multiplication on GPU

29 Jun 2023 1

Apriori-and-Eclat-Frequent-Itemset-Mining

Implementation of the Apriori and Eclat algorithms, two of the best-known basic algorithms for mi...

21 Oct 2018 40

cuda-learning

cuda编程学习入门

02 Feb 2022 28

SCAMP

The fastest way to compute matrix profiles on CPU and GPU!

02 Apr 2018 157

PTXprofiler

A simple profiler to count Nvidia PTX assembly instructions of OpenCL/SYCL/CUDA kernels for roofl...

11 Jan 2023 37

cuda_scheduling_examiner_mirror

A tool for examining GPU scheduling behavior.

29 Mar 2017 65

cuvarbase

Python library for fast time-series analysis on CUDA GPUs

22 Jun 2017 24