Jeng Bai-Cheng

major in heterogeneous computing such as CUDA, OpenCL, etc.

Ecosystems: C++, Python, Cuda

Projects

Paddle

PArallel Distributed Deep LEarning: Machine Learning Framework from Industrial Practice (『飞桨』核心框架,深度学习&机器学习高性能单机、分布式训练和跨平台部署)

C++ - Released: 15 Aug 2016 - 21,957

KerasToTensorRT

This is a simple demonstration for running Keras model model on Tensorflow with TensorRT integration(TFTRT) or on TensorRT directly without invoking "freeze_graph.py".

Python - Released: 05 Jun 2018 - 68

openacc_fortran_examples

Simple OpenACC Fortran Examples

Fortran - Released: 20 Feb 2020 - 51

cuGemmProf

A simple tool to profile performance of multiple combinations of GEMM of cuBLAS

C++ - Released: 26 Dec 2019 - 20

cuda_examples

Simple CUDA Examples

C++ - Released: 26 May 2023 - 3

Tensorflow_Inception_v3_TensorRT

This is a simple demonstration for running Tensorflow inception v3 model on TensorRT

C++ - Released: 10 Jan 2018 - 12

trt-se-resnext

a sample, running se-resnext on TensorRT

C++ - Released: 13 Nov 2018 - 6