Implementation of the LULESH mini-app in Accelerate
BSD-3-CLAUSE License
Some CUDA design patterns and a bit of template magic for CUDA
C++ implementation of ChatGLM-6B & ChatGLM2-6B & ChatGLM3 & GLM4(V)
A lightweight library to inject LLVM bitcode into JVMs
High Performance C++ Turbulent flow Lattice Boltzmann code
USER-MESO package for LAMMPS
A highly optimised C++ library for mathematical applications and neural networks.
纯c++的全平台llm加速库,支持python调用,chatglm-6B级模型单卡可达10000+token / s,支持glm, llama, moss基座,手机端流畅运行
A fast & densely stored hashmap and hashset based on robin-hood backward shift deletion
CUDA C++ Core Libraries