Liu-xiandong / How_to_optimize_in_GPU

This is a series of GPU optimization topics. Here we will introduce how to optimize the CUDA kernel in detail. I will introduce several basic kernel optimizations, including: elementwise, reduce, sgemv, sgemm, etc. The performance of these kernels is basically at or near the theoretical limit.

Date Created 2021-10-17 (3 years ago)
Commits 58 (last one about a year ago)
Stargazers 937 (4 this week)
Watchers 13 (0 this week)
Forks 149
License apache-2.0
Ranking

RepositoryStats indexes 624,936 repositories, of these Liu-xiandong/How_to_optimize_in_GPU is ranked #56,076 (91st percentile) for total stargazers, and #161,073 for total watchers. Github reports the primary language for this repository as Cuda, for repositories using this language it is ranked #32/386.

Liu-xiandong/How_to_optimize_in_GPU is also tagged with popular topics, for these it's ranked: hpc (#32/313),  high-performance-computing (#24/164)

Other Information

Liu-xiandong/How_to_optimize_in_GPU has Github issues enabled, there are 7 open issues and 9 closed issues.

Star History

Github stargazers over time

1k1k9009008008007007006006005005004004003003002002001001000020222022Jul '22Jul '2220232023Jul '23Jul '2320242024Jul '24Jul '2420252025

Watcher History

Github watchers over time, collection started in '23

131312121111101099887766554420232023Jul '23Jul '2320242024Jul '24Jul '2420252025

Recent Commit History

42 commits on the default branch (master) since jan '22

454540403535303025252020151510105500Jul '22Jul '2220232023Jul '23Jul '2320242024Jul '24Jul '2420252025

Yearly Commits

Commits to the default branch (master) per year

454540403535303025252020151510105500202120212022202220242024

Issue History

Total Issues
Open Issues
Closed Issues
16161414121210108866442200Jul '22Jul '2220232023Jul '23Jul '2320242024Jul '24Jul '2420252025

Languages

The primary language is Cuda but there's also others...

CudaCudaShellShellMakefileMakefile

updated: 2025-03-10 @ 05:38am, id: 418155000 / R_kgDOGOyJ-A