Liu-xiandong / How_to_optimize_in_GPU

This is a series of GPU optimization topics that explains in detail how to optimize CUDA kernels. It covers several basic kernel optimizations, including elementwise, reduce, sgemv, sgemm, etc. The performance of these kernels is at or near the theoretical limit.

Date Created 2021-10-17 (2 years ago)
Commits 58 (last one 11 months ago)
Stargazers 749 (15 this week)
Watchers 12 (0 this week)
Forks 118
License Apache-2.0
Ranking

RepositoryStats indexes 534,551 repositories; of these, Liu-xiandong/How_to_optimize_in_GPU is ranked #63,292 (88th percentile) for total stargazers and #172,438 for total watchers. GitHub reports the primary language for this repository as Cuda; among repositories using this language, it is ranked #29/279.

Liu-xiandong/How_to_optimize_in_GPU is also tagged with popular topics; for these it is ranked: hpc (#33/274), high-performance-computing (#22/140)

Other Information

Liu-xiandong/How_to_optimize_in_GPU has GitHub issues enabled; there are 6 open issues and 9 closed issues.

Star History

GitHub stargazers over time

Watcher History

GitHub watchers over time; collection started in '23

Recent Commit History

42 commits on the default branch (master) since Jan '22

Yearly Commits

Commits to the default branch (master) per year

Issue History

Languages

The primary language is Cuda, but there are also others...

updated: 2024-06-28 @ 09:32pm, id: 418155000 / R_kgDOGOyJ-A