Liu-xiandong / How_to_optimize_in_GPU

This is a series of GPU optimization topics. Here we will introduce how to optimize the CUDA kernel in detail. I will introduce several basic kernel optimizations, including: elementwise, reduce, sgemv, sgemm, etc. The performance of these kernels is basically at or near the theoretical limit.

Date Created 2021-10-17 (3 years ago)
Commits 58 (last one about a year ago)
Stargazers 822 (1 this week)
Watchers 13 (0 this week)
Forks 131
License apache-2.0
Ranking

RepositoryStats indexes 579,555 repositories, of these Liu-xiandong/How_to_optimize_in_GPU is ranked #60,877 (89th percentile) for total stargazers, and #165,293 for total watchers. Github reports the primary language for this repository as Cuda, for repositories using this language it is ranked #31/335.

Liu-xiandong/How_to_optimize_in_GPU is also tagged with popular topics, for these it's ranked: hpc (#33/296),  high-performance-computing (#24/154)

Other Information

Liu-xiandong/How_to_optimize_in_GPU has Github issues enabled, there are 6 open issues and 9 closed issues.

Star History

Github stargazers over time

Watcher History

Github watchers over time, collection started in '23

Recent Commit History

42 commits on the default branch (master) since jan '22

Yearly Commits

Commits to the default branch (master) per year

Issue History

Languages

The primary language is Cuda but there's also others...

updated: 2024-11-05 @ 05:15pm, id: 418155000 / R_kgDOGOyJ-A