1 result found Sort:

A nearly complete collection of prefix sum algorithms implemented in CUDA, D3D12, Unity and WGPU. Theoretically portable to all wave/warp/subgroup sizes.
Created 2023-03-13
122 commits to main branch, last one 7 days ago