30 results found Sort:

518
3.5k
mit
142
HIP: C++ Heterogeneous-Compute Interface for Portability
Created 2016-01-07
7,097 commits to develop branch, last one 3 days ago
88
1.5k
mit
35
Vulkan/CUDA/HIP/OpenCL/Level Zero/Metal Fast Fourier Transform library
Created 2020-08-02
290 commits to master branch, last one 4 months ago
80
1.1k
apache-2.0
29
stdgpu: Efficient STL-like Data Structures on the GPU
Created 2019-08-16
541 commits to master branch, last one 8 days ago
129
439
mit
19
:rocket: Cenit IO - 100% open source integration Platform (iPaaS)
Created 2014-06-01
8,700 commits to master branch, last one about a year ago
86
383
bsd-3-clause
24
Numerical linear algebra software package
Created 2018-01-11
7,743 commits to develop branch, last one 19 hours ago
81
382
mit
29
Portable and vendor neutral framework for parallel programming on heterogeneous platforms.
Created 2015-07-02
3,287 commits to main branch, last one 3 months ago
61
344
gpl-2.0
17
A GPU benchmark tool for evaluating GPUs and CPUs on mixed operational intensity kernels (CUDA, OpenCL, HIP, SYCL, OpenMP)
Created 2015-06-25
223 commits to master branch, last one 4 months ago
69
336
mpl-2.0
21
Abstraction Library for Parallel Kernel Acceleration :llama:
Created 2014-11-05
2,918 commits to develop branch, last one 2 days ago
152
327
other
59
Next generation BLAS implementation for ROCm platform
Created 2015-10-08
5,244 commits to develop branch, last one 19 hours ago
75
202
apache-2.0
18
WeCross跨链路由
Created 2019-08-22
540 commits to master branch, last one 3 months ago
136
198
mit
55
Stretching GPU performance for GEMMs and tensor contractions.
Created 2015-11-05
5,439 commits to develop branch, last one 16 hours ago
70
180
bsd-3-clause
5
This repository has no description...
Created 2020-08-06
4,156 commits to master branch, last one 2 days ago
14
158
mit
25
GPUFORT: S2S translation tool for CUDA Fortran and Fortran+X in the spirit of hipify
Created 2021-02-15
313 commits to main branch, last one 2 years ago
26
155
other
10
chipStar is a tool for compiling and running HIP/CUDA on SPIR-V via OpenCL or Level Zero APIs.
Created 2021-09-15
2,507 commits to main branch, last one 4 days ago
81
154
other
52
Next generation FFT implementation for ROCm
Created 2016-03-03
1,907 commits to develop branch, last one 21 hours ago
67
150
mit
48
ROCm Parallel Primitives
Created 2017-12-13
1,497 commits to develop branch, last one 17 hours ago
17
113
unknown
11
Haskell Image Processing Library
Created 2013-10-25
264 commits to master branch, last one about a year ago
DFT-FE: Real-space DFT calculations using Finite Elements
Created 2018-07-04
5,532 commits to publicGithubDevelop branch, last one 2 days ago
19
107
mit
19
An implementation of HIP that works on CPUs, across OSes.
Created 2020-08-28
177 commits to master branch, last one 3 months ago
63
105
mit
49
RAND library for HIP programming language
Created 2017-07-31
1,302 commits to develop branch, last one 10 days ago
70
104
other
37
ROCm BLAS marshalling library
Created 2017-04-10
1,120 commits to develop branch, last one 21 hours ago
59
103
bsd-3-clause
23
The Arbor multi-compartment neural network simulation library.
Created 2016-10-03
1,699 commits to master branch, last one 5 days ago
8
74
other
4
An Upstream Clang/LLVM-based toolchain for contemporary C++ and heterogeneous programming
Created 2022-02-09
408 commits to main branch, last one 28 days ago
33
66
other
22
Fortran interfaces for ROCm libraries
Created 2020-05-13
300 commits to develop branch, last one 11 days ago
10
58
bsl-1.0
6
pika builds on C++ std::execution with fiber, CUDA, HIP, and MPI support.
Created 2022-01-17
3,019 commits to main branch, last one 2 days ago
A Benchmark Suite for Heterogeneous System Computation
Created 2015-07-20
836 commits to Develop branch, last one 2 years ago
35
52
mit
16
AMD ROCm Performance Primitives (RPP) library is a comprehensive high-performance computer vision library for AMD processors with HIP/OpenCL/CPU back-ends.
Created 2019-08-28
1,292 commits to develop branch, last one a day ago
14 basic topics for VEGA64 performance optmization
Created 2019-07-29
39 commits to master branch, last one 3 years ago
AMD ROCm Installation Guide on RX 6600 XT + TensorFlow and PyTorch
Created 2023-06-07
5 commits to main branch, last one about a year ago
56
40
mit
14
hipBLASLt is a library that provides general matrix-matrix operations with a flexible API and extends functionalities beyond a traditional BLAS library
Created 2022-09-16
1,122 commits to develop branch, last one 23 hours ago