39 results found Sort:

2.8k
20.3k
apache-2.0
194
A high-throughput and memory-efficient inference and serving engine for LLMs
Created 2023-02-09
1,444 commits to main branch, last one 12 hours ago
3.4k
11.3k
apache-2.0
381
Open deep learning compiler stack for cpu, gpu and specialized accelerators
Created 2016-10-12
12,433 commits to main branch, last one 19 hours ago
776
7.9k
mit
127
NumPy & SciPy for GPU
Created 2016-11-01
28,514 commits to main branch, last one a day ago
280
1.7k
apache-2.0
140
This repository has no description...
This repository has been archived (exclude archived)
Created 2016-06-23
359 commits to master branch, last one 5 years ago
484
1.4k
lgpl-3.0
47
A deep learning package for many-body potential energy representation and molecular dynamics
Created 2017-12-12
2,501 commits to r2 branch, last one about a month ago
78
1.1k
apache-2.0
29
stdgpu: Efficient STL-like Data Structures on the GPU
Created 2019-08-16
537 commits to master branch, last one about a month ago
PygmalionAI's large-scale inference engine
Created 2023-06-23
632 commits to main branch, last one 15 days ago
68
395
mit
60
Dockerfiles for the various software layers defined in the ROCm software platform
Created 2016-02-05
187 commits to master branch, last one 11 days ago
67
328
mpl-2.0
21
Abstraction Library for Parallel Kernel Acceleration :llama:
Created 2014-11-05
2,895 commits to develop branch, last one 4 days ago
152
326
other
59
Next generation BLAS implementation for ROCm platform
Created 2015-10-08
5,196 commits to develop branch, last one 14 hours ago
Agenium Scale vectorization library for CPUs and GPUs
Created 2019-04-10
172 commits to master branch, last one 2 years ago
38
268
other
17
AMD GPU (ROCm) programming in Julia
Created 2020-07-02
1,079 commits to master branch, last one 6 days ago
44
236
apache-2.0
19
Kubernetes (k8s) device plugin to enable registration of AMD GPU to a container cluster
Created 2018-04-03
97 commits to master branch, last one 22 days ago
42
190
apache-2.0
32
AOMP is an open source Clang/LLVM based compiler with added support for the OpenMP® API on Radeon™ GPUs. Use this repository for releases, issues, documentation, packaging, and examples.
Created 2019-01-19
3,734 commits to aomp-dev branch, last one 19 hours ago
72
184
mit
24
MIVisionX toolkit is a set of comprehensive computer vision and machine intelligence libraries, utilities, and applications bundled into a single toolkit. AMD MIVisionX also delivers a highly optimize...
Created 2018-12-20
919 commits to develop branch, last one 16 hours ago
27
179
bsd-3-clause
21
Distributed Communication-Optimal Matrix-Matrix Multiplication Algorithm
Created 2018-03-01
559 commits to master branch, last one 19 hours ago
14
157
mit
25
GPUFORT: S2S translation tool for CUDA Fortran and Fortran+X in the spirit of hipify
Created 2021-02-15
313 commits to main branch, last one 2 years ago
81
152
other
52
Next generation FFT implementation for ROCm
Created 2016-03-03
1,886 commits to develop branch, last one 16 hours ago
64
147
mit
48
ROCm Parallel Primitives
Created 2017-12-13
1,491 commits to develop branch, last one 10 days ago
AUTOMATIC1111/stable-diffusion-webui for CUDA and ROCm on NixOS
Created 2022-12-31
17 commits to master branch, last one 6 months ago
Stable Diffusion Docker image preconfigured for usage with AMD Radeon cards
Created 2022-08-29
15 commits to main branch, last one about a year ago
39
114
bsd-3-clause
17
Domain specific library for electronic structure calculations
Created 2015-10-16
8,813 commits to develop branch, last one 22 hours ago
64
103
mit
49
RAND library for HIP programming language
Created 2017-07-31
1,289 commits to develop branch, last one 9 days ago
69
101
other
37
ROCm BLAS marshalling library
Created 2017-04-10
1,103 commits to develop branch, last one a day ago
Install guide of ROCm and Tensorflow on Ubuntu for the RX580
Created 2020-11-05
44 commits to main branch, last one about a year ago
AMD OpenCL userspace drivers for Fedora. Currently not working for fedora 37
This repository has been archived (exclude archived)
Created 2022-01-02
65 commits to master branch, last one about a year ago
43
87
other
28
Next generation LAPACK implementation for ROCm platform
Created 2018-05-22
667 commits to develop branch, last one 3 days ago
The PennyLane-Lightning plugin provides a fast state-vector simulator written in C++ for use with PennyLane
Created 2020-07-06
495 commits to master branch, last one a day ago
32
65
other
23
Fortran interfaces for ROCm libraries
Created 2020-05-13
292 commits to develop branch, last one 3 days ago