39 results found Sort:
- Filter by Primary Language:
- C++ (19)
- Python (4)
- Fortran (3)
- Shell (3)
- Julia (1)
- Jupyter Notebook (1)
- Nix (1)
- Go (1)
- C (1)
- Dockerfile (1)
- Assembly (1)
- +
A high-throughput and memory-efficient inference and serving engine for LLMs
Created
2023-02-09
1,444 commits to main branch, last one 12 hours ago
Open deep learning compiler stack for cpu, gpu and specialized accelerators
Created
2016-10-12
12,433 commits to main branch, last one 19 hours ago
NumPy & SciPy for GPU
Created
2016-11-01
28,514 commits to main branch, last one a day ago
This repository has no description...
This repository has been archived
(exclude archived)
Created
2016-06-23
359 commits to master branch, last one 5 years ago
A deep learning package for many-body potential energy representation and molecular dynamics
Created
2017-12-12
2,501 commits to r2 branch, last one about a month ago
stdgpu: Efficient STL-like Data Structures on the GPU
Created
2019-08-16
537 commits to master branch, last one about a month ago
PygmalionAI's large-scale inference engine
Created
2023-06-23
632 commits to main branch, last one 15 days ago
Dockerfiles for the various software layers defined in the ROCm software platform
Created
2016-02-05
187 commits to master branch, last one 11 days ago
Abstraction Library for Parallel Kernel Acceleration :llama:
Created
2014-11-05
2,895 commits to develop branch, last one 4 days ago
Next generation BLAS implementation for ROCm platform
Created
2015-10-08
5,196 commits to develop branch, last one 14 hours ago
Agenium Scale vectorization library for CPUs and GPUs
Created
2019-04-10
172 commits to master branch, last one 2 years ago
AMD GPU (ROCm) programming in Julia
Created
2020-07-02
1,079 commits to master branch, last one 6 days ago
Kubernetes (k8s) device plugin to enable registration of AMD GPU to a container cluster
Created
2018-04-03
97 commits to master branch, last one 22 days ago
HPC solver for nonlinear optimization problems
Created
2017-12-05
933 commits to develop branch, last one 24 days ago
AOMP is an open source Clang/LLVM based compiler with added support for the OpenMP® API on Radeon™ GPUs. Use this repository for releases, issues, documentation, packaging, and examples.
Created
2019-01-19
3,734 commits to aomp-dev branch, last one 19 hours ago
MIVisionX toolkit is a set of comprehensive computer vision and machine intelligence libraries, utilities, and applications bundled into a single toolkit. AMD MIVisionX also delivers a highly optimize...
Created
2018-12-20
919 commits to develop branch, last one 16 hours ago
Distributed Communication-Optimal Matrix-Matrix Multiplication Algorithm
Created
2018-03-01
559 commits to master branch, last one 19 hours ago
GPUFORT: S2S translation tool for CUDA Fortran and Fortran+X in the spirit of hipify
Created
2021-02-15
313 commits to main branch, last one 2 years ago
Next generation FFT implementation for ROCm
Created
2016-03-03
1,886 commits to develop branch, last one 16 hours ago
ROCm Parallel Primitives
Created
2017-12-13
1,491 commits to develop branch, last one 10 days ago
AUTOMATIC1111/stable-diffusion-webui for CUDA and ROCm on NixOS
Created
2022-12-31
17 commits to master branch, last one 6 months ago
Stable Diffusion Docker image preconfigured for usage with AMD Radeon cards
Created
2022-08-29
15 commits to main branch, last one about a year ago
Domain specific library for electronic structure calculations
Created
2015-10-16
8,813 commits to develop branch, last one 22 hours ago
RAND library for HIP programming language
Created
2017-07-31
1,289 commits to develop branch, last one 9 days ago
ROCm BLAS marshalling library
Created
2017-04-10
1,103 commits to develop branch, last one a day ago
Install guide of ROCm and Tensorflow on Ubuntu for the RX580
Created
2020-11-05
44 commits to main branch, last one about a year ago
AMD OpenCL userspace drivers for Fedora. Currently not working for fedora 37
This repository has been archived
(exclude archived)
Created
2022-01-02
65 commits to master branch, last one about a year ago
Next generation LAPACK implementation for ROCm platform
Created
2018-05-22
667 commits to develop branch, last one 3 days ago
The PennyLane-Lightning plugin provides a fast state-vector simulator written in C++ for use with PennyLane
Created
2020-07-06
495 commits to master branch, last one a day ago