41 results found Sort:
- Filter by Primary Language:
- C++ (20)
- Python (4)
- Fortran (3)
- Shell (3)
- Julia (2)
- Jupyter Notebook (1)
- Nix (1)
- Go (1)
- C (1)
- Cuda (1)
- Dockerfile (1)
- Assembly (1)
- +
A high-throughput and memory-efficient inference and serving engine for LLMs
Created
2023-02-09
3,511 commits to main branch, last one 13 hours ago
Open deep learning compiler stack for cpu, gpu and specialized accelerators
Created
2016-10-12
12,740 commits to main branch, last one 2 days ago
NumPy & SciPy for GPU
Created
2016-11-01
29,305 commits to main branch, last one a day ago
This repository has no description...
This repository has been archived
(exclude archived)
Created
2016-06-23
359 commits to master branch, last one 6 years ago
A deep learning package for many-body potential energy representation and molecular dynamics
Created
2017-12-12
2,543 commits to r2 branch, last one 2 days ago
stdgpu: Efficient STL-like Data Structures on the GPU
Created
2019-08-16
565 commits to master branch, last one a day ago
Large-scale LLM inference engine
Created
2023-06-23
825 commits to main branch, last one 21 hours ago
Dockerfiles for the various software layers defined in the ROCm software platform
Created
2016-02-05
197 commits to master branch, last one 3 months ago
Abstraction Library for Parallel Kernel Acceleration :llama:
Created
2014-11-05
3,046 commits to develop branch, last one a day ago
Next generation BLAS implementation for ROCm platform
Created
2015-10-08
5,422 commits to develop branch, last one a day ago
Agenium Scale vectorization library for CPUs and GPUs
Created
2019-04-10
172 commits to master branch, last one 3 years ago
AMD GPU (ROCm) programming in Julia
Created
2020-07-02
1,129 commits to master branch, last one a day ago
Kubernetes (k8s) device plugin to enable registration of AMD GPU to a container cluster
Created
2018-04-03
117 commits to master branch, last one 2 days ago
HPC solver for nonlinear optimization problems
Created
2017-12-05
942 commits to develop branch, last one 9 days ago
AOMP is an open source Clang/LLVM based compiler with added support for the OpenMP® API on Radeon™ GPUs. Use this repository for releases, issues, documentation, packaging, and examples.
Created
2019-01-19
4,061 commits to aomp-dev branch, last one a day ago
Distributed Communication-Optimal Matrix-Matrix Multiplication Algorithm
Created
2018-03-01
560 commits to master branch, last one 27 days ago
MIVisionX toolkit is a set of comprehensive computer vision and machine intelligence libraries, utilities, and applications bundled into a single toolkit. AMD MIVisionX also delivers a highly optimize...
Created
2018-12-20
983 commits to develop branch, last one a day ago
Zero-knowledge template library
Created
2022-06-14
350 commits to main branch, last one 15 days ago
Next generation FFT implementation for ROCm
Created
2016-03-03
2,001 commits to develop branch, last one 2 days ago
GPUFORT: S2S translation tool for CUDA Fortran and Fortran+X in the spirit of hipify
Created
2021-02-15
313 commits to main branch, last one 3 years ago
ROCm Parallel Primitives
Created
2017-12-13
1,539 commits to develop branch, last one 8 hours ago
AUTOMATIC1111/stable-diffusion-webui for CUDA and ROCm on NixOS
Created
2022-12-31
17 commits to master branch, last one about a year ago
Domain specific library for electronic structure calculations
Created
2015-10-16
8,835 commits to develop branch, last one 13 days ago
Stable Diffusion Docker image preconfigured for usage with AMD Radeon cards
Created
2022-08-29
15 commits to main branch, last one about a year ago
ROCm BLAS marshalling library
Created
2017-04-10
1,175 commits to develop branch, last one a day ago
Install guide of ROCm and Tensorflow on Ubuntu for the RX580
Created
2020-11-05
45 commits to main branch, last one about a month ago
RAND library for HIP programming language
Created
2017-07-31
1,352 commits to develop branch, last one 11 hours ago
Next generation LAPACK implementation for ROCm platform
Created
2018-05-22
742 commits to develop branch, last one 20 hours ago
The PennyLane-Lightning plugin provides a fast state-vector simulator written in C++ for use with PennyLane
Created
2020-07-06
649 commits to master branch, last one 19 hours ago
AMD OpenCL userspace drivers for Fedora. Currently not working for fedora 37
This repository has been archived
(exclude archived)
Created
2022-01-02
65 commits to master branch, last one about a year ago