41 results found Sort:
- Filter by Primary Language:
- C++ (20)
- Python (4)
- Fortran (3)
- Shell (3)
- Julia (2)
- Jupyter Notebook (1)
- Nix (1)
- Go (1)
- C (1)
- Cuda (1)
- Dockerfile (1)
- Assembly (1)
- +
A high-throughput and memory-efficient inference and serving engine for LLMs
Created
2023-02-09
3,290 commits to main branch, last one 7 hours ago
Open deep learning compiler stack for cpu, gpu and specialized accelerators
Created
2016-10-12
12,722 commits to main branch, last one 2 days ago
NumPy & SciPy for GPU
Created
2016-11-01
29,264 commits to main branch, last one a day ago
This repository has no description...
This repository has been archived
(exclude archived)
Created
2016-06-23
359 commits to master branch, last one 6 years ago
A deep learning package for many-body potential energy representation and molecular dynamics
Created
2017-12-12
2,534 commits to r2 branch, last one about a month ago
stdgpu: Efficient STL-like Data Structures on the GPU
Created
2019-08-16
550 commits to master branch, last one 15 days ago
Large-scale LLM inference engine
Created
2023-06-23
801 commits to main branch, last one 2 days ago
Dockerfiles for the various software layers defined in the ROCm software platform
Created
2016-02-05
197 commits to master branch, last one 2 months ago
Abstraction Library for Parallel Kernel Acceleration :llama:
Created
2014-11-05
3,040 commits to develop branch, last one 27 days ago
Next generation BLAS implementation for ROCm platform
Created
2015-10-08
5,405 commits to develop branch, last one a day ago
Agenium Scale vectorization library for CPUs and GPUs
Created
2019-04-10
172 commits to master branch, last one 3 years ago
AMD GPU (ROCm) programming in Julia
Created
2020-07-02
1,122 commits to master branch, last one 20 days ago
Kubernetes (k8s) device plugin to enable registration of AMD GPU to a container cluster
Created
2018-04-03
116 commits to master branch, last one 16 days ago
HPC solver for nonlinear optimization problems
Created
2017-12-05
940 commits to develop branch, last one about a month ago
AOMP is an open source Clang/LLVM based compiler with added support for the OpenMP® API on Radeon™ GPUs. Use this repository for releases, issues, documentation, packaging, and examples.
Created
2019-01-19
4,032 commits to aomp-dev branch, last one a day ago
Distributed Communication-Optimal Matrix-Matrix Multiplication Algorithm
Created
2018-03-01
560 commits to master branch, last one 13 days ago
MIVisionX toolkit is a set of comprehensive computer vision and machine intelligence libraries, utilities, and applications bundled into a single toolkit. AMD MIVisionX also delivers a highly optimize...
Created
2018-12-20
975 commits to develop branch, last one 8 days ago
Zero-knowledge template library
Created
2022-06-14
350 commits to main branch, last one a day ago
Next generation FFT implementation for ROCm
Created
2016-03-03
1,993 commits to develop branch, last one a day ago
GPUFORT: S2S translation tool for CUDA Fortran and Fortran+X in the spirit of hipify
Created
2021-02-15
313 commits to main branch, last one 3 years ago
ROCm Parallel Primitives
Created
2017-12-13
1,527 commits to develop branch, last one a day ago
AUTOMATIC1111/stable-diffusion-webui for CUDA and ROCm on NixOS
Created
2022-12-31
17 commits to master branch, last one 11 months ago
Domain specific library for electronic structure calculations
Created
2015-10-16
8,834 commits to develop branch, last one 21 hours ago
Stable Diffusion Docker image preconfigured for usage with AMD Radeon cards
Created
2022-08-29
15 commits to main branch, last one about a year ago
ROCm BLAS marshalling library
Created
2017-04-10
1,170 commits to develop branch, last one 2 days ago
Install guide of ROCm and Tensorflow on Ubuntu for the RX580
Created
2020-11-05
45 commits to main branch, last one about a month ago
RAND library for HIP programming language
Created
2017-07-31
1,346 commits to develop branch, last one 15 days ago
AMD OpenCL userspace drivers for Fedora. Currently not working for fedora 37
This repository has been archived
(exclude archived)
Created
2022-01-02
65 commits to master branch, last one about a year ago
Next generation LAPACK implementation for ROCm platform
Created
2018-05-22
735 commits to develop branch, last one a day ago
The PennyLane-Lightning plugin provides a fast state-vector simulator written in C++ for use with PennyLane
Created
2020-07-06
634 commits to master branch, last one 2 days ago