Trending repositories for topic hpc

Last 3 days (new repositories)

no newly created repositories trending in the last 3 days

Last 3 days (absolute gain)

JuliaLang/julia

The Julia Programming Language

46,024 (+23)

mit

hpcaitech/ColossalAI

Making large AI models cheaper, faster and more accessible

38,933 (+16)

apache-2.0

luispedro/jug

Parallel programming with Python

443 (+11)

mit

zml/zml

Any model. Any hardware. Zero compromise. Built with @ziglang / @openxla / MLIR / @bazelbuild

1,733 (+9)

apache-2.0

ProjectPhysX/FluidX3D

The fastest and most memory efficient lattice Boltzmann CFD software, running on all GPUs and CPUs via OpenCL. Free for non-commercial use.

4,104 (+8)

trevor-vincent/awesome-high-performance-computing

A curated list of awesome high performance computing resources

708 (+6)

volcano-sh/volcano

A Cloud Native Batch System (Project under CNCF)

4,310 (+5)

apache-2.0

openucx/ucx

Unified Communication X (mailing list - https://elist.ornl.gov/mailman/listinfo/ucx-group)

1,185 (+4)

open-mpi/ompi

Open MPI main development repository

2,207 (+4)

Liu-xiandong/How_to_optimize_in_GPU

This is a series of GPU optimization topics. Here we will introduce how to optimize the CUDA kernel in detail. I will introduce several basic kernel optimizations, including: elementwise, reduce, sg...

861 (+3)

apache-2.0

NVIDIA/cccl

CUDA Core Compute Libraries

1,342 (+3)

envmodules/modules

Environment Modules: provides dynamic modification of a user's environment

727 (+3)

gpl-2.0

apptainer/apptainer

Apptainer: Application containers for Linux

1,165 (+3)

cp2k/cp2k

Quantum chemistry and solid state physics software package

868 (+2)

gpl-2.0

seamplex/feenox

Cloud-first free no-fee no-X uniX-like finite-element(ish) computational engineering tool

72 (+1)

nebius/soperator

Run Slurm in Kubernetes

137 (+1)

apache-2.0

openucx/ucc

Unified Collective Communication Library

219 (+1)

bsd-3-clause

XiaoSong9905/CUDA-Optimization-Guide

Xiao's CUDA Optimization Guide [Actively Adding New Content]

246 (+1)

gpl-3.0

jfalcou/eve

Expressive Vector Engine - SIMD in C++ Goes Brrrr

982 (+1)

bsl-1.0

NVIDIA/MatX

An efficient C++17 GPU numerical computing library with Python-like syntax

1,231 (+1)

bsd-3-clause

Last 3 days (relative gain)

luispedro/jug

Parallel programming with Python

443 (+3%)

mit

seamplex/feenox

Cloud-first free no-fee no-X uniX-like finite-element(ish) computational engineering tool

72 (+1%)

trevor-vincent/awesome-high-performance-computing

A curated list of awesome high performance computing resources

708 (+0.9%)

nebius/soperator

Run Slurm in Kubernetes

137 (+0.7%)

apache-2.0

zml/zml

Any model. Any hardware. Zero compromise. Built with @ziglang / @openxla / MLIR / @bazelbuild

1,733 (+0.5%)

apache-2.0

openucx/ucc

Unified Collective Communication Library

219 (+0.5%)

bsd-3-clause

envmodules/modules

Environment Modules: provides dynamic modification of a user's environment

727 (+0.4%)

gpl-2.0

XiaoSong9905/CUDA-Optimization-Guide

Xiao's CUDA Optimization Guide [Actively Adding New Content]

246 (+0.4%)

gpl-3.0

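Liu-xiandong/How_to_optimize_in_GPU

This is a series of GPU optimization topics. Here we will introduce how to optimize the CUDA kernel in detail. I will introduce several basic kernel optimizations, including: elementwise, reduce, sg...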

861 (+0.3%)

apache-2.0

openucx/ucx

Unified Communication X (mailing list - https://elist.ornl.gov/mailman/listinfo/ucx-group)

1,185 (+0.3%)

apptainer/apptainer

Apptainer: Application containers for Linux

1,165 (+0.3%)

cp2k/cp2k

Quantum chemistry and solid state physics software package

868 (+0.2%)

gpl-2.0

NVIDIA/cccl

CUDA Core Compute Libraries

1,342 (+0.2%)

ProjectPhysX/FluidX3D

The fastest and most memory efficient lattice Boltzmann CFD software, running on all GPUs and CPUs via OpenCL. Free for non-commercial use.

4,104 (+0.2%)

open-mpi/ompi

Open MPI main development repository

2,207 (+0.2%)

nextflow-io/nextflow

A DSL for data-driven computational pipelines

2,798 (+0.2%)

apache-2.0

volcano-sh/volcano

A Cloud Native Batch System (Project under CNCF)

4,310 (+0.1%)

apache-2.0

mfem/mfem

Lightweight, general, scalable C++ library for finite element methods

1,770 (+0.1%)

bsd-3-clause

jfalcou/eve

Expressive Vector Engine - SIMD in C++ Goes Brrrr

982 (+0.1%)

bsl-1.0

NVIDIA/MatX

An efficient C++17 GPU numerical computing library with Python-like syntax

1,231 (+0.1%)

bsd-3-clause

Last week (new repositories)

no newly created repositories trending in the last week

Last week (absolute gain)

JuliaLang/julia

The Julia Programming Language

46,024 (+38)

mit

luispedro/jug

Parallel programming with Python

443 (+30)

mit

ProjectPhysX/FluidX3D

The fastest and most memory efficient lattice Boltzmann CFD software, running on all GPUs and CPUs via OpenCL. Free for non-commercial use.

4,104 (+30)

hpcaitech/ColossalAI

Making large AI models cheaper, faster and more accessible

38,933 (+27)

apache-2.0

zml/zml

Any model. Any hardware. Zero compromise. Built with @ziglang / @openxla / MLIR / @bazelbuild

1,733 (+23)

apache-2.0

volcano-sh/volcano

A Cloud Native Batch System (Project under CNCF)

4,310 (+11)

apache-2.0

trevor-vincent/awesome-high-performance-computing

A curated list of awesome high performance computing resources

708 (+11)

nebius/soperator

Run Slurm in Kubernetes

137 (+9)

apache-2.0

open-mpi/ompi

Open MPI main development repository

2,207 (+7)

envmodules/modules

Environment Modules: provides dynamic modification of a user's environment

727 (+7)

gpl-2.0

AdaptiveCpp/AdaptiveCpp

Implementation of SYCL and C++ standard parallelism for CPUs and GPUs from all vendors: The independent, community-driven compiler for C++-based heterogeneous programming models. Lets applications ada...

1,425 (+6)

bsd-2-clause

LLNL/sundials

Official development repository for SUNDIALS - a SUite of Nonlinear and DIfferential/ALgebraic equation Solvers. Pull requests are welcome for bug fixes and minor changes.

534 (+6)

bsd-3-clause

nndeploy/nndeploy

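nndeploy is an end-to-end model deployment framework. Built on multi-platform inference and model deployment based on directed acyclic graphs, it aims to give users a cross-platform, easy-to-use, high-performance model deployment experience.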

665 (+6)

apache-2.0

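Liu-xiandong/How_to_optimize_in_GPU

This is a series of GPU optimization topics. Here we will introduce how to optimize the CUDA kernel in detail. I will introduce several basic kernel optimizations, including: elementwise, reduce, sg...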

861 (+6)

apache-2.0

NVIDIA/cccl

CUDA Core Compute Libraries

1,342 (+6)

mfem/mfem

Lightweight, general, scalable C++ library for finite element methods

1,770 (+6)

bsd-3-clause

openucx/ucx

Unified Communication X (mailing list - https://elist.ornl.gov/mailman/listinfo/ucx-group)

1,185 (+5)

romeric/Fastor

A lightweight high performance tensor algebra framework for modern C++

764 (+5)

mit

apptainer/apptainer

Apptainer: Application containers for Linux

1,165 (+5)

XiaoSong9905/CUDA-Optimization-Guide

Xiao's CUDA Optimization Guide [Actively Adding New Content]

246 (+4)

gpl-3.0

Last week (relative gain)

luispedro/jug

Parallel programming with Python

443 (+7%)

mit

nebius/soperator

Run Slurm in Kubernetes

137 (+7%)

apache-2.0

ProjectPhysX/PTXprofiler

A simple profiler to count Nvidia PTX assembly instructions of OpenCL/SYCL/CUDA kernels for roofline model analysis.

46 (+2%)

pipefunc/pipefunc

Lightweight fast function pipeline (DAG) creation in pure Python for scientific workflows 🕸️🧪

233 (+2%)

mit

XiaoSong9905/CUDA-Optimization-Guide

Xiao's CUDA Optimization Guide [Actively Adding New Content]

246 (+2%)

gpl-3.0

trevor-vincent/awesome-high-performance-computing

A curated list of awesome high performance computing resources

708 (+2%)

seamplex/feenox

Cloud-first free no-fee no-X uniX-like finite-element(ish) computational engineering tool

72 (+1%)

openucx/ucc

Unified Collective Communication Library

219 (+1%)

bsd-3-clause

zml/zml

Any model. Any hardware. Zero compromise. Built with @ziglang / @openxla / MLIR / @bazelbuild

1,733 (+1%)

apache-2.0

LLNL/sundials

Official development repository for SUNDIALS - a SUite of Nonlinear and DIfferential/ALgebraic equation Solvers. Pull requests are welcome for bug fixes and minor changes.

534 (+1%)

bsd-3-clause

rackslab/Slurm-web

Open source web interface for Slurm HPC clusters

356 (+1%)

gpl-3.0

pegasus-isi/pegasus

Pegasus Workflow Management System - Automate, recover, and debug scientific computations.

181 (+1%)

apache-2.0

ThinkParQ/beegfs

Public repository for the BeeGFS Parallel File System

92 (+1%)

envmodules/modules

Environment Modules: provides dynamic modification of a user's environment

727 (+1.0%)

gpl-2.0

nndeploy/nndeploy

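nndeploy is an end-to-end model deployment framework. Built on multi-platform inference and model deployment based on directed acyclic graphs, it aims to give users a cross-platform, easy-to-use, high-performance model deployment experience.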

665 (+0.9%)

apache-2.0

CHIP-SPV/chipStar

chipStar is a tool for compiling and running HIP/CUDA on SPIR-V via OpenCL or Level Zero APIs.

231 (+0.9%)

SeisSol/SeisSol

A scientific software for the numerical simulation of seismic wave phenomena and earthquake dynamics

273 (+0.7%)

bsd-3-clause

ProjectPhysX/FluidX3D

The fastest and most memory efficient lattice Boltzmann CFD software, running on all GPUs and CPUs via OpenCL. Free for non-commercial use.

4,104 (+0.7%)

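Liu-xiandong/How_to_optimize_in_GPU

This is a series of GPU optimization topics. Here we will introduce how to optimize the CUDA kernel in detail. I will introduce several basic kernel optimizations, including: elementwise, reduce, sg...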

861 (+0.7%)

apache-2.0

opendilab/DI-hpc

OpenDILab RL HPC OP Lib, including CUDA and Triton kernel

226 (+0.4%)

apache-2.0

Last month (new repositories)

no newly created repositories trending in the last month

Last month (absolute gain)

JuliaLang/julia

The Julia Programming Language

46,024 (+200)

mit

ProjectPhysX/FluidX3D

The fastest and most memory efficient lattice Boltzmann CFD software, running on all GPUs and CPUs via OpenCL. Free for non-commercial use.

4,104 (+166)

hpcaitech/ColossalAI

Making large AI models cheaper, faster and more accessible

38,933 (+110)

apache-2.0

zml/zml

Any model. Any hardware. Zero compromise. Built with @ziglang / @openxla / MLIR / @bazelbuild

1,733 (+72)

apache-2.0

volcano-sh/volcano

A Cloud Native Batch System (Project under CNCF)

4,310 (+72)

apache-2.0

spack/spack

A flexible package manager that supports multiple versions, configurations, platforms, and compilers.

4,472 (+65)

NVIDIA/cccl

CUDA Core Compute Libraries

1,342 (+58)

mfem/mfem

Lightweight, general, scalable C++ library for finite element methods

1,770 (+49)

bsd-3-clause

trevor-vincent/awesome-high-performance-computing

A curated list of awesome high performance computing resources

708 (+45)

nebius/soperator

Run Slurm in Kubernetes

137 (+35)

apache-2.0

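Liu-xiandong/How_to_optimize_in_GPU

This is a series of GPU optimization topics. Here we will introduce how to optimize the CUDA kernel in detail. I will introduce several basic kernel optimizations, including: elementwise, reduce, sg...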

861 (+33)

apache-2.0

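AdaptiveCpp/AdaptiveCpp

Implementation of SYCL and C++ standard parallelism for CPUs and GPUs from all vendors: The independent, community-driven compiler for C++-based heterogeneous programming models. Lets applications ada...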

1,425 (+32)

bsd-2-clause

luispedro/jug

Parallel programming with Python

443 (+32)

mit

open-mpi/ompi

Open MPI main development repository

2,207 (+32)

nndeploy/nndeploy

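nndeploy is an end-to-end model deployment framework. Built on multi-platform inference and model deployment based on directed acyclic graphs, it aims to give users a cross-platform, easy-to-use, high-performance model deployment experience.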

665 (+31)

apache-2.0

apptainer/apptainer

Apptainer: Application containers for Linux

1,165 (+30)

openucx/ucx

Unified Communication X (mailing list - https://elist.ornl.gov/mailman/listinfo/ucx-group)

1,185 (+27)

nextflow-io/nextflow

A DSL for data-driven computational pipelines

2,798 (+27)

apache-2.0

arrayfire/arrayfire

ArrayFire: a general purpose GPU library.

4,591 (+23)

bsd-3-clause

sslotin/amh-code

Complete implementations from "Algorithms for Modern Hardware"

701 (+21)

Last month (relative gain)

nebius/soperator

Run Slurm in Kubernetes

137 (+34%)

apache-2.0

mahendrapaipuri/ceems

A Prometheus exporter and a REST API server to export metrics of compute units of resource managers like SLURM, Openstack, k8s, etc.

25 (+25%)

gpl-3.0

CLAIRE-Labo/python-ml-research-template

A template for starting reproducible Python machine-learning projects with hardware acceleration. Find an example at https://github.com/CLAIRE-Labo/no-representation-no-trust

80 (+18%)

mit

Qiskit/qiskit-addon-sqd

Sample-based Quantum Diagonalization: Classically postprocess noisy quantum samples to yield more accurate eigenvalue estimations.

37 (+12%)

apache-2.0

icl-utk-edu/slate

SLATE is a distributed, GPU-accelerated, dense linear algebra library targeting current and upcoming high-performance computing (HPC) systems. It is developed as part of the U.S. Department of Energy...

102 (+9%)

bsd-3-clause

luispedro/jug

Parallel programming with Python

443 (+8%)

mit

pipefunc/pipefunc

Lightweight fast function pipeline (DAG) creation in pure Python for scientific workflows 🕸️🧪

233 (+7%)

mit

LLNL/benchpark

An open collaborative repository for reproducible specifications of HPC benchmarks and cross site benchmarking environments

31 (+7%)

apache-2.0

trevor-vincent/awesome-high-performance-computing

A curated list of awesome high performance computing resources

708 (+7%)

coderonion/awesome-cuda-triton-hpc

🔥🔥🔥 A collection of some awesome public CUDA, cuBLAS, TensorRT, TensorRT-LLM, Triton and High Performance Computing (HPC) projects.

168 (+6%)

chenggroup/ai2-kit

A toolkit featuring artificial intelligence × ab initio methods for computational chemistry research.

52 (+6%)

mit

FluidNumerics/SELF

Spectral Element Library in Fortran

72 (+6%)

bsd-3-clause

FZJ-JSC/tutorial-multi-gpu

Efficient Distributed GPU Programming for Exascale, an SC/ISC Tutorial

199 (+6%)

mit

DLR-AMR/t8code

Parallel algorithms and data structures for tree-based adaptive mesh refinement (AMR) with arbitrary element shapes.

163 (+6%)

gpl-2.0

openucx/ucc

Unified Collective Communication Library

219 (+5%)

bsd-3-clause

nndeploy/nndeploy

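nndeploy is an end-to-end model deployment framework. Built on multi-platform inference and model deployment based on directed acyclic graphs, it aims to give users a cross-platform, easy-to-use, high-performance model deployment experience.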

665 (+5%)

apache-2.0

It4innovations/hyperqueue

Scheduler for sub-node tasks for HPC systems with batch scheduling

292 (+5%)

mit

ThinkParQ/beegfs

Public repository for the BeeGFS Parallel File System

92 (+5%)

ProjectPhysX/PTXprofiler

A simple profiler to count Nvidia PTX assembly instructions of OpenCL/SYCL/CUDA kernels for roofline model analysis.

46 (+5%)

NVIDIA/cccl

CUDA Core Compute Libraries

1,342 (+5%)

Last 12 months (new repositories)

zml/zml

Any model. Any hardware. Zero compromise. Built with @ziglang / @openxla / MLIR / @bazelbuild

1,733

apache-2.0

nebius/soperator

Run Slurm in Kubernetes

137

apache-2.0

ThinkParQ/beegfs

Public repository for the BeeGFS Parallel File System

Qiskit/qiskit-addon-sqd

Sample-based Quantum Diagonalization: Classically postprocess noisy quantum samples to yield more accurate eigenvalue estimations.

apache-2.0

PeriHub/PeriLab.jl

Welcome to Peridynamic Laboratory (PeriLab), a powerful software solution designed for tackling Peridynamic problems.

bsd-3-clause

Last 12 months (absolute gain)

hpcaitech/ColossalAI

Making large AI models cheaper, faster and more accessible

38,933 (+3,234)

apache-2.0

JuliaLang/julia

The Julia Programming Language

46,024 (+2,324)

mit

ProjectPhysX/FluidX3D

The fastest and most memory efficient lattice Boltzmann CFD software, running on all GPUs and CPUs via OpenCL. Free for non-commercial use.

4,104 (+1,248)

zml/zml

Any model. Any hardware. Zero compromise. Built with @ziglang / @openxla / MLIR / @bazelbuild

1,733 (+1,171)

apache-2.0

NVIDIA/cccl

CUDA Core Compute Libraries

1,342 (+907)

volcano-sh/volcano

A Cloud Native Batch System (Project under CNCF)

4,310 (+801)

apache-2.0

spack/spack

A flexible package manager that supports multiple versions, configurations, platforms, and compilers.

4,472 (+711)

flame/blis

BLAS-like Library Instantiation Software Framework

2,341 (+613)

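AdaptiveCpp/AdaptiveCpp

Implementation of SYCL and C++ standard parallelism for CPUs and GPUs from all vendors: The independent, community-driven compiler for C++-based heterogeneous programming models. Lets applications ada...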

1,425 (+526)

bsd-2-clause

trevor-vincent/awesome-high-performance-computing

A curated list of awesome high performance computing resources

708 (+411)

apptainer/apptainer

Apptainer: Application containers for Linux

1,165 (+386)

nndeploy/nndeploy

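nndeploy is an end-to-end model deployment framework. Built on multi-platform inference and model deployment based on directed acyclic graphs, it aims to give users a cross-platform, easy-to-use, high-performance model deployment experience.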

665 (+383)

apache-2.0

nextflow-io/nextflow

A DSL for data-driven computational pipelines

2,798 (+383)

apache-2.0

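Liu-xiandong/How_to_optimize_in_GPU

This is a series of GPU optimization topics. Here we will introduce how to optimize the CUDA kernel in detail. I will introduce several basic kernel optimizations, including: elementwise, reduce, sg...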

861 (+364)

apache-2.0

mfem/mfem

Lightweight, general, scalable C++ library for finite element methods

1,770 (+326)

bsd-3-clause

open-mpi/ompi

Open MPI main development repository

2,207 (+311)

arrayfire/arrayfire

ArrayFire: a general purpose GPU library.

4,591 (+289)

bsd-3-clause

indigo-dc/udocker

A basic user tool to execute simple docker containers in batch or interactive systems without root privileges.

1,389 (+251)

apache-2.0

jfalcou/eve

Expressive Vector Engine - SIMD in C++ Goes Brrrr

982 (+241)

bsl-1.0

pipefunc/pipefunc

Lightweight fast function pipeline (DAG) creation in pure Python for scientific workflows 🕸️🧪

233 (+227)

mit

Last 12 months (relative gain)

pipefunc/pipefunc

Lightweight fast function pipeline (DAG) creation in pure Python for scientific workflows 🕸️🧪

233 (+3,783%)

mit

ThinkParQ/beegfs

Public repository for the BeeGFS Parallel File System

92 (+2,200%)

XFluids/XFluids

A unified cross-architecture heterogeneous CFD solver

26 (+420%)

gpl-3.0

CLAIRE-Labo/python-ml-research-template

A template for starting reproducible Python machine-learning projects with hardware acceleration. Find an example at https://github.com/CLAIRE-Labo/no-representation-no-trust

80 (+371%)

mit

coderonion/awesome-cuda-triton-hpc

🔥🔥🔥 A collection of some awesome public CUDA, cuBLAS, TensorRT, TensorRT-LLM, Triton and High Performance Computing (HPC) projects.

168 (+229%)

cmkobel/CompareM2

🦠📇 Microbial genomes-to-report pipeline

54 (+218%)

gpl-3.0

NVIDIA/cccl

CUDA Core Compute Libraries

1,342 (+209%)

zml/zml

Any model. Any hardware. Zero compromise. Built with @ziglang / @openxla / MLIR / @bazelbuild

1,733 (+208%)

apache-2.0

openhackathons-org/End-to-End-AI-for-Science

This repository contains materials for End-to-End AI for Science

114 (+200%)

apache-2.0

instadeepai/flashbax

⚡ Flashbax: Accelerated Replay Buffers in JAX

217 (+186%)

apache-2.0

openhackathons-org/nways_accelerated_programming

N-Ways to GPU Programming Bootcamp

64 (+146%)

apache-2.0

zhenrong-wang/hpc-now

A Cross-Platform, Multi-Cloud High-Performance Computing Platform

250 (+145%)

mit

Foundations-of-HPC/High-Performance-Computing-2023

Slides, exercises and resources for the 2023-2024 course "High Performance Computing" under the "Scientific and Data-Intensive Computing" Master Program at University of Trieste

31 (+138%)

trevor-vincent/awesome-high-performance-computing

A curated list of awesome high performance computing resources

708 (+138%)

ExtremeFLOW/neko

/ᐠ. ｡.ᐟ\ᵐᵉᵒʷˎˊ˗

178 (+137%)

nndeploy/nndeploy

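nndeploy is an end-to-end model deployment framework. Built on multi-platform inference and model deployment based on directed acyclic graphs, it aims to give users a cross-platform, easy-to-use, high-performance model deployment experience.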

665 (+136%)

apache-2.0

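icl-utk-edu/slate

SLATE is a distributed, GPU-accelerated, dense linear algebra library targeting current and upcoming high-performance computing (HPC) systems. It is developed as part of the U.S. Department of Energy...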

102 (+132%)

bsd-3-clause

ProjectPhysX/OpenCL-Benchmark

A small OpenCL benchmark program to measure peak GPU/CPU performance.

173 (+122%)

CHIP-SPV/chipStar

chipStar is a tool for compiling and running HIP/CUDA on SPIR-V via OpenCL or Level Zero APIs.

231 (+104%)

A-New-BellHope/bellhopcuda

CUDA and C++ port of BELLHOP / BELLHOP3D underwater acoustics simulator

67 (+103%)

gpl-3.0