Trending repositories for topic parallel-computing

Last 3 days (new repositories)

no newly created repositories trending in the last 3 days

Last 3 days (absolute gain)

mfem/mfem

Lightweight, general, scalable C++ library for finite element methods

1,850 (+16)

bsd-3-clause

taskflow/taskflow

A General-purpose Task-parallel Programming System using Modern C++

10,724 (+7)

NVIDIA/cccl

CUDA Core Compute Libraries

1,574 (+6)

ConorWilliams/libfork

A bleeding-edge, lock-free, wait-free, continuation-stealing tasking library built on C++20's coroutines

682 (+4)

mpl-2.0

chapel-lang/chapel

a Productive Parallel Programming Language

1,853 (+4)

joblib/joblib

Computing with Python functions.

4,022 (+4)

bsd-3-clause

parallel101/course

高性能并行编程与优化 - 课件

3,942 (+4)

amilajack/reading

A list of computer-science readings I recommend

3,305 (+3)

mit

OpenNMT/CTranslate2

Fast inference engine for Transformer models

3,708 (+3)

mit

XiaoSong9905/CUDA-Optimization-Guide

Xiao's CUDA Optimization Guide [Active Adding New Contents]

274 (+2)

gpl-3.0

pyper-dev/pyper

Concurrent Python made simple

1,180 (+2)

mit

JuliaSymbolics/Symbolics.jl

Symbolic programming for the next generation of numerical software

1,400 (+2)

kokkos/kokkos

Kokkos C++ Performance Portability Programming Ecosystem: The Programming Model - Parallel Execution and Memory Abstraction

2,151 (+2)

BY571/IQN-and-Extensions

PyTorch Implementation of Implicit Quantile Networks (IQN) for Distributional Reinforcement Learning with additional extensions like PER, Noisy layer, N-step bootstrapping, Dueling architecture and p...

87 (+1)

mit

DLR-AMR/t8code

Parallel algorithms and data structures for tree-based adaptive mesh refinement (AMR) with arbitrary element shapes.

187 (+1)

gpl-2.0

FEniCS/dolfinx

Next generation FEniCS problem solving environment

863 (+1)

lgpl-3.0

futureverse/future

:rocket: R package: future: Unified Parallel and Distributed Processing in R for Everyone

975 (+1)

Last 3 days (relative gain)

BY571/IQN-and-Extensions

87 (+1%)

mit

mfem/mfem

Lightweight, general, scalable C++ library for finite element methods

1,850 (+0.9%)

bsd-3-clause

XiaoSong9905/CUDA-Optimization-Guide

Xiao's CUDA Optimization Guide [Active Adding New Contents]

274 (+0.7%)

gpl-3.0

ConorWilliams/libfork

A bleeding-edge, lock-free, wait-free, continuation-stealing tasking library built on C++20's coroutines

682 (+0.6%)

mpl-2.0

DLR-AMR/t8code

Parallel algorithms and data structures for tree-based adaptive mesh refinement (AMR) with arbitrary element shapes.

187 (+0.5%)

gpl-2.0

NVIDIA/cccl

CUDA Core Compute Libraries

1,574 (+0.4%)

chapel-lang/chapel

a Productive Parallel Programming Language

1,853 (+0.2%)

pyper-dev/pyper

Concurrent Python made simple

1,180 (+0.2%)

mit

JuliaSymbolics/Symbolics.jl

Symbolic programming for the next generation of numerical software

1,400 (+0.1%)

FEniCS/dolfinx

Next generation FEniCS problem solving environment

863 (+0.1%)

lgpl-3.0

futureverse/future

:rocket: R package: future: Unified Parallel and Distributed Processing in R for Everyone

975 (+0.1%)

parallel101/course

高性能并行编程与优化 - 课件

3,942 (+0.1%)

joblib/joblib

Computing with Python functions.

4,022 (+0.1%)

bsd-3-clause

kokkos/kokkos

Kokkos C++ Performance Portability Programming Ecosystem: The Programming Model - Parallel Execution and Memory Abstraction

2,151 (+0.1%)

amilajack/reading

A list of computer-science readings I recommend

3,305 (+0.1%)

mit

OpenNMT/CTranslate2

Fast inference engine for Transformer models

3,708 (+0.1%)

mit

taskflow/taskflow

A General-purpose Task-parallel Programming System using Modern C++

10,724 (+0.1%)

Last week (new repositories)

no newly created repositories trending in the last week

Last week (absolute gain)

taskflow/taskflow

A General-purpose Task-parallel Programming System using Modern C++

10,724 (+24)

mfem/mfem

Lightweight, general, scalable C++ library for finite element methods

1,850 (+24)

bsd-3-clause

NVIDIA/cccl

CUDA Core Compute Libraries

1,574 (+21)

chapel-lang/chapel

a Productive Parallel Programming Language

1,853 (+13)

amilajack/reading

A list of computer-science readings I recommend

3,305 (+13)

mit

OpenNMT/CTranslate2

Fast inference engine for Transformer models

3,708 (+9)

mit

ConorWilliams/libfork

A bleeding-edge, lock-free, wait-free, continuation-stealing tasking library built on C++20's coroutines

682 (+6)

mpl-2.0

joblib/joblib

Computing with Python functions.

4,022 (+6)

bsd-3-clause

parallel101/course

高性能并行编程与优化 - 课件

3,942 (+6)

FEniCS/dolfinx

Next generation FEniCS problem solving environment

863 (+4)

lgpl-3.0

XiaoSong9905/CUDA-Optimization-Guide

Xiao's CUDA Optimization Guide [Active Adding New Contents]

274 (+3)

gpl-3.0

AppiumTestDistribution/AppiumTestDistribution

A tool for running android and iOS appium tests in parallel across devices... U like it STAR it !

1,021 (+3)

mit

pyper-dev/pyper

Concurrent Python made simple

1,180 (+3)

mit

ElmerCSC/elmerfem

Official git repository of Elmer FEM software

1,286 (+3)

Jianqoq/Hpt

A high performance N-dimensional array library for Rust

34 (+2)

apache-2.0

chengzeyi/ParaAttention

https://wavespeed.ai/ Context parallel attention that accelerates DiT model inference with dynamic caching

228 (+2)

bodo-ai/Bodo

High-Performance Python Compute Engine for Data and AI

239 (+2)

apache-2.0

pipefunc/pipefunc

Lightweight fast function pipeline (DAG) creation in pure Python for scientific workflows 🕸️🧪

346 (+2)

mit

SmileiPIC/Smilei

Particle-in-cell code for plasma simulation

368 (+2)

OpenTimer/OpenTimer

A High-performance Timing Analysis Tool for VLSI Systems

599 (+2)

Last week (relative gain)

Jianqoq/Hpt

A high performance N-dimensional array library for Rust

34 (+6%)

apache-2.0

RevolutionAnalytics/foreach

R package to provide foreach looping construct

53 (+2%)

apache-2.0

PhasicFlow/phasicFlow

Parallel, highly efficient code (CPU and GPU) for DEM and CFD-DEM simulations.

63 (+2%)

gpl-3.0

NVIDIA/cccl

CUDA Core Compute Libraries

1,574 (+1%)

mfem/mfem

Lightweight, general, scalable C++ library for finite element methods

1,850 (+1%)

bsd-3-clause

opensbli/opensbli

A framework for the automated derivation and parallel execution of finite difference solvers on a range of computer architectures.

85 (+1%)

gpl-3.0

BY571/IQN-and-Extensions

87 (+1%)

mit

XiaoSong9905/CUDA-Optimization-Guide

Xiao's CUDA Optimization Guide [Active Adding New Contents]

274 (+1%)

gpl-3.0

ConorWilliams/libfork

A bleeding-edge, lock-free, wait-free, continuation-stealing tasking library built on C++20's coroutines

682 (+0.9%)

mpl-2.0

chengzeyi/ParaAttention

https://wavespeed.ai/ Context parallel attention that accelerates DiT model inference with dynamic caching

228 (+0.9%)

bodo-ai/Bodo

High-Performance Python Compute Engine for Data and AI

239 (+0.8%)

apache-2.0

chapel-lang/chapel

a Productive Parallel Programming Language

1,853 (+0.7%)

LLNL/axom

CS infrastructure components for HPC applications

170 (+0.6%)

bsd-3-clause

pipefunc/pipefunc

Lightweight fast function pipeline (DAG) creation in pure Python for scientific workflows 🕸️🧪

346 (+0.6%)

mit

SmileiPIC/Smilei

Particle-in-cell code for plasma simulation

368 (+0.5%)

DLR-AMR/t8code

Parallel algorithms and data structures for tree-based adaptive mesh refinement (AMR) with arbitrary element shapes.

187 (+0.5%)

gpl-2.0

FEniCS/dolfinx

Next generation FEniCS problem solving environment

863 (+0.5%)

lgpl-3.0

owensgroup/RXMesh

GPU-accelerated triangle mesh processing

250 (+0.4%)

bsd-2-clause

amilajack/reading

A list of computer-science readings I recommend

3,305 (+0.4%)

mit

OpenTimer/OpenTimer

A High-performance Timing Analysis Tool for VLSI Systems

599 (+0.3%)

Last month (new repositories)

no newly created repositories trending in the last month

Last month (absolute gain)

taskflow/taskflow

A General-purpose Task-parallel Programming System using Modern C++

10,724 (+121)

OpenNMT/CTranslate2

Fast inference engine for Transformer models

3,708 (+86)

mit

NVIDIA/cccl

CUDA Core Compute Libraries

1,574 (+83)

mfem/mfem

Lightweight, general, scalable C++ library for finite element methods

1,850 (+40)

bsd-3-clause

amilajack/reading

A list of computer-science readings I recommend

3,305 (+39)

mit

chengzeyi/ParaAttention

https://wavespeed.ai/ Context parallel attention that accelerates DiT model inference with dynamic caching

228 (+33)

bodo-ai/Bodo

High-Performance Python Compute Engine for Data and AI

239 (+31)

apache-2.0

joblib/joblib

Computing with Python functions.

4,022 (+31)

bsd-3-clause

parallel101/course

高性能并行编程与优化 - 课件

3,942 (+31)

pyper-dev/pyper

Concurrent Python made simple

1,180 (+30)

mit

kokkos/kokkos

Kokkos C++ Performance Portability Programming Ecosystem: The Programming Model - Parallel Execution and Memory Abstraction

2,151 (+26)

Jianqoq/Hpt

A high performance N-dimensional array library for Rust

34 (+23)

apache-2.0

ConorWilliams/libfork

A bleeding-edge, lock-free, wait-free, continuation-stealing tasking library built on C++20's coroutines

682 (+19)

mpl-2.0

FEniCS/dolfinx

Next generation FEniCS problem solving environment

863 (+19)

lgpl-3.0

ElmerCSC/elmerfem

Official git repository of Elmer FEM software

1,286 (+19)

zwang4/awesome-machine-learning-in-compilers

Must read research papers and links to tools and datasets that are related to using machine learning for compilers and systems optimisation

1,517 (+18)

cc0-1.0

chapel-lang/chapel

a Productive Parallel Programming Language

1,853 (+17)

OSGeo/grass

GRASS - free and open-source geospatial processing engine

901 (+16)

jmcarpenter2/swifter

A package which efficiently applies any function to a pandas dataframe or series in the fastest available manner

2,594 (+14)

mit

OpenTimer/OpenTimer

A High-performance Timing Analysis Tool for VLSI Systems

599 (+13)

Last month (relative gain)

Jianqoq/Hpt

A high performance N-dimensional array library for Rust

34 (+209%)

apache-2.0

PhasicFlow/PhasicFlowPlus

Fluid-particle coupling for multiphase flow based on PhasicFlow and OpenFOAM

39 (+18%)

gpl-3.0

chengzeyi/ParaAttention

https://wavespeed.ai/ Context parallel attention that accelerates DiT model inference with dynamic caching

228 (+17%)

bodo-ai/Bodo

High-Performance Python Compute Engine for Data and AI

239 (+15%)

apache-2.0

PhasicFlow/phasicFlow

Parallel, highly efficient code (CPU and GPU) for DEM and CFD-DEM simulations.

63 (+11%)

gpl-3.0

microsoft/nnscaler

nnScaler: Compiling DNN models for Parallel Training

103 (+6%)

mit

zhandawei/Bayesian_Optimization_Algorithms

standard, high-dimensional, parallel, constrained, and multiobjective Bayesian optimization algorithms

36 (+6%)

gdtk-uq/gdtk

The Gas Dynamics Toolkit (GDTk) is a set of software tools for simulating high speed fluid flow, maintained at The University of Queensland and the University of Southern Queensland, Australia.

73 (+6%)

NVIDIA/cccl

CUDA Core Compute Libraries

1,574 (+6%)

axonn-ai/axonn

A parallel framework for training deep neural networks

57 (+6%)

apache-2.0

opensbli/opensbli

A framework for the automated derivation and parallel execution of finite difference solvers on a range of computer architectures.

85 (+5%)

gpl-3.0

houkensjtu/taichi-fluid

A collection of CFD related resources for Taichi developers.

131 (+5%)

mit

lanl/Fierro

Fierro is a C++ code designed to aid the research and development of numerical methods, testing of user-specified models, and creating multi-scale models related to quasi-static solid mechanics and co...

45 (+5%)

bsd-3-clause

Autodesk/Neon

Multi-GPU Framework for Voxel Grid Computations

48 (+4%)

c1570/Connomore64

Realtime cycle exact emulation of the C64 using multiple microcontrollers in parallel.

26 (+4%)

DLR-AMR/t8code

Parallel algorithms and data structures for tree-based adaptive mesh refinement (AMR) with arbitrary element shapes.

187 (+4%)

gpl-2.0

pipefunc/pipefunc

Lightweight fast function pipeline (DAG) creation in pure Python for scientific workflows 🕸️🧪

346 (+4%)

mit

in03/proxima

Transcode source media directly from DaVinci Resolve using multiple machines for encoding. Great for creating proxies quickly.

59 (+4%)

mit

tiagoantao/python-performance

Repository for the book Fast Python - published by Manning

91 (+3%)

shikokuchuo/mirai

mirai - Minimalist Async Evaluation Framework for R

220 (+3%)

gpl-3.0

Last 12-months (new repositories)

jofpin/turbit

Build applications, scripts, and automations powered by high-performance multicore computing using Node.js

2,862

mit

pyper-dev/pyper

Concurrent Python made simple

1,180

mit

bodo-ai/Bodo

High-Performance Python Compute Engine for Data and AI

239

apache-2.0

chengzeyi/ParaAttention

https://wavespeed.ai/ Context parallel attention that accelerates DiT model inference with dynamic caching

228

microsoft/nnscaler

nnScaler: Compiling DNN models for Parallel Training

103

mit

Jianqoq/Hpt

A high performance N-dimensional array library for Rust

apache-2.0

Pastifier/miniRT

miniRT is the final C project of the 42 Common Core: our very first ray-tracer. Our miniRT focused on optimising CPU-rendered graphics, to achieve a real-time renderer with movement controls and extra...

mit

c1570/Connomore64

Realtime cycle exact emulation of the C64 using multiple microcontrollers in parallel.

gh0stintheshe11/CUDA-Accelerated-AES-Encryption

University of Toronto / ECE1782 - Programming Massively Parallel Multiprocessors and Heterogeneous Systems / Project: an optimized CUDA Implementation of AES 128-bit Encryption, support any file types...

Last 12-months (absolute gain)

jofpin/turbit

Build applications, scripts, and automations powered by high-performance multicore computing using Node.js

2,862 (+2,861)

mit

taskflow/taskflow

A General-purpose Task-parallel Programming System using Modern C++

10,724 (+1,270)

pyper-dev/pyper

Concurrent Python made simple

1,180 (+1,179)

mit

OpenNMT/CTranslate2

Fast inference engine for Transformer models

3,708 (+1,018)

mit

NVIDIA/cccl

CUDA Core Compute Libraries

1,574 (+866)

parallel101/course

高性能并行编程与优化 - 课件

3,942 (+761)

amilajack/reading

A list of computer-science readings I recommend

3,305 (+471)

mit

kokkos/kokkos

Kokkos C++ Performance Portability Programming Ecosystem: The Programming Model - Parallel Execution and Memory Abstraction

2,151 (+447)

joblib/joblib

Computing with Python functions.

4,022 (+396)

bsd-3-clause

mfem/mfem

Lightweight, general, scalable C++ library for finite element methods

1,850 (+339)

bsd-3-clause

pipefunc/pipefunc

Lightweight fast function pipeline (DAG) creation in pure Python for scientific workflows 🕸️🧪

346 (+338)

mit

mtmucha/coros

An easy-to-use and fast library for task-based parallelism, utilizing coroutines.

323 (+264)

bsl-1.0

ConorWilliams/libfork

A bleeding-edge, lock-free, wait-free, continuation-stealing tasking library built on C++20's coroutines

682 (+240)

mpl-2.0

bodo-ai/Bodo

High-Performance Python Compute Engine for Data and AI

239 (+237)

apache-2.0

chengzeyi/ParaAttention

https://wavespeed.ai/ Context parallel attention that accelerates DiT model inference with dynamic caching

228 (+226)

FEniCS/dolfinx

Next generation FEniCS problem solving environment

863 (+223)

lgpl-3.0

zwang4/awesome-machine-learning-in-compilers

Must read research papers and links to tools and datasets that are related to using machine learning for compilers and systems optimisation

1,517 (+211)

cc0-1.0

ElmerCSC/elmerfem

Official git repository of Elmer FEM software

1,286 (+207)

geatpy-dev/geatpy

Evolutionary algorithm toolbox and framework with high performance for Python

2,067 (+178)

lgpl-3.0

mindspore-courses/step_into_llm

MindSpore online courses: Step into LLM

455 (+162)

apache-2.0

Last 12-months (relative gain)

pipefunc/pipefunc

Lightweight fast function pipeline (DAG) creation in pure Python for scientific workflows 🕸️🧪

346 (+4,225%)

mit

microsoft/nnscaler

nnScaler: Compiling DNN models for Parallel Training

103 (+2,475%)

mit

mtmucha/coros

An easy-to-use and fast library for task-based parallelism, utilizing coroutines.

323 (+447%)

bsl-1.0

Pastifier/miniRT

30 (+150%)

mit

PhasicFlow/PhasicFlowPlus

Fluid-particle coupling for multiphase flow based on PhasicFlow and OpenFOAM

39 (+129%)

gpl-3.0

luminousmen/grokking_concurrency

"Grokking Concurrency" book code examples

97 (+126%)

apache-2.0

PhasicFlow/phasicFlow

Parallel, highly efficient code (CPU and GPU) for DEM and CFD-DEM simulations.

63 (+125%)

gpl-3.0

lanl/Fierro

45 (+125%)

bsd-3-clause

NVIDIA/cccl

CUDA Core Compute Libraries

1,574 (+122%)

axonn-ai/axonn

A parallel framework for training deep neural networks

57 (+111%)

apache-2.0

vincentjzy/OpenCorr

Digital Image Correlation & Digital Volume Correlation Library

226 (+100%)

mpl-2.0

JuliaLang/Distributed.jl

Create and control multiple Julia processes remotely for distributed computing. Ships as a Julia stdlib.

36 (+100%)

mit

Foundations-of-HPC/High-Performance-Computing-2023

Slides, exercises and resources for the 2023-2024 course "High Performance Computing" under the "Scientific and Data-Intensive Computing" Naster Program at University of Trieste

34 (+100%)

alugowski/poolSTL

Light and self-contained implementation of C++17 parallel algorithms.

34 (+89%)

bsd-2-clause

DLR-AMR/t8code

Parallel algorithms and data structures for tree-based adaptive mesh refinement (AMR) with arbitrary element shapes.

187 (+87%)

gpl-2.0

tiagoantao/python-performance

Repository for the book Fast Python - published by Manning

91 (+86%)

JackKelly/light-speed-io

Read & decompress many chunks of files at high speed

63 (+85%)

mit

XiaoSong9905/CUDA-Optimization-Guide

Xiao's CUDA Optimization Guide [Active Adding New Contents]

274 (+81%)

gpl-3.0

shikokuchuo/mirai

mirai - Minimalist Async Evaluation Framework for R

220 (+75%)

gpl-3.0

deib-polimi/renoir

Reactive Network of Operators In Rust. Framework for Parallel and distributed computation inspired from the DataFlow model

73 (+70%)

lgpl-3.0