29 results found Sort:

369
2.3k
other
78
BLAS-like Library Instantiation Software Framework
Created 2014-01-22
2,369 commits to master branch, last one about a month ago
376
1.9k
other
53
High-efficiency floating-point neural network inference operators for mobile, server, and Web
Created 2019-09-13
7,187 commits to master branch, last one a day ago
This repository has no description...
Created 2016-08-09
138 commits to master branch, last one 2 years ago
317
1.7k
bsd-2-clause
101
Acceleration package for neural networks on multi-core CPUs
Created 2016-03-21
374 commits to master branch, last one about a year ago
202
1.1k
apache-2.0
58
Tuned OpenCL BLAS
Created 2015-05-30
1,483 commits to master branch, last one 13 days ago
183
850
bsd-3-clause
51
Library for specialized dense and sparse matrix operations, and deep learning primitives.
Created 2014-09-23
18,196 commits to main branch, last one 2 days ago
103
483
unknown
16
BLISlab: A Sandbox for Optimizing GEMM
Created 2016-04-20
176 commits to master branch, last one 5 years ago
High-Performance FP32 Matrix Multiplication on CPU
Created 2024-07-01
80 commits to main branch, last one 4 days ago
15
278
apache-2.0
14
The HPC toolbox: fused matrix multiplication, convolution, data-parallel strided tensor primitives, OpenMP facilities, SIMD, JIT Assembler, CPU detection, state-of-the-art vectorized BLAS for floats a...
Created 2018-10-13
401 commits to master branch, last one 10 months ago
28
232
mit
12
A library and extension that provides objects for scientific computing in PHP.
Created 2018-10-03
315 commits to master branch, last one 4 months ago
151
223
mit
56
Stretching GPU performance for GEMMs and tensor contractions.
Created 2015-11-05
5,537 commits to develop branch, last one a day ago
💥 Fast matrix-multiplication as a self-contained Python library – no system dependencies!
Created 2017-10-15
593 commits to master branch, last one 2 months ago
27
196
bsd-3-clause
21
Distributed Communication-Optimal Matrix-Matrix Multiplication Algorithm
Created 2018-03-01
560 commits to master branch, last one 27 days ago
Sparse matrix formats for linear algebra supporting scientific and machine learning applications
Created 2017-05-16
269 commits to master branch, last one 3 years ago
47
135
gpl-2.0
20
DBCSR: Distributed Block Compressed Sparse Row matrix library
Created 2018-06-05
3,463 commits to develop branch, last one 2 days ago
10
103
other
13
[Experimental] LLVM-accelerated Generic Linear Algebra Subprograms
Created 2016-10-14
190 commits to master branch, last one 2 years ago
29
101
unknown
13
Meta.Numerics is library for advanced numerical computing on the .NET platform. It offers an object-oriented API for statistical analysis, advanced functions, Fourier transforms, numerical integration...
Created 2017-04-04
163 commits to master branch, last one 4 years ago
Python wrapper for Intel Math Kernel Library (MKL) matrix multiplication
Created 2019-12-08
221 commits to release branch, last one 13 days ago
89
63
mit
17
hipBLASLt is a library that provides general matrix-matrix operations with a flexible API and extends functionalities beyond a traditional BLAS library
Created 2022-09-16
1,519 commits to develop branch, last one a day ago
N-dimensional matrix class for Rust
Created 2015-06-22
149 commits to master branch, last one 8 years ago
🧮 alphatensor matrix breakthrough algorithms + simd + rust.
Created 2022-10-10
3 commits to main branch, last one 2 years ago
26
56
gpl-2.0
9
M4RI is a library for fast arithmetic with dense matrices over GF(2)
Created 2014-10-16
809 commits to master branch, last one 27 days ago
Parallel Matrix Multiplication Using OpenMP, Phtreads, and MPI
Created 2014-02-20
2 commits to master branch, last one 2 years ago
6
54
cc0-1.0
6
Cayley hashing as in "Navigating in the Cayley Graph of SL₂(𝔽ₚ)"
Created 2020-11-24
58 commits to master branch, last one 2 years ago
The only library allowing to create Tensors (matrices extension) with custom types
Created 2020-07-12
146 commits to master branch, last one 2 years ago
Decentralized Computing Backend for Artificial Intelligence, Web3, Metaverse, and Gaming Application
Created 2020-06-29
292 commits to master branch, last one about a year ago
The simplest but fast implementation of matrix multiplication in CUDA.
Created 2024-04-05
33 commits to master branch, last one 3 months ago