17 results found Sort:

330
4.4k
apache-2.0
48
Performance-portable, length-agnostic SIMD with runtime dispatch
Created 2019-09-06
2,779 commits to master branch, last one 3 days ago
261
2.3k
bsd-3-clause
73
C++ wrappers for SIMD intrinsics and parallelized, optimized mathematical functions (SSE, AVX, AVX512, NEON, SVE))
Created 2016-02-19
1,715 commits to master branch, last one 7 days ago
TensorFlow binaries supporting AVX, FMA, SSE
This repository has been archived (exclude archived)
Created 2017-05-18
78 commits to master branch, last one 5 years ago
151
1.5k
bsd-3-clause
67
SIMD Vector Classes for C++
Created 2014-02-25
5,010 commits to 1.4 branch, last one 8 months ago
68
1.2k
apache-2.0
20
Up to 200x Faster Dot Products & Similarity Metrics — for Python, Rust, C, JS, and Swift, supporting f64, f32, f16 real & complex, i8, and bit vectors using SIMD for both AVX2, AVX-512, NEON, SVE, & S...
Created 2023-03-14
1,189 commits to main branch, last one 10 days ago
206
566
lgpl-3.0
53
The Vector Optimized Library of Kernels
Created 2015-02-02
2,078 commits to main branch, last one 11 days ago
53
497
bsd-3-clause
30
A simple C library for compressing lists of integers using binary packing
Created 2014-02-05
228 commits to master branch, last one about a year ago
A C++ library to compress and intersect sorted lists of integers using SIMD instructions
Created 2013-12-16
268 commits to master branch, last one about a year ago
Agenium Scale vectorization library for CPUs and GPUs
Created 2019-04-10
172 commits to master branch, last one 3 years ago
32
263
apache-2.0
17
High performance algorithms in C#: SIMD/SSE, multi-core and faster
Created 2018-02-19
865 commits to master branch, last one 11 months ago
43
236
apache-2.0
9
TensorFlow binaries supporting AVX, FMA, SSE
Created 2020-02-04
22 commits to main branch, last one 3 months ago
19
120
apache-2.0
8
Fast decoder for VByte-compressed integers
Created 2014-11-19
48 commits to master branch, last one 8 months ago
Faster.Map provides high-performance hashmaps, each with unique features to suit different needs. It’s built to be faster and more efficient than standard Dictionary and ConcurrentDictionary.
Created 2021-11-03
265 commits to main branch, last one 15 days ago
A fast implementation of single-pattern substring search using SIMD acceleration.
Created 2020-07-06
124 commits to master branch, last one 4 months ago
DSP library for signal processing
Created 2020-03-31
820 commits to master branch, last one about a month ago
A few classes for extremely fast json parsing/serializing in modern C++. Possibly the fastest json parser in C++. Possibly the fastest json serializer in C++.
Created 2022-10-19
33 commits to main branch, last one 6 days ago