46 results found

1.0k
19.5k
apache-2.0
242
Parsing gigabytes of JSON per second: used by Facebook/Meta Velox, the Node.js runtime, ClickHouse, WatermelonDB, Apache Doris, Milvus, StarRocks
Created 2018-03-23
2,854 commits to master branch, last one 3 days ago
322
4.3k
apache-2.0
48
Performance-portable, length-agnostic SIMD with runtime dispatch
Created 2019-09-06
2,705 commits to master branch, last one a day ago
97
4.1k
mit
728
Visual Studio extension for assembly syntax highlighting and code completion in assembly files and the disassembly window
Created 2016-02-19
949 commits to master branch, last one 9 months ago
1.0k
3.7k
apache-2.0
181
oneAPI Deep Neural Network Library (oneDNN)
Created 2016-05-09
18,070 commits to main branch, last one 22 hours ago
Implementations of SIMD instruction sets for systems which don't natively support them.
Created 2017-03-28
3,076 commits to master branch, last one a day ago
260
2.2k
bsd-3-clause
71
C++ wrappers for SIMD intrinsics and parallelized, optimized mathematical functions (SSE, AVX, AVX512, NEON, SVE)
Created 2016-02-19
1,707 commits to master branch, last one 5 days ago
417
2.1k
mit
116
C++ image processing and machine learning library using SIMD: SSE, AVX, AVX-512, AMX for x86/x64, NEON for ARM.
Created 2015-03-25
3,764 commits to master branch, last one a day ago
255
1.7k
gpl-2.0
63
Fast, modern C++ DSP framework, FFT, Sample Rate Conversion, FIR/IIR/Biquad Filters (SSE, AVX, AVX-512, ARM NEON)
Created 2016-06-29
1,140 commits to main branch, last one 29 days ago
151
1.5k
bsd-3-clause
66
SIMD Vector Classes for C++
Created 2014-02-25
5,010 commits to 1.4 branch, last one 6 months ago
129
1.3k
bsl-1.0
77
Portable header-only C++ low level SIMD library
Created 2013-05-08
1,232 commits to master branch, last one 3 months ago
66
1.2k
apache-2.0
20
Up to 200x Faster Dot Products & Similarity Metrics — for Python, Rust, C, JS, and Swift, supporting f64, f32, f16 real & complex, i8, and bit vectors using SIMD for both AVX2, AVX-512, NEON, SVE, & S...
Created 2023-03-14
1,163 commits to main branch, last one 3 days ago
42
1.0k
other
23
World's fastest log analysis: λ + SQL + JSON + S3
Created 2022-03-25
1,264 commits to master branch, last one 11 months ago
121
984
apache-2.0
38
Accelerate SHA256 computations in pure Go using AVX512, SHA Extensions for x86 and ARM64 for ARM. On AVX512 it provides an up to 8x improvement (over 3 GB/s per core). SHA Extensions give a performan...
Created 2016-07-16
92 commits to master branch, last one about a year ago
123
968
bsd-2-clause
46
🚀 Fast prime number generator
Created 2013-09-24
4,742 commits to master branch, last one about a month ago
59
900
bsd-3-clause
22
C++ template library for high performance SIMD based sorting algorithms
Created 2022-10-19
582 commits to main branch, last one 23 days ago
187
853
bsd-3-clause
51
Library for specialized dense and sparse matrix operations, and deep learning primitives.
Created 2014-09-23
18,202 commits to main branch, last one a day ago
136
677
bsl-1.0
34
SIMD Library for Evaluating Elementary Functions, vectorized libm and DFT
Created 2016-01-03
484 commits to master branch, last one a day ago
38
588
other
22
std::experimental::simd for GCC [ISO/IEC TS 19570:2018]
Created 2019-05-07
2,233 commits to master branch, last one about a year ago
38
333
bsd-2-clause
23
🚀 Fast C/C++ bit population count library
Created 2016-11-28
370 commits to master branch, last one 5 months ago
49
330
bsd-2-clause
31
SIMD (SSE) population count --- http://0x80.pl/articles/sse-popcount.html
Created 2015-04-04
141 commits to master branch, last one 8 months ago
Agenium Scale vectorization library for CPUs and GPUs
Created 2019-04-10
172 commits to master branch, last one 3 years ago
42
323
bsd-2-clause
28
Storage for my snippets, toy programs, etc.
Created 2013-12-03
1,311 commits to master branch, last one 11 hours ago
41
313
bsd-2-clause
23
🚀 Fast prime counting function implementations
Created 2013-06-09
5,348 commits to master branch, last one about a month ago
20
308
agpl-3.0
25
Open Source Architecture Code Analyzer
Created 2017-03-02
1,033 commits to master branch, last one 25 days ago
41
281
gpl-3.0
15
Turbo Base64 - Fastest Base64 SIMD:SSE/AVX2/AVX512/Neon/Altivec - Faster than memcpy!
Created 2016-12-17
726 commits to master branch, last one about a year ago
29
243
bsd-2-clause
24
SIMD (SWAR/SSE/SSE4/AVX2/AVX512F/ARM Neon) implementations of a modified Karp-Rabin algorithm
Created 2015-04-05
121 commits to master branch, last one 2 years ago
Examples of C# code compiled to GPU by hybridizer
Created 2017-03-09
300 commits to master branch, last one about a year ago
18
181
apache-2.0
12
Accelerate aggregated MD5 hashing performance up to 8x for AVX512 and 4x for AVX2. Useful for server applications that need to compute many MD5 sums in parallel.
Created 2020-04-20
123 commits to master branch, last one 2 years ago
⚡️⚡️⚡️ Blazing fast correlation functions on the CPU.
Created 2015-09-30
1,005 commits to master branch, last one 19 days ago
14
157
bsd-2-clause
17
Base64 coding and decoding with SIMD instructions (SSE/AVX2/AVX512F/AVX512BW/AVX512VBMI/ARM Neon)
Created 2016-09-04
259 commits to master branch, last one 8 months ago