44 results found

1.0k
19.4k
apache-2.0
241
Parsing gigabytes of JSON per second: used by Facebook/Meta Velox, the Node.js runtime, ClickHouse, WatermelonDB, Apache Doris, Milvus, StarRocks
Created 2018-03-23
2,839 commits to master branch, last one 12 days ago
321
4.2k
apache-2.0
47
Performance-portable, length-agnostic SIMD with runtime dispatch
Created 2019-09-06
2,675 commits to master branch, last one 18 hours ago
98
4.1k
mit
729
Visual Studio extension for assembly syntax highlighting and code completion in assembly files and the disassembly window
Created 2016-02-19
949 commits to master branch, last one 8 months ago
1.0k
3.6k
apache-2.0
181
oneAPI Deep Neural Network Library (oneDNN)
Created 2016-05-09
17,873 commits to main branch, last one 18 hours ago
Implementations of SIMD instruction sets for systems which don't natively support them.
Created 2017-03-28
3,052 commits to master branch, last one about a month ago
258
2.2k
bsd-3-clause
71
C++ wrappers for SIMD intrinsics and parallelized, optimized mathematical functions (SSE, AVX, AVX512, NEON, SVE)
Created 2016-02-19
1,701 commits to master branch, last one 7 days ago
414
2.1k
mit
115
C++ image processing and machine learning library using SIMD: SSE, AVX, AVX-512, AMX for x86/x64, NEON for ARM.
Created 2015-03-25
3,743 commits to master branch, last one 2 days ago
255
1.7k
gpl-2.0
63
Fast, modern C++ DSP framework, FFT, Sample Rate Conversion, FIR/IIR/Biquad Filters (SSE, AVX, AVX-512, ARM NEON)
Created 2016-06-29
1,137 commits to main branch, last one 10 days ago
151
1.5k
bsd-3-clause
66
SIMD Vector Classes for C++
Created 2014-02-25
5,010 commits to 1.4 branch, last one 5 months ago
130
1.2k
bsl-1.0
77
Portable header-only C++ low level SIMD library
Created 2013-05-08
1,232 commits to master branch, last one 2 months ago
42
1.0k
other
23
World's fastest log analysis: λ + SQL + JSON + S3
Created 2022-03-25
1,264 commits to master branch, last one 10 months ago
121
985
apache-2.0
37
Accelerate SHA256 computations in pure Go using AVX512, SHA Extensions for x86 and ARM64 for ARM. On AVX512 it provides an up to 8x improvement (over 3 GB/s per core). SHA Extensions give a performan...
Created 2016-07-16
92 commits to master branch, last one about a year ago
59
985
apache-2.0
18
Up to 200x Faster Dot Products & Similarity Metrics — for Python, Rust, C, JS, and Swift, supporting f64, f32, f16 real & complex, i8, and bit vectors using SIMD for both AVX2, AVX-512, NEON, SVE, & S...
Created 2023-03-14
1,127 commits to main branch, last one 20 hours ago
123
962
bsd-2-clause
46
🚀 Fast prime number generator
Created 2013-09-24
4,742 commits to master branch, last one 3 days ago
58
887
bsd-3-clause
22
C++ template library for high performance SIMD based sorting algorithms
Created 2022-10-19
579 commits to main branch, last one 8 days ago
183
850
bsd-3-clause
51
Library for specialized dense and sparse matrix operations, and deep learning primitives.
Created 2014-09-23
18,196 commits to main branch, last one 2 days ago
132
667
bsl-1.0
34
SIMD Library for Evaluating Elementary Functions, vectorized libm and DFT
Created 2016-01-03
479 commits to master branch, last one 9 days ago
37
579
other
22
std::experimental::simd for GCC [ISO/IEC TS 19570:2018]
Created 2019-05-07
2,233 commits to master branch, last one about a year ago
37
330
bsd-2-clause
23
🚀 Fast C/C++ bit population count library
Created 2016-11-28
370 commits to master branch, last one 4 months ago
Agenium Scale vectorization library for CPUs and GPUs
Created 2019-04-10
172 commits to master branch, last one 3 years ago
48
326
bsd-2-clause
32
SIMD (SSE) population count --- http://0x80.pl/articles/sse-popcount.html
Created 2015-04-04
141 commits to master branch, last one 7 months ago
42
320
bsd-2-clause
28
Storage for my snippets, toy programs, etc.
Created 2013-12-03
1,275 commits to master branch, last one 7 months ago
41
311
bsd-2-clause
23
🚀 Fast prime counting function implementations
Created 2013-06-09
5,348 commits to master branch, last one 3 days ago
20
301
agpl-3.0
25
Open Source Architecture Code Analyzer
Created 2017-03-02
1,032 commits to master branch, last one about a month ago
41
278
gpl-3.0
15
Turbo Base64 - Fastest Base64 SIMD:SSE/AVX2/AVX512/Neon/Altivec - Faster than memcpy!
Created 2016-12-17
726 commits to master branch, last one about a year ago
29
240
bsd-2-clause
24
SIMD (SWAR/SSE/SSE4/AVX2/AVX512F/ARM Neon) implementations of a modified Karp-Rabin algorithm
Created 2015-04-05
121 commits to master branch, last one 2 years ago
Examples of C# code compiled to GPU by hybridizer
Created 2017-03-09
300 commits to master branch, last one about a year ago
18
176
apache-2.0
11
Accelerate aggregated MD5 hashing performance up to 8x for AVX512 and 4x for AVX2. Useful for server applications that need to compute many MD5 sums in parallel.
Created 2020-04-20
123 commits to master branch, last one 2 years ago
⚡️⚡️⚡️ Blazing fast correlation functions on the CPU.
Created 2015-09-30
1,004 commits to master branch, last one 19 days ago
14
157
bsd-2-clause
17
Base64 coding and decoding with SIMD instructions (SSE/AVX2/AVX512F/AVX512BW/AVX512VBMI/ARM Neon)
Created 2016-09-04
259 commits to master branch, last one 7 months ago