43 results found

977
18.6k
apache-2.0
239
Parsing gigabytes of JSON per second: used by Facebook/Meta Velox, the Node.js runtime, ClickHouse, WatermelonDB, Apache Doris, Milvus, StarRocks
Created 2018-03-23
2,779 commits to master branch, last one a day ago
94
4.1k
mit
730
Visual Studio extension for assembly syntax highlighting and code completion in assembly files and the disassembly window
Created 2016-02-19
949 commits to master branch, last one 2 months ago
295
3.7k
apache-2.0
44
Performance-portable, length-agnostic SIMD with runtime dispatch
Created 2019-09-06
2,533 commits to master branch, last one 16 hours ago
957
3.5k
apache-2.0
184
oneAPI Deep Neural Network Library (oneDNN)
Created 2016-05-09
16,418 commits to main branch, last one 12 hours ago
Implementations of SIMD instruction sets for systems which don't natively support them.
Created 2017-03-28
3,017 commits to master branch, last one 8 days ago
247
2.1k
bsd-3-clause
71
C++ wrappers for SIMD intrinsics and parallelized, optimized mathematical functions (SSE, AVX, AVX512, NEON, SVE)
Created 2016-02-19
1,679 commits to master branch, last one 3 days ago
406
2.0k
mit
115
C++ image processing and machine learning library using SIMD: SSE, AVX, AVX-512, AMX for x86/x64, VMX (AltiVec) and VSX (Power7) for PowerPC, NEON for ARM.
Created 2015-03-25
3,608 commits to master branch, last one 16 hours ago
247
1.6k
gpl-2.0
65
Fast, modern C++ DSP framework, FFT, Sample Rate Conversion, FIR/IIR/Biquad Filters (SSE, AVX, AVX-512, ARM NEON)
Created 2016-06-29
1,123 commits to main branch, last one 24 days ago
153
1.4k
bsd-3-clause
67
SIMD Vector Classes for C++
Created 2014-02-25
5,008 commits to 1.4 branch, last one 3 months ago
132
1.2k
bsl-1.0
76
Portable header-only C++ low level SIMD library
Created 2013-05-08
1,133 commits to master branch, last one about a year ago
40
975
other
22
World's fastest log analysis: λ + SQL + JSON + S3
Created 2022-03-25
1,264 commits to master branch, last one 4 months ago
117
946
apache-2.0
37
Accelerate SHA256 computations in pure Go using AVX512, SHA Extensions for x86 and ARM64 for ARM. On AVX512 it provides an up to 8x improvement (over 3 GB/s per core). SHA Extensions give a performan...
Created 2016-07-16
92 commits to master branch, last one about a year ago
121
916
bsd-2-clause
47
🚀 Fast prime number generator
Created 2013-09-24
4,705 commits to master branch, last one about a month ago
181
813
bsd-3-clause
49
Library for specialized dense and sparse matrix operations, and deep learning primitives.
Created 2014-09-23
18,172 commits to main branch, last one a day ago
48
805
bsd-3-clause
22
C++ template library for high performance SIMD based sorting algorithms
Created 2022-10-19
544 commits to main branch, last one a day ago
39
757
apache-2.0
15
Up to 200x Faster Inner Products and Vector Similarity — for Python, JavaScript, Rust, C, and Swift, supporting f64, f32, f16 real & complex, i8, and binary vectors using SIMD for both x86 AVX2 & AVX-...
Created 2023-03-14
612 commits to main branch, last one a day ago
PygmalionAI's large-scale inference engine
Created 2023-06-23
632 commits to main branch, last one 15 days ago
125
598
bsl-1.0
34
SIMD Library for Evaluating Elementary Functions, vectorized libm and DFT
Created 2016-01-03
418 commits to master branch, last one about a month ago
37
556
other
22
std::experimental::simd for GCC [ISO/IEC TS 19570:2018]
Created 2019-05-07
2,233 commits to master branch, last one about a year ago
Agenium Scale vectorization library for CPUs and GPUs
Created 2019-04-10
172 commits to master branch, last one 2 years ago
48
313
bsd-2-clause
30
SIMD (SSE) population count --- http://0x80.pl/articles/sse-popcount.html
Created 2015-04-04
141 commits to master branch, last one 2 months ago
37
311
bsd-2-clause
28
Storage for my snippets, toy programs, etc.
Created 2013-12-03
1,275 commits to master branch, last one about a month ago
41
305
bsd-2-clause
23
🚀 Fast prime counting function implementations
Created 2013-06-09
5,259 commits to master branch, last one about a month ago
36
299
bsd-2-clause
23
🚀 Fast C/C++ bit population count library
Created 2016-11-28
318 commits to master branch, last one 2 months ago
15
275
agpl-3.0
22
Open Source Architecture Code Analyzer
Created 2017-03-02
1,003 commits to master branch, last one 28 days ago
37
254
gpl-3.0
14
Turbo Base64 - Fastest Base64 SIMD: SSE/AVX2/AVX512/Neon/Altivec - Faster than memcpy!
Created 2016-12-17
726 commits to master branch, last one 9 months ago
Examples of C# code compiled to GPU by hybridizer
Created 2017-03-09
300 commits to master branch, last one 7 months ago
27
230
bsd-2-clause
23
SIMD (SWAR/SSE/SSE4/AVX2/AVX512F/ARM Neon) implementations of a modified Karp-Rabin algorithm
Created 2015-04-05
121 commits to master branch, last one 2 years ago
18
168
apache-2.0
10
Accelerate aggregated MD5 hashing performance up to 8x for AVX512 and 4x for AVX2. Useful for server applications that need to compute many MD5 sums in parallel.
Created 2020-04-20
123 commits to master branch, last one about a year ago
⚡️⚡️⚡️ Blazing fast correlation functions on the CPU.
Created 2015-09-30
995 commits to master branch, last one 2 months ago