44 results found Sort:
- Filter by Primary Language:
- C++ (21)
- C (11)
- C# (3)
- Go (3)
- Python (1)
- Assembly (1)
- Rust (1)
- HTML (1)
- JavaScript (1)
- Jupyter Notebook (1)
- +
Parsing gigabytes of JSON per second : used by Facebook/Meta Velox, the Node.js runtime, ClickHouse, WatermelonDB, Apache Doris, Milvus, StarRocks
Created
2018-03-23
2,839 commits to master branch, last one 12 days ago
Performance-portable, length-agnostic SIMD with runtime dispatch
Created
2019-09-06
2,675 commits to master branch, last one 18 hours ago
Visual Studio extension for assembly syntax highlighting and code completion in assembly files and the disassembly window
Created
2016-02-19
949 commits to master branch, last one 8 months ago
oneAPI Deep Neural Network Library (oneDNN)
Created
2016-05-09
17,873 commits to main branch, last one 18 hours ago
Implementations of SIMD instruction sets for systems which don't natively support them.
Created
2017-03-28
3,052 commits to master branch, last one about a month ago
C++ wrappers for SIMD intrinsics and parallelized, optimized mathematical functions (SSE, AVX, AVX512, NEON, SVE))
Created
2016-02-19
1,701 commits to master branch, last one 7 days ago
C++ image processing and machine learning library with using of SIMD: SSE, AVX, AVX-512, AMX for x86/x64, NEON for ARM.
Created
2015-03-25
3,743 commits to master branch, last one 2 days ago
Fast, modern C++ DSP framework, FFT, Sample Rate Conversion, FIR/IIR/Biquad Filters (SSE, AVX, AVX-512, ARM NEON)
Created
2016-06-29
1,137 commits to main branch, last one 10 days ago
SIMD Vector Classes for C++
Created
2014-02-25
5,010 commits to 1.4 branch, last one 5 months ago
Portable header-only C++ low level SIMD library
Created
2013-05-08
1,232 commits to master branch, last one 2 months ago
World's fastest log analysis: λ + SQL + JSON + S3
Created
2022-03-25
1,264 commits to master branch, last one 10 months ago
Accelerate SHA256 computations in pure Go using AVX512, SHA Extensions for x86 and ARM64 for ARM. On AVX512 it provides an up to 8x improvement (over 3 GB/s per core). SHA Extensions give a performan...
Created
2016-07-16
92 commits to master branch, last one about a year ago
Up to 200x Faster Dot Products & Similarity Metrics — for Python, Rust, C, JS, and Swift, supporting f64, f32, f16 real & complex, i8, and bit vectors using SIMD for both AVX2, AVX-512, NEON, SVE, & S...
Created
2023-03-14
1,127 commits to main branch, last one 20 hours ago
🚀 Fast prime number generator
Created
2013-09-24
4,742 commits to master branch, last one 3 days ago
C++ template library for high performance SIMD based sorting algorithms
Created
2022-10-19
579 commits to main branch, last one 8 days ago
Library for specialized dense and sparse matrix operations, and deep learning primitives.
Created
2014-09-23
18,196 commits to main branch, last one 2 days ago
SIMD Library for Evaluating Elementary Functions, vectorized libm and DFT
Created
2016-01-03
479 commits to master branch, last one 9 days ago
std::experimental::simd for GCC [ISO/IEC TS 19570:2018]
Created
2019-05-07
2,233 commits to master branch, last one about a year ago
🚀 Fast C/C++ bit population count library
Created
2016-11-28
370 commits to master branch, last one 4 months ago
Agenium Scale vectorization library for CPUs and GPUs
Created
2019-04-10
172 commits to master branch, last one 3 years ago
SIMD (SSE) population count --- http://0x80.pl/articles/sse-popcount.html
Created
2015-04-04
141 commits to master branch, last one 7 months ago
Storage for my snippets, toy programs, etc.
Created
2013-12-03
1,275 commits to master branch, last one 7 months ago
🚀 Fast prime counting function implementations
Created
2013-06-09
5,348 commits to master branch, last one 3 days ago
Open Source Architecture Code Analyzer
Created
2017-03-02
1,032 commits to master branch, last one about a month ago
Turbo Base64 - Fastest Base64 SIMD:SSE/AVX2/AVX512/Neon/Altivec - Faster than memcpy!
Created
2016-12-17
726 commits to master branch, last one about a year ago
SIMD (SWAR/SSE/SSE4/AVX2/AVX512F/ARM Neon) of Karp-Rabin algorithm's modification
Created
2015-04-05
121 commits to master branch, last one 2 years ago
Examples of C# code compiled to GPU by hybridizer
Created
2017-03-09
300 commits to master branch, last one about a year ago
Accelerate aggregated MD5 hashing performance up to 8x for AVX512 and 4x for AVX2. Useful for server applications that need to compute many MD5 sums in parallel.
Created
2020-04-20
123 commits to master branch, last one 2 years ago
⚡️⚡️⚡️Blazing fast correlation functions on the CPU.
Created
2015-09-30
1,004 commits to master branch, last one 19 days ago