46 results found Sort:
- Filter by Primary Language:
- C++ (23)
- C (11)
- C# (3)
- Go (3)
- Python (1)
- Assembly (1)
- Rust (1)
- HTML (1)
- JavaScript (1)
- Jupyter Notebook (1)
- +
Parsing gigabytes of JSON per second : used by Facebook/Meta Velox, the Node.js runtime, ClickHouse, WatermelonDB, Apache Doris, Milvus, StarRocks
Created
2018-03-23
2,854 commits to master branch, last one 3 days ago
Performance-portable, length-agnostic SIMD with runtime dispatch
Created
2019-09-06
2,705 commits to master branch, last one a day ago
Visual Studio extension for assembly syntax highlighting and code completion in assembly files and the disassembly window
Created
2016-02-19
949 commits to master branch, last one 9 months ago
oneAPI Deep Neural Network Library (oneDNN)
Created
2016-05-09
18,070 commits to main branch, last one 22 hours ago
Implementations of SIMD instruction sets for systems which don't natively support them.
Created
2017-03-28
3,076 commits to master branch, last one a day ago
C++ wrappers for SIMD intrinsics and parallelized, optimized mathematical functions (SSE, AVX, AVX512, NEON, SVE))
Created
2016-02-19
1,707 commits to master branch, last one 5 days ago
C++ image processing and machine learning library with using of SIMD: SSE, AVX, AVX-512, AMX for x86/x64, NEON for ARM.
Created
2015-03-25
3,764 commits to master branch, last one a day ago
Fast, modern C++ DSP framework, FFT, Sample Rate Conversion, FIR/IIR/Biquad Filters (SSE, AVX, AVX-512, ARM NEON)
Created
2016-06-29
1,140 commits to main branch, last one 29 days ago
SIMD Vector Classes for C++
Created
2014-02-25
5,010 commits to 1.4 branch, last one 6 months ago
Portable header-only C++ low level SIMD library
Created
2013-05-08
1,232 commits to master branch, last one 3 months ago
Up to 200x Faster Dot Products & Similarity Metrics — for Python, Rust, C, JS, and Swift, supporting f64, f32, f16 real & complex, i8, and bit vectors using SIMD for both AVX2, AVX-512, NEON, SVE, & S...
Created
2023-03-14
1,163 commits to main branch, last one 3 days ago
World's fastest log analysis: λ + SQL + JSON + S3
Created
2022-03-25
1,264 commits to master branch, last one 11 months ago
Accelerate SHA256 computations in pure Go using AVX512, SHA Extensions for x86 and ARM64 for ARM. On AVX512 it provides an up to 8x improvement (over 3 GB/s per core). SHA Extensions give a performan...
Created
2016-07-16
92 commits to master branch, last one about a year ago
🚀 Fast prime number generator
Created
2013-09-24
4,742 commits to master branch, last one about a month ago
C++ template library for high performance SIMD based sorting algorithms
Created
2022-10-19
582 commits to main branch, last one 23 days ago
Library for specialized dense and sparse matrix operations, and deep learning primitives.
Created
2014-09-23
18,202 commits to main branch, last one a day ago
SIMD Library for Evaluating Elementary Functions, vectorized libm and DFT
Created
2016-01-03
484 commits to master branch, last one a day ago
std::experimental::simd for GCC [ISO/IEC TS 19570:2018]
Created
2019-05-07
2,233 commits to master branch, last one about a year ago
🚀 Fast C/C++ bit population count library
Created
2016-11-28
370 commits to master branch, last one 5 months ago
SIMD (SSE) population count --- http://0x80.pl/articles/sse-popcount.html
Created
2015-04-04
141 commits to master branch, last one 8 months ago
Agenium Scale vectorization library for CPUs and GPUs
Created
2019-04-10
172 commits to master branch, last one 3 years ago
Storage for my snippets, toy programs, etc.
Created
2013-12-03
1,311 commits to master branch, last one 11 hours ago
🚀 Fast prime counting function implementations
Created
2013-06-09
5,348 commits to master branch, last one about a month ago
Open Source Architecture Code Analyzer
Created
2017-03-02
1,033 commits to master branch, last one 25 days ago
Turbo Base64 - Fastest Base64 SIMD:SSE/AVX2/AVX512/Neon/Altivec - Faster than memcpy!
Created
2016-12-17
726 commits to master branch, last one about a year ago
SIMD (SWAR/SSE/SSE4/AVX2/AVX512F/ARM Neon) of Karp-Rabin algorithm's modification
Created
2015-04-05
121 commits to master branch, last one 2 years ago
Examples of C# code compiled to GPU by hybridizer
Created
2017-03-09
300 commits to master branch, last one about a year ago
Accelerate aggregated MD5 hashing performance up to 8x for AVX512 and 4x for AVX2. Useful for server applications that need to compute many MD5 sums in parallel.
Created
2020-04-20
123 commits to master branch, last one 2 years ago
⚡️⚡️⚡️Blazing fast correlation functions on the CPU.
Created
2015-09-30
1,005 commits to master branch, last one 19 days ago