Search Results - RepositoryStats

1.0k

19.4k

apache-2.0

241

Parsing gigabytes of JSON per second : used by Facebook/Meta Velox, the Node.js runtime, ClickHouse, WatermelonDB, Apache Doris, Milvus, StarRocks

arm x64 avx2 json neon simd arm64 clang cpp11 sse42 avx512 vs2019 aarch64 clang-cl loongarch c-plus-plus json-parser gcc-compiler json-pointer

Created 2018-03-23

2,839 commits to master branch, last one 12 days ago

highway google

321

4.2k

apache-2.0

47

Performance-portable, length-agnostic SIMD with runtime dispatch

avx avx2 neon simd wasm sse42 avx512 avx-512 intrinsics simd-library simd-intrinsics avx-instructions simd-parallelism simd-programming simd-instructions

Created 2019-09-06

2,675 commits to master branch, last one 18 hours ago

asm-dude HJLebbink

98

4.1k

mit

729

Visual Studio extension for assembly syntax highlighting and code completion in assembly files and the disassembly window

avx2 masm nasm avx512 x86-64 assembly assembler disassembly visual-studio code-completion syntax-highlighting visual-studio-extension assembly-language-programming

Created 2016-02-19

949 commits to master branch, last one 8 months ago

oneDNN oneapi-src

1.0k

3.6k

apache-2.0

181

oneAPI Deep Neural Network Library (oneDNN)

amx cpp tbb x64 sycl vnni avx512 oneapi onednn openmp x86-64 aarch64 library bfloat16 performance deep-learning xe-architecture deep-neural-networks

Created 2016-05-09

17,873 commits to main branch, last one 18 hours ago

simde simd-everywhere

253

2.4k

mit

52

Implementations of SIMD instruction sets for systems which don't natively support them.

Created 2017-03-28

3,052 commits to master branch, last one about a month ago

xsimd xtensor-stack

258

2.2k

bsd-3-clause

71

C++ wrappers for SIMD intrinsics and parallelized, optimized mathematical functions (SSE, AVX, AVX512, NEON, SVE))

avx cpp sse sve neon simd avx512 vectorization c-plus-plus-11 simd-intrinsics simd-instructions mathematical-functions

Created 2016-02-19

1,701 commits to master branch, last one 7 days ago

Simd ermig1979

414

2.1k

mit

115

C++ image processing and machine learning library with using of SIMD: SSE, AVX, AVX-512, AMX for x86/x64, NEON for ARM.

amx arm avx lbp sse neon simd avx512 c-plus-plus haar-cascade simd-library neural-network image-processing machine-learning

Created 2015-03-25

3,743 commits to master branch, last one 2 days ago

kfr kfrlib

255

1.7k

gpl-2.0

63

Fast, modern C++ DSP framework, FFT, Sample Rate Conversion, FIR/IIR/Biquad Filters (SSE, AVX, AVX-512, ARM NEON)

avx cxx dft dsp fft simd audio clang cpp14 cpp17 avx512 cplusplus header-only cplusplus-14 cplusplus-17 audio-processing fast-fourier-transform digital-signal-processing discrete-fourier-transform

Created 2016-06-29

1,137 commits to main branch, last one 10 days ago

Vc VcDevel

151

1.5k

bsd-3-clause

66

SIMD Vector Classes for C++

avx cpp sse avx2 neon simd cpp11 cpp14 cpp17 avx512 parallel portable c-plus-plus simd-vector data-parallel vectorization simd-programming simd-instructions parallel-computing

Created 2014-02-25

5,010 commits to 1.4 branch, last one 5 months ago

libsimdpp p12tic

130

1.2k

bsl-1.0

77

Portable header-only C++ low level SIMD library

msa sse vsx avx2 neon simd avx512 altivec

Created 2013-05-08

1,232 commits to master branch, last one 2 months ago

sneller SnellerInc

42

1.0k

other

23

World's fastest log analysis: λ + SQL + JSON + S3

go s3 log sql json simd avx512 indexless schemaless serverless vectorized query-engine high-performance

Created 2022-03-25

1,264 commits to master branch, last one 10 months ago

sha256-simd minio

121

985

apache-2.0

37

Accelerate SHA256 computations in pure Go using AVX512, SHA Extensions for x86 and ARM64 for ARM. On AVX512 it provides an up to 8x improvement (over 3 GB/s per core). SHA Extensions give a performan...

arm avx intel plan9 avx512 golang assembly avx-instructions

Created 2016-07-16

92 commits to master branch, last one about a year ago

SimSIMD ashvardanian

59

985

apache-2.0

18

Up to 200x Faster Dot Products & Similarity Metrics — for Python, Rust, C, JS, and Swift, supporting f64, f32, f16 real & complex, i8, and bit vectors using SIMD for both AVX2, AVX-512, NEON, SVE, & S...

Created 2023-03-14

1,127 commits to main branch, last one 20 hours ago

primesieve kimwalisch

123

962

bsd-2-clause

46

🚀 Fast prime number generator

sse math neon sieve avx512 primes eratosthenes prime-numbers sieve-of-eratosthenes

Created 2013-09-24

4,742 commits to master branch, last one 3 days ago

x86-simd-sort intel

58

887

bsd-3-clause

22

C++ template library for high performance SIMD based sorting algorithms

x86 avx2 sort avx512 argsort quicksort partialsort quickselect

Created 2022-10-19

579 commits to main branch, last one 8 days ago

libxsmm libxsmm

183

850

bsd-3-clause

51

Library for specialized dense and sparse matrix operations, and deep learning primitives.

amx avx jit sse avx2 blas simd intel avx512 matrix sparse tensor vector fortran bfloat16 transpose convolution machine-learning matrix-multiplication

Created 2014-09-23

18,196 commits to main branch, last one 2 days ago

sleef shibatch

132

667

bsl-1.0

34

SIMD Library for Evaluating Elementary Functions, vectorized libm and DFT

Created 2016-01-03

479 commits to master branch, last one 9 days ago

std-simd VcDevel

37

579

other

22

std::experimental::simd for GCC [ISO/IEC TS 19570:2018]

avx gcc sse neon simd wg21 cpp17 avx512 libstdcxx

Created 2019-05-07

2,233 commits to master branch, last one about a year ago

libpopcnt kimwalisch

37

330

bsd-2-clause

23

🚀 Fast C/C++ bit population count library

c cpp sve avx2 neon simd avx512 popcnt popcount

Created 2016-11-28

370 commits to master branch, last one 4 months ago

nsimd agenium-scale

28

328

mit

26

Agenium Scale vectorization library for CPUs and GPUs

avx hpc sve avx2 cuda neon rocm simd sse2 cpp20 sse42 avx512 aarch64 neon128 simd-library cpp20-library simd-programming simd-instructions vectorization-library

Created 2019-04-10

172 commits to master branch, last one 3 years ago

sse-popcount WojciechMula

48

326

bsd-2-clause

32

SIMD (SSE) population count --- http://0x80.pl/articles/sse-popcount.html

sse avx2 avx512 aarch64 arm-neon popcount

Created 2015-04-04

141 commits to master branch, last one 7 months ago

toys WojciechMula

42

320

bsd-2-clause

28

Storage for my snippets, toy programs, etc.

sse avx2 avx512 string-algorithms

Created 2013-12-03

1,275 commits to master branch, last one 7 months ago

primecount kimwalisch

41

311

bsd-2-clause

23

🚀 Fast prime counting function implementations

math avx512 openmp primes arm-sve primepi number-theory prime-numbers

Created 2013-06-09

5,348 commits to master branch, last one 3 days ago

OSACA RRZE-HPC

20

301

agpl-3.0

25

Open Source Architecture Code Analyzer

Created 2017-03-02

1,032 commits to master branch, last one about a month ago

Turbo-Base64 powturbo

41

278

gpl-3.0

15

Turbo Base64 - Fastest Base64 SIMD:SSE/AVX2/AVX512/Neon/Altivec - Faster than memcpy!

arm avx sse avx2 neon simd avx512 base64 library encoding benchmark base64-decoding base64-encoding encoding-library

Created 2016-12-17

726 commits to master branch, last one about a year ago

sse4-strstr WojciechMula

29

240

bsd-2-clause

24

SIMD (SWAR/SSE/SSE4/AVX2/AVX512F/ARM Neon) of Karp-Rabin algorithm's modification

sse avx2 neon avx512 string-manipulation

Created 2015-04-05

121 commits to master branch, last one 2 years ago

hybridizer-basic-samples altimesh

32

237

mit

23

Examples of C# code compiled to GPU by hybridizer

avx gpu avx2 cuda avx512 dotnet compiler parallel optimization vectorization visual-studio hybridizer-essentials

Created 2017-03-09

300 commits to master branch, last one about a year ago

md5-simd minio

18

176

apache-2.0

11

Accelerate aggregated MD5 hashing performance up to 8x for AVX512 and 4x for AVX2. Useful for server applications that need to compute many MD5 sums in parallel.

md5 avx2 simd avx512 golang hashing assembly performance

Created 2020-04-20

123 commits to master branch, last one 2 years ago

Corrfunc manodeep

53

167

mit

11

⚡️⚡️⚡️Blazing fast correlation functions on the CPU.

c avx avx2 simd sse42 avx512 openmp python galaxies cosmology intrinsics astrophysics pair-counting correlation-functions large-scale-structure

Created 2015-09-30

1,004 commits to master branch, last one 19 days ago

base64simd WojciechMula

14

157

bsd-2-clause

17

Base64 coding and decoding with SIMD instructions (SSE/AVX2/AVX512F/AVX512BW/AVX512VBMI/ARM Neon)

sse avx2 neon simd avx512 base64

Created 2016-09-04

259 commits to master branch, last one 7 months ago