15 results found
- Primary language:
- Python (5)
- Java (3)
- C++ (3)
- TypeScript (1)
- JavaScript (1)
- Clojure (1)
- Rust (1)
MinHash, LSH, LSH Forest, Weighted MinHash, HyperLogLog, HyperLogLog++, LSH Ensemble and HNSW
Created 2015-03-20
237 commits to master branch, last one 7 months ago
Quickly search, compare, and analyze genomic and metagenomic data sets.
Created 2016-04-09
2,051 commits to latest branch, last one a day ago
JS implementation of probabilistic data structures: Bloom Filter (and its derived), HyperLogLog, Count-Min Sketch, Top-K and MinHash
Created 2017-01-23
224 commits to master branch, last one about a month ago
Locality Sensitive Hashing using MinHash in Python/Cython to detect near duplicate text documents
This repository has been archived
Created 2014-09-03
63 commits to master branch, last one about a year ago
C++ Implementations of sketch data structures with SIMD Parallelism, including Python bindings
Created 2016-11-21
1,275 commits to main branch, last one 3 months ago
Sketching Algorithms for Clojure (bloom filter, min-hash, hyper-loglog, count-min sketch)
Created 2013-06-15
31 commits to master branch, last one about a year ago
Detect and visualize text reuse
This repository has been archived
Created 2017-12-30
280 commits to master branch, last one 2 months ago
Weighted MinHash implementation on CUDA (multi-gpu).
Created 2016-10-25
78 commits to master branch, last one 11 months ago
Dynatrace hash library for Java
Created 2022-01-26
769 commits to main branch, last one 15 days ago
Locality Sensitive Hashing
Created 2021-02-01
139 commits to master branch, last one about a year ago
High-performance MinHash implementation in Rust with Python bindings for efficient similarity estimation and deduplication of large datasets
Created 2024-01-21
22 commits to main branch, last one 3 months ago
Elasticsearch plugin for b-bit minhash algorithm
Created 2014-09-21
401 commits to master branch, last one 4 months ago
Union, intersection, and set cardinality in loglog space
Created 2018-05-04
48 commits to master branch, last one about a year ago
Quickly estimate the similarity between many sets
Created 2018-03-11
19 commits to master branch, last one 2 years ago
SetSketch: Filling the Gap between MinHash and HyperLogLog
sketch, jaccard, minhash, estimation, hyperloglog, intersection, minwise-hashing, minhash-sketches, sketch-algorithm, cosine-similarity, jaccard-similarity, minhash-similarity, inclusion-exclusion, hyperloglog-sketches, minhash-lsh-algorithm, cardinality-estimation, sketch-data-structures, minwise-hashing-algorithm, locality-sensitive-hashing, jaccard-similarity-estimation
Created 2020-12-29
11 commits to master branch, last one 3 years ago