15 results found Sort:

290
2.4k
mit
48
MinHash, LSH, LSH Forest, Weighted MinHash, HyperLogLog, HyperLogLog++, LSH Ensemble and HNSW
Created 2015-03-20
237 commits to master branch, last one 3 months ago
Quickly search, compare, and analyze genomic and metagenomic data sets.
Created 2016-04-09
1,955 commits to latest branch, last one 3 days ago
JS implementation of probabilistic data structures: Bloom Filter (and its derived), HyperLogLog, Count-Min Sketch, Top-K and MinHash
Created 2017-01-23
223 commits to master branch, last one 17 days ago
77
275
mit
10
Locality Sensitive Hashing using MinHash in Python/Cython to detect near duplicate text documents
Created 2014-09-03
63 commits to master branch, last one about a year ago
14
151
mit
8
C++ Implementations of sketch data structures with SIMD Parallelism, including Python bindings
Created 2016-11-21
1,273 commits to main branch, last one 10 months ago
18
147
other
22
Sketching Algorithms for Clojure (bloom filter, min-hash, hyper-loglog, count-min sketch)
Created 2013-06-15
31 commits to master branch, last one about a year ago
23
112
other
11
Weighted MinHash implementation on CUDA (multi-gpu).
Created 2016-10-25
78 commits to master branch, last one 7 months ago
10
112
unknown
10
Detect and visualize text reuse
Created 2017-12-30
273 commits to master branch, last one 2 years ago
Elasticsearch plugin for b-bit minhash algorism
Created 2014-09-21
401 commits to master branch, last one 12 days ago
Union, intersection, and set cardinality in loglog space
Created 2018-05-04
48 commits to master branch, last one about a year ago
6
52
mit
1
Locality Sensitive Hashing
Created 2021-02-01
139 commits to master branch, last one about a year ago
Quickly estimate the similarity between many sets
Created 2018-03-11
19 commits to master branch, last one 2 years ago
High-performance MinHash implementation in Rust with Python bindings for efficient similarity estimation and deduplication of large datasets
Created 2024-01-21
20 commits to main branch, last one 3 days ago