15 results found

293
2.6k
mit
48
MinHash, LSH, LSH Forest, Weighted MinHash, HyperLogLog, HyperLogLog++, LSH Ensemble and HNSW
Created 2015-03-20
237 commits to master branch, last one 7 months ago
Quickly search, compare, and analyze genomic and metagenomic data sets.
Created 2016-04-09
2,051 commits to latest branch, last one a day ago
JS implementation of probabilistic data structures: Bloom Filter (and its variants), HyperLogLog, Count-Min Sketch, Top-K and MinHash
Created 2017-01-23
224 commits to master branch, last one about a month ago
80
281
mit
10
Locality Sensitive Hashing using MinHash in Python/Cython to detect near duplicate text documents
This repository has been archived
Created 2014-09-03
63 commits to master branch, last one about a year ago
13
153
mit
8
C++ Implementations of sketch data structures with SIMD Parallelism, including Python bindings
Created 2016-11-21
1,275 commits to main branch, last one 3 months ago
18
147
other
22
Sketching algorithms for Clojure (Bloom filter, MinHash, HyperLogLog, Count-Min Sketch)
Created 2013-06-15
31 commits to master branch, last one about a year ago
10
115
unknown
10
Detect and visualize text reuse
This repository has been archived
Created 2017-12-30
280 commits to master branch, last one 2 months ago
24
114
other
11
Weighted MinHash implementation on CUDA (multi-gpu).
Created 2016-10-25
78 commits to master branch, last one 11 months ago
6
70
mit
1
Locality Sensitive Hashing
Created 2021-02-01
139 commits to master branch, last one about a year ago
High-performance MinHash implementation in Rust with Python bindings for efficient similarity estimation and deduplication of large datasets
Created 2024-01-21
22 commits to main branch, last one 3 months ago
Elasticsearch plugin for the b-bit MinHash algorithm
Created 2014-09-21
401 commits to master branch, last one 4 months ago
Union, intersection, and set cardinality in loglog space
Created 2018-05-04
48 commits to master branch, last one about a year ago
Quickly estimate the similarity between many sets
Created 2018-03-11
19 commits to master branch, last one 2 years ago