15 results found Sort:

297
2.6k
mit
49
MinHash, LSH, LSH Forest, Weighted MinHash, HyperLogLog, HyperLogLog++, LSH Ensemble and HNSW
Created 2015-03-20
237 commits to master branch, last one 11 months ago
Quickly search, compare, and analyze genomic and metagenomic data sets.
Created 2016-04-09
2,137 commits to latest branch, last one a day ago
JS implementation of probabilistic data structures: Bloom Filter (and its derived), HyperLogLog, Count-Min Sketch, Top-K and MinHash
Created 2017-01-23
228 commits to master branch, last one 3 months ago
80
284
mit
10
Locality Sensitive Hashing using MinHash in Python/Cython to detect near duplicate text documents
This repository has been archived (exclude archived)
Created 2014-09-03
63 commits to master branch, last one about a year ago
13
153
mit
8
C++ Implementations of sketch data structures with SIMD Parallelism, including Python bindings
Created 2016-11-21
1,275 commits to main branch, last one 7 months ago
18
148
other
22
Sketching Algorithms for Clojure (bloom filter, min-hash, hyper-loglog, count-min sketch)
Created 2013-06-15
31 commits to master branch, last one about a year ago
24
118
other
11
Weighted MinHash implementation on CUDA (multi-gpu).
Created 2016-10-25
78 commits to master branch, last one about a year ago
11
118
unknown
10
Detect and visualize text reuse
This repository has been archived (exclude archived)
Created 2017-12-30
280 commits to master branch, last one 5 months ago
High-performance MinHash implementation in Rust with Python bindings for efficient similarity estimation and deduplication of large datasets
Created 2024-01-21
49 commits to main branch, last one 4 days ago
7
70
mit
1
Locality Sensitive Hashing
Created 2021-02-01
139 commits to master branch, last one about a year ago
Elasticsearch plugin for b-bit minhash algorism
Created 2014-09-21
401 commits to master branch, last one 8 months ago
Union, intersection, and set cardinality in loglog space
Created 2018-05-04
48 commits to master branch, last one about a year ago
Quickly estimate the similarity between many sets
Created 2018-03-11
19 commits to master branch, last one 3 years ago