15 results found

296
2.7k
mit
48
MinHash, LSH, LSH Forest, Weighted MinHash, HyperLogLog, HyperLogLog++, LSH Ensemble and HNSW
Created 2015-03-20
237 commits to master branch, last one 12 months ago
Quickly search, compare, and analyze genomic and metagenomic data sets.
Created 2016-04-09
2,159 commits to latest branch, last one 4 days ago
JS implementation of probabilistic data structures: Bloom Filter (and its derivatives), HyperLogLog, Count-Min Sketch, Top-K and MinHash
Created 2017-01-23
228 commits to master branch, last one 4 months ago
80
285
mit
9
Locality Sensitive Hashing using MinHash in Python/Cython to detect near duplicate text documents
This repository has been archived
Created 2014-09-03
63 commits to master branch, last one about a year ago
13
152
mit
8
C++ Implementations of sketch data structures with SIMD Parallelism, including Python bindings
Created 2016-11-21
1,275 commits to main branch, last one 8 months ago
18
148
other
21
Sketching Algorithms for Clojure (bloom filter, min-hash, hyper-loglog, count-min sketch)
Created 2013-06-15
31 commits to master branch, last one about a year ago
11
118
unknown
9
Detect and visualize text reuse
This repository has been archived
Created 2017-12-30
280 commits to master branch, last one 6 months ago
24
117
other
10
Weighted MinHash implementation on CUDA (multi-gpu).
Created 2016-10-25
78 commits to master branch, last one about a year ago
High-performance MinHash implementation in Rust with Python bindings for efficient similarity estimation and deduplication of large datasets
Created 2024-01-21
49 commits to main branch, last one about a month ago
7
72
mit
1
Locality Sensitive Hashing
Created 2021-02-01
139 commits to master branch, last one about a year ago
Elasticsearch plugin for b-bit minhash algorithm
Created 2014-09-21
401 commits to master branch, last one 9 months ago
Union, intersection, and set cardinality in loglog space
Created 2018-05-04
48 commits to master branch, last one about a year ago
Quickly estimate the similarity between many sets
Created 2018-03-11
19 commits to master branch, last one 3 years ago