2 results found
High-performance MinHash implementation in Rust with Python bindings for efficient similarity estimation and deduplication of large datasets
Created 2024-01-21
22 commits to main branch, last one 3 months ago
SetSketch: Filling the Gap between MinHash and HyperLogLog
sketch
jaccard
minhash
estimation
hyperloglog
intersection
minwise-hashing
minhash-sketches
sketch-algorithm
cosine-similarity
jaccard-similarity
minhash-similarity
inclusion-exclusion
hyperloglog-sketches
minhash-lsh-algorithm
cardinality-estimation
sketch-data-structures
minwise-hashing-algorithm
locality-sensitive-hashing
jaccard-similarity-estimation
Created 2020-12-29
11 commits to master branch, last one 3 years ago