8 results found

290
2.4k
mit
48
MinHash, LSH, LSH Forest, Weighted MinHash, HyperLogLog, HyperLogLog++, LSH Ensemble and HNSW
Created 2015-03-20
237 commits to master branch, last one 3 months ago
19
289
mit
4
Golang metrics for calculating string similarity and other string utility functions
Created 2019-11-14
128 commits to master branch, last one 23 days ago
23
208
bsd-3-clause
5
Compare HTML similarity using structural and style metrics
Created 2017-10-26
74 commits to master branch, last one 3 years ago
A package to compute medical segmentation metrics.
Created 2020-06-17
175 commits to master branch, last one 10 days ago
Tika-Similarity uses the Tika-Python package (a Python port of Apache Tika) to compute file similarity based on metadata features.
Created 2014-12-13
292 commits to master branch, last one about a year ago
A fuzzy matching string distance library for Scala and Java that includes Levenshtein distance, Jaro distance, Jaro-Winkler distance, Dice coefficient, N-Gram similarity, Cosine similarity, Jaccard si...
Created 2017-03-02
203 commits to master branch, last one 2 years ago
Spark functions to run popular phonetic and string matching algorithms
Created 2017-09-05
45 commits to main branch, last one 2 years ago