13 results found

119
2.7k
mit
25
Rapid fuzzy string matching in Python using various string metrics
Created 2020-02-29
810 commits to main branch, last one 13 days ago
Finds the degree of similarity between two strings based on Dice's coefficient, which often gives better results than Levenshtein distance.
This repository has been archived
Created 2014-11-23
63 commits to master branch, last one about a year ago
21
310
mit
4
Go metrics for calculating string similarity and other string utility functions
Created 2019-11-14
133 commits to master branch, last one about a month ago
16
276
gpl-2.0
5
The Levenshtein Python C extension module contains functions for fast computation of Levenshtein distance and string similarity
Created 2021-05-19
171 commits to main branch, last one 10 days ago
Rapid fuzzy string matching in C++ using the Levenshtein Distance
Created 2020-04-13
406 commits to main branch, last one 13 days ago
23
105
mit
12
Record Linkage ToolKit (Find and link entities)
Created 2017-02-15
636 commits to master branch, last one 3 years ago
Rust edit distance routines accelerated using SIMD. Supports fast Hamming, Levenshtein, restricted Damerau-Levenshtein, etc. distance calculations and string search.
Created 2020-03-23
144 commits to master branch, last one about a year ago
The Levenshtein Python C extension module contains functions for fast computation of Levenshtein distance and string similarity
Created 2022-09-27
19 commits to main branch, last one 10 days ago
Lightweight string similarity function for JavaScript
Created 2018-06-21
45 commits to master branch, last one 3 years ago
A fuzzy matching string distance library for Scala and Java that includes Levenshtein distance, Jaro distance, Jaro-Winkler distance, Dice coefficient, N-Gram similarity, Cosine similarity, Jaccard si...
Created 2017-03-02
203 commits to master branch, last one 2 years ago
Python library for fast approximate string matching using Jaro and Jaro-Winkler similarity
Created 2022-01-07
59 commits to main branch, last one 10 months ago
Accelerating the deduplication and collapsing process for reads with Unique Molecular Identifiers (UMI). Heavily optimized for scalability and orders of magnitude faster than a previous tool.
Created 2019-02-25
99 commits to master branch, last one 7 months ago
Beda is a golang library for detecting how similar two strings are
Created 2020-05-28
3 commits to master branch, last one 3 years ago