14 results found Sort:

126
3.0k
mit
24
Rapid fuzzy string matching in Python using various string metrics
Created 2020-02-29
845 commits to main branch, last one 5 days ago
Finds degree of similarity between two strings, based on Dice's Coefficient, which is mostly better than Levenshtein distance.
This repository has been archived (exclude archived)
Created 2014-11-23
63 commits to master branch, last one about a year ago
The Levenshtein Python C extension module contains functions for fast computation of Levenshtein distance and string similarity
Created 2021-05-19
187 commits to main branch, last one 6 days ago
Rapid fuzzy string matching in C++ using the Levenshtein Distance
Created 2020-04-13
420 commits to main branch, last one 2 months ago
The Levenshtein Python C extension module contains functions for fast computation of Levenshtein distance and string similarity
Created 2022-09-27
24 commits to main branch, last one about a month ago
23
110
mit
11
Record Linkage ToolKit (Find and link entities)
Created 2017-02-15
636 commits to master branch, last one 3 years ago
Rust edit distance routines accelerated using SIMD. Supports fast Hamming, Levenshtein, restricted Damerau-Levenshtein, etc. distance calculations and string search.
Created 2020-03-23
144 commits to master branch, last one 2 years ago
Lightweight string similarity function for javascript
Created 2018-06-21
45 commits to master branch, last one 3 years ago
A fuzzy matching string distance library for Scala and Java that includes Levenshtein distance, Jaro distance, Jaro-Winkler distance, Dice coefficient, N-Gram similarity, Cosine similarity, Jaccard si...
Created 2017-03-02
203 commits to master branch, last one 2 years ago
Accelerating the deduplication and collapsing process for reads with Unique Molecular Identifiers (UMI). Heavily optimized for scalability and orders of magnitude faster than a previous tool.
Created 2019-02-25
99 commits to master branch, last one about a year ago
Python library for fast approximate string matching using Jaro and Jaro-Winkler similarity
Created 2022-01-07
59 commits to main branch, last one about a year ago
Beda is a golang library for detecting how similar a two string
Created 2020-05-28
3 commits to master branch, last one 4 years ago
2
34
gpl-3.0
2
Fuzzy string matching in R. Inspired by Python's thefuzz (but without the Python).
Created 2021-02-02
124 commits to main branch, last one 10 months ago