13 results found Sort:
- Filter by Primary Language:
- Python (3)
- Go (2)
- C++ (2)
- JavaScript (2)
- Java (1)
- Rust (1)
- Scala (1)
- +
Rapid fuzzy string matching in Python using various string metrics
Created
2020-02-29
822 commits to main branch, last one 2 days ago
Finds degree of similarity between two strings, based on Dice's Coefficient, which is mostly better than Levenshtein distance.
This repository has been archived
(exclude archived)
Created
2014-11-23
63 commits to master branch, last one about a year ago
Go metrics for calculating string similarity and other string utility functions
Created
2019-11-14
138 commits to master branch, last one 27 days ago
The Levenshtein Python C extension module contains functions for fast computation of Levenshtein distance and string similarity
Created
2021-05-19
175 commits to main branch, last one 15 days ago
Rapid fuzzy string matching in C++ using the Levenshtein Distance
Created
2020-04-13
414 commits to main branch, last one a day ago
Record Linkage ToolKit (Find and link entities)
Created
2017-02-15
636 commits to master branch, last one 3 years ago
The Levenshtein Python C extension module contains functions for fast computation of Levenshtein distance and string similarity
Created
2022-09-27
20 commits to main branch, last one 21 days ago
Rust edit distance routines accelerated using SIMD. Supports fast Hamming, Levenshtein, restricted Damerau-Levenshtein, etc. distance calculations and string search.
Created
2020-03-23
144 commits to master branch, last one 2 years ago
Lightweight string similarity function for javascript
Created
2018-06-21
45 commits to master branch, last one 3 years ago
A fuzzy matching string distance library for Scala and Java that includes Levenshtein distance, Jaro distance, Jaro-Winkler distance, Dice coefficient, N-Gram similarity, Cosine similarity, Jaccard si...
jaro
ngram
jaccard
soundex
levenshtein
jaro-winkler
hacktoberfest
jaro-distance
fuzzy-matching
dice-coefficient
hamming-distance
cosine-similarity
soundex-algorithm
string-similarity
jaccard-similarity
levenshtein-distance
jaro-winkler-distance
sorensen-dice-distance
cosine-similarity-scores
longest-common-subsequence
Created
2017-03-02
203 commits to master branch, last one 2 years ago
Accelerating the deduplication and collapsing process for reads with Unique Molecular Identifiers (UMI). Heavily optimized for scalability and orders of magnitude faster than a previous tool.
Created
2019-02-25
99 commits to master branch, last one 10 months ago
Python library for fast approximate string matching using Jaro and Jaro-Winkler similarity
Created
2022-01-07
59 commits to main branch, last one about a year ago
Beda is a golang library for detecting how similar a two string
Created
2020-05-28
3 commits to master branch, last one 3 years ago