13 results found Sort:

Fast, accurate and scalable probabilistic data linkage with support for multiple SQL backends
Created 2019-11-22
9,271 commits to master branch, last one 7 days ago
156
997
bsd-3-clause
32
A powerful and modular toolkit for record linkage and duplicate detection in Python
Created 2015-10-18
912 commits to master branch, last one about a year ago
A list of free data matching and record linkage software.
Created 2018-01-01
53 commits to master branch, last one about a year ago
Record linking package that fuzzy matches two Python pandas dataframes using sqlite3 fts4
Created 2017-11-25
144 commits to master branch, last one 2 years ago
22
189
other
9
🔎 Finds fuzzy matches between CSV files
Created 2015-12-08
145 commits to master branch, last one 18 days ago
PyTorch library for transforming entities like companies, products, etc. into vectors to support scalable Record Linkage / Entity Resolution using Approximate Nearest Neighbors.
Created 2021-01-28
373 commits to main branch, last one 2 years ago
Resources for tackling record linkage / deduplication / data matching problems
Created 2017-11-04
29 commits to master branch, last one 2 years ago
9
96
gpl-3.0
6
Link Wikidata items to large catalogs
Created 2018-07-11
2,087 commits to master branch, last one 3 years ago
12
76
apache-2.0
4
An open-source library that leverages Python’s data science ecosystem to build powerful end-to-end Entity Resolution workflows.
Created 2022-07-04
338 commits to main branch, last one 12 days ago
2
57
apache-2.0
7
Curated list of awesome software and resources for Senzing, The First Real-Time AI for Entity Resolution.
Created 2018-11-05
146 commits to main branch, last one 3 months ago
A browser user interface for manual labeling of record pairs.
Created 2019-11-02
42 commits to master branch, last one 2 years ago
Welcome to Snowman App – a Data Matching Benchmark Platform.
Created 2021-02-19
1,760 commits to main branch, last one 2 years ago
2
34
gpl-3.0
2
Fuzzy string matching in R. Inspired by Python's thefuzz (but without the Python).
Created 2021-02-02
124 commits to main branch, last one 10 months ago