34 results found Sort:

558
4.3k
mit
119
:id: A python library for accurate and scalable fuzzy matching, record deduplication and entity-resolution.
Created 2012-04-20
3,332 commits to main branch, last one 5 months ago
Fast, accurate and scalable probabilistic data linkage with support for multiple SQL backends
Created 2019-11-22
9,271 commits to master branch, last one 3 days ago
125
1.0k
agpl-3.0
16
Scalable identity resolution, entity resolution, data mastering and deduplication using ML
Created 2021-08-25
2,601 commits to main branch, last one 13 hours ago
156
994
bsd-3-clause
32
A powerful and modular toolkit for record linkage and duplicate detection in Python
Created 2015-10-18
912 commits to master branch, last one about a year ago
138
909
other
21
1 line for thousands of State of The Art NLP models in hundreds of languages The fastest and most accurate way to solve text problems.
Created 2020-08-22
1,540 commits to master branch, last one 4 months ago
Insightful Tutorials and Papers about Knowledge Graphs
Created 2019-03-10
1,074 commits to master branch, last one 14 hours ago
:id: Examples for using the dedupe library
Created 2014-04-02
1,356 commits to main branch, last one 8 months ago
A list of free data matching and record linkage software.
Created 2018-01-01
53 commits to master branch, last one about a year ago
Recent trends of Entity Linking, Disambiguation, and Representation.
Created 2019-02-18
65 commits to master branch, last one 3 years ago
This repository contains code and datasets related to entity/knowledge papers from the VERT (Versatile Entity Recognition & disambiguation Toolkit) project, by the Knowledge Computing group at Microso...
Created 2019-07-25
103 commits to master branch, last one about a year ago
47
218
apache-2.0
26
An open source, high scalability toolkit in Java for Entity Resolution.
Created 2016-03-16
650 commits to master branch, last one 12 months ago
44
212
apache-2.0
18
ReFinED is an efficient and accurate entity linking (EL) system.
Created 2022-05-03
21 commits to main branch, last one 3 months ago
22
189
other
9
🔎 Finds fuzzy matches between CSV files
Created 2015-12-08
145 commits to master branch, last one 14 days ago
28
159
apache-2.0
9
Entity resolution for Elasticsearch.
Created 2018-02-11
278 commits to main branch, last one 2 months ago
PyTorch library for transforming entities like companies, products, etc. into vectors to support scalable Record Linkage / Entity Resolution using Approximate Nearest Neighbors.
Created 2021-01-28
373 commits to main branch, last one 2 years ago
How to construct knowledge graphs from unstructured data sources
Created 2024-07-31
18 commits to main branch, last one 6 months ago
Resources for tackling record linkage / deduplication / data matching problems
Created 2017-11-04
29 commits to master branch, last one 2 years ago
A convenient way to link, deduplicate, aggregate and cluster data(frames) in Python using deep learning
Created 2023-08-01
130 commits to main branch, last one 5 days ago
OpenRefine reconciliation services for VIAF, ORCID, and Open Library + framework for creating more.
Created 2016-10-25
247 commits to main branch, last one about a month ago
23
110
mit
11
Record Linkage ToolKit (Find and link entities)
Created 2017-02-15
636 commits to master branch, last one 3 years ago
9
96
gpl-3.0
6
Link Wikidata items to large catalogs
Created 2018-07-11
2,087 commits to master branch, last one 3 years ago
Python package for deduplication/entity resolution using active learning
Created 2021-04-13
294 commits to main branch, last one 7 months ago
11
76
apache-2.0
4
An open-source library that leverages Python’s data science ecosystem to build powerful end-to-end Entity Resolution workflows.
Created 2022-07-04
338 commits to main branch, last one 9 days ago
8
65
apache-2.0
7
Python implementation of anonymous linkage using cryptographic linkage keys
Created 2017-05-30
470 commits to main branch, last one about a year ago
19
64
gpl-3.0
6
SparkER: an Entity Resolution framework for Apache Spark
Created 2017-02-10
80 commits to master branch, last one about a year ago
List of entity resolution software and resources.
Created 2023-10-23
9 commits to main branch, last one about a month ago
2
57
apache-2.0
7
Curated list of awesome software and resources for Senzing, The First Real-Time AI for Entity Resolution.
Created 2018-11-05
146 commits to main branch, last one 3 months ago
9
57
other
10
Distributed Bayesian Entity Resolution in Apache Spark
Created 2018-08-27
152 commits to master branch, last one 3 years ago
This repository contains code and extensive prompt examples to reproduce and extend the experiments in our papers "Using ChatGPT for Entity Matching" and "Entity Matching using Large Language Models".
Created 2023-04-28
41 commits to main branch, last one 5 months ago