33 results found Sort:
- Filter by Primary Language:
- Python (18)
- Java (4)
- Jupyter Notebook (3)
- Scala (2)
- TypeScript (1)
- +
:id: A python library for accurate and scalable fuzzy matching, record deduplication and entity-resolution.
Created
2012-04-20
3,332 commits to main branch, last one 19 days ago
Fast, accurate and scalable probabilistic data linkage with support for multiple SQL backends
Created
2019-11-22
9,008 commits to master branch, last one a day ago
A powerful and modular toolkit for record linkage and duplicate detection in Python
Created
2015-10-18
912 commits to master branch, last one about a year ago
Scalable identity resolution, entity resolution, data mastering and deduplication using ML
Created
2021-08-25
2,218 commits to main branch, last one 21 hours ago
1 line for thousands of State of The Art NLP models in hundreds of languages The fastest and most accurate way to solve text problems.
t5
nlu
pandas
seq2seq
streamlit
lemmatizer
transformers
spell-checker
bert-embedding
text-translation
entity-resolution
dependency-parsing
language-detection
sentiment-analysis
text-summarization
sentence-embeddings
text-classification
sentiment-classifier
named-entity-recognition
natural-language-understanding
Created
2020-08-22
1,539 commits to master branch, last one about a month ago
Insightful Tutorials and Papers about Knowledge Graphs
Created
2019-03-10
969 commits to master branch, last one 2 days ago
On-device Speech-to-Intent engine powered by deep learning
Created
2018-10-28
830 commits to master branch, last one a day ago
:id: Examples for using the dedupe library
Created
2014-04-02
1,356 commits to main branch, last one 3 months ago
A list of free data matching and record linkage software.
Created
2018-01-01
53 commits to master branch, last one about a year ago
Recent trends of Entity Linking, Disambiguation, and Representation.
Created
2019-02-18
65 commits to master branch, last one 3 years ago
This repository contains code and datasets related to entity/knowledge papers from the VERT (Versatile Entity Recognition & disambiguation Toolkit) project, by the Knowledge Computing group at Microso...
Created
2019-07-25
103 commits to master branch, last one 8 months ago
An open source, high scalability toolkit in Java for Entity Resolution.
Created
2016-03-16
650 commits to master branch, last one 7 months ago
ReFinED is an efficient and accurate entity linking (EL) system.
Created
2022-05-03
19 commits to main branch, last one about a year ago
🔎 Finds fuzzy matches between CSV files
Created
2015-12-08
134 commits to master branch, last one 7 months ago
Entity resolution for Elasticsearch.
Created
2018-02-11
277 commits to main branch, last one 5 months ago
PyTorch library for transforming entities like companies, products, etc. into vectors to support scalable Record Linkage / Entity Resolution using Approximate Nearest Neighbors.
Created
2021-01-28
373 commits to main branch, last one 2 years ago
Resources for tackling record linkage / deduplication / data matching problems
Created
2017-11-04
29 commits to master branch, last one 2 years ago
OpenRefine reconciliation services for VIAF, ORCID, and Open Library + framework for creating more.
Created
2016-10-25
240 commits to main branch, last one about a year ago
Record Linkage ToolKit (Find and link entities)
Created
2017-02-15
636 commits to master branch, last one 3 years ago
A convenient way to link, deduplicate, aggregate and cluster data(frames) in Python using deep learning
Created
2023-08-01
118 commits to main branch, last one 5 months ago
Link Wikidata items to large catalogs
Created
2018-07-11
2,087 commits to master branch, last one 2 years ago
How to construct knowledge graphs from unstructured data sources
Created
2024-07-31
18 commits to main branch, last one about a month ago
Python package for deduplication/entity resolution using active learning
Created
2021-04-13
294 commits to main branch, last one 2 months ago
An open-source library that leverages Python’s data science ecosystem to build powerful end-to-end Entity Resolution workflows.
Created
2022-07-04
323 commits to main branch, last one 10 days ago
Python implementation of anonymous linkage using cryptographic linkage keys
Created
2017-05-30
470 commits to main branch, last one about a year ago
SparkER: an Entity Resolution framework for Apache Spark
Created
2017-02-10
80 commits to master branch, last one 7 months ago
Distributed Bayesian Entity Resolution in Apache Spark
Created
2018-08-27
152 commits to master branch, last one 3 years ago
Curated list of awesome software and resources for Senzing, The First Real-Time AI for Entity Resolution.
Created
2018-11-05
144 commits to main branch, last one 22 days ago
This repository contains code and extensive prompt examples to reproduce and extend the experiments in our papers "Using ChatGPT for Entity Matching" and "Entity Matching using Large Language Models".
Created
2023-04-28
41 commits to main branch, last one about a month ago
Low effort linking and easy de-duplication. Databricks ARC provides a simple, automated, lakehouse integrated entity resolution solution for intra and inter data linking.
Created
2023-01-09
391 commits to main branch, last one 29 days ago