25 results found Sort:

A collection of corpora for named entity recognition (NER) and entity recognition tasks. These annotated datasets cover a variety of languages, domains and entity types.
Created 2018-09-01
256 commits to master branch, last one 5 months ago
Portuguese pre-trained BERT models
Created 2020-01-14
20 commits to master branch, last one 2 years ago
The hands-on NLTK tutorial for NLP in Python
Created 2018-03-25
42 commits to main branch, last one about a year ago
59
495
unknown
24
A curated list of Open Information Extraction (OIE) resources: papers, code, data, etc.
Created 2018-11-22
383 commits to master branch, last one 2 years ago
61
367
unknown
8
chinese NLP corpus of chinese science fiction,chinese science fiction corpus : About 4675 Chinese science fiction novels 大约有4675本科幻小说,中文科幻小说自然语言处理语料库,中文科幻小说文本语料库,中文科幻小说文本数据库,科幻小说语料
Created 2020-07-31
5 commits to master branch, last one 2 years ago
53
352
cc0-1.0
24
My NLP datasets for Russian language
Created 2017-11-06
105 commits to master branch, last one about a year ago
This repository contains code and datasets related to entity/knowledge papers from the VERT (Versatile Entity Recognition & disambiguation Toolkit) project, by the Knowledge Computing group at Microso...
Created 2019-07-25
103 commits to master branch, last one 8 months ago
A lexicon for Sudachi
Created 2019-04-01
120 commits to develop branch, last one 4 months ago
This is a continuously updated handbook for readers to easily track the latest NL2SQL (Text2SQL) techniques in the literature and provide practical guidance for researchers and practitioners.
Created 2024-05-21
88 commits to main branch, last one 2 days ago
29
198
mit
11
A Dutch RoBERTa-based language model
Created 2019-12-19
155 commits to master branch, last one 7 months ago
19
173
unknown
11
TriggerNER: Learning with Entity Triggers as Explanations for Named Entity Recognition (ACL 2020)
Created 2020-04-12
28 commits to master branch, last one 3 years ago
summaries of all the papers I read
Created 2017-10-04
2,860 commits to master branch, last one 15 days ago
22
96
unknown
3
chinese NLP corpus of chinese science fiction, chinese science fiction corpus: Archive of the Ark Plan of Ula Science Fiction Website 乌拉科幻小说网方舟计划存档,中文科幻小说自然语言处理语料库,中文科幻小说文本语料库,中文科幻小说文本数据库,科幻小说语料
Created 2020-05-04
8 commits to master branch, last one 2 years ago
A modular annotation system that supports complex, interactive annotation graphs embedded on top of sequences of text.
Created 2017-01-06
259 commits to master branch, last one 4 years ago
This repository has no description...
Created 2018-05-11
24 commits to master branch, last one about a year ago
25
88
gpl-3.0
7
An open information extraction system that provides compact extractions
Created 2017-07-06
45 commits to master branch, last one 5 years ago
Arabic NLP tools List inventory
Created 2018-12-14
9 commits to master branch, last one 2 years ago
A Python module that fetches a page of a word/phrase from the Online Indonesian Dictionary (https://kbbi.kemdikbud.go.id).
Created 2017-12-01
153 commits to master branch, last one 3 years ago
11
73
unknown
8
Natural Language Processing (NLP). Covering topics such as Tokenization, Part Of Speech tagging (POS), Machine translation, Named Entity Recognition (NER), Classification, and Sentiment analysis.
Created 2021-08-30
16 commits to main branch, last one 10 months ago
Resources to go with the Indic NLP Library
Created 2014-10-20
31 commits to master branch, last one 3 years ago
Python library for feature selection for text features. It has filter method, genetic algorithm and TextFeatureSelectionEnsemble for improving text classification models. Helps improve your machine le...
Created 2020-05-26
58 commits to master branch, last one about a year ago
Natural Language Processing Courses with Resources
Created 2023-07-21
17 commits to main branch, last one 12 days ago
A list of Romanian NLP Datasets
Created 2023-05-23
45 commits to main branch, last one about a month ago