17 results found Sort:
- Filter by Primary Language:
- Python (4)
- Jupyter Notebook (2)
- HTML (1)
- PHP (1)
- Rust (1)
- Astro (1)
- TypeScript (1)
- Go (1)
- +
An Integrated Corpus Tool With Multilingual Support for the Study of Language, Literature, and Translation
Created
2018-08-23
1,450 commits to main branch, last one a day ago
My book list
sql
book
android
reading
algorithms
data-mining
data-science
reading-list
deep-learning
computer-vision
data-structures
design-patterns
database-systems
machine-learning
corpus-linguistics
computer-networking
discrete-mathematics
statistical-learning
artificial-intelligence
natural-language-processing
Created
2017-09-21
139 commits to master branch, last one 4 months ago
A Curated List of Dataset and Usable Library Resources for NLP in Bahasa Indonesia
Created
2020-03-31
63 commits to master branch, last one 2 years ago
Curated list of open-access/open-source/off-the-shelf resources and tools developed with a particular focus on German
Created
2018-06-12
88 commits to master branch, last one 7 days ago
A list of Indonesian NLP resources.
Created
2018-04-07
30 commits to master branch, last one 2 years ago
A web-based engine for creating and annotating textual corpora
Created
2014-03-27
2,786 commits to master branch, last one about a year ago
A curated list of NLP resources for Hungarian
nlp
nlu
corpus
parser
tagger
awesome
dataset
hungarian
text-mining
awesome-list
nlp-resources
opinion-mining
corpus-linguistics
hungarian-language
information-retrieval
information-extraction
named-entity-recognition
computational-linguistics
natural-language-processing
natural-language-understanding
Created
2017-04-16
119 commits to master branch, last one about a year ago
Crawler for linguistic corpora
Created
2017-09-08
392 commits to master branch, last one 11 months ago
:spider: The pipeline for the OSCAR corpus
Created
2021-02-15
419 commits to main branch, last one 12 months ago
Kanji usage frequency data collected from various sources
Created
2016-01-24
177 commits to master branch, last one 24 days ago
Data for the quantitative study of (Vedic) Sanskrit
Created
2018-08-18
49 commits to master branch, last one 8 days ago
An asynchronous concurrent pipeline for classifying Common Crawl based on fastText's pipeline.
This repository has been archived
(exclude archived)
Created
2019-03-01
17 commits to master branch, last one 3 years ago
Quran, Hadith, Translations, Tafaseer, Corpus Linguistics. Everything for NLP
Created
2022-02-20
137 commits to master branch, last one 7 months ago
Large silver standart Russian corpus with NER, morphology and syntax markup
Created
2018-09-10
206 commits to master branch, last one about a year ago
An advanced, extensible web front-end for the Manatee-open corpus search engine
Created
2015-04-14
12,748 commits to master branch, last one a day ago
SpeCT - Speech Corpus Toolkit for Praat. Documentation: https://lennes.github.io/spect/
Created
2017-03-08
49 commits to master branch, last one about a year ago
A large high-quality corpus of Chinese synonyms 一个大型、高质量的中文同义词语料库。
Created
2021-11-02
5 commits to main branch, last one 2 years ago