10 results found Sort:

Resources for conservation, development, and documentation of low resource (human) languages.
Created 2014-07-23
428 commits to master branch, last one 9 months ago
41
261
unknown
6
This repository contains the code, data, and models of the paper titled "XL-Sum: Large-Scale Multilingual Abstractive Summarization for 44 Languages" published in Findings of the Association for Compu...
Created 2021-06-26
17 commits to master branch, last one 10 months ago
47
147
unknown
9
This repository contains the code and data of the paper titled "Not Low-Resource Anymore: Aligner Ensembling, Batch Filtering, and New Datasets for Bengali-English Machine Translation" published in Pr...
Created 2020-10-05
22 commits to master branch, last one 3 months ago
A repository for publicly/freely available Natural Language Processing (NLP) datasets for African languages.
Created 2021-05-01
49 commits to main branch, last one 9 months ago
Open-source benchmark datasets and pretrained transformer models in the Filipino language.
Created 2020-05-04
68 commits to master branch, last one 5 months ago
Speech synthesis (TTS) in low-resource languages by training from scratch with Fastpitch and fine-tuning with HifiGan
Created 2022-12-02
95 commits to main branch, last one about a year ago
NLP pipelines for Tagalog using spaCy
Created 2022-10-17
279 commits to master branch, last one a day ago
10
46
unknown
8
CogNet: a large-scale, high-quality cognate database for 338 languages, 1.07M words, and 8.1 million cognates
Created 2019-07-28
76 commits to master branch, last one about a year ago
SemEval2024-task 11: Bridging the Gap in Text-Based Emotion Detection
Created 2024-06-04
485 commits to main branch, last one 2 days ago