7 results found Sort:

Resources for conservation, development, and documentation of low resource (human) languages.
Created 2014-07-23
428 commits to master branch, last one 2 months ago
42
248
unknown
6
This repository contains the code, data, and models of the paper titled "XL-Sum: Large-Scale Multilingual Abstractive Summarization for 44 Languages" published in Findings of the Association for Compu...
Created 2021-06-26
17 commits to master branch, last one 3 months ago
45
144
unknown
10
This repository contains the code and data of the paper titled "Not Low-Resource Anymore: Aligner Ensembling, Batch Filtering, and New Datasets for Bengali-English Machine Translation" published in Pr...
Created 2020-10-05
20 commits to master branch, last one about a year ago
A repository for publicly/freely available Natural Language Processing (NLP) datasets for African languages.
Created 2021-05-01
49 commits to main branch, last one 2 months ago
6
76
apache-2.0
4
GlotLID: Language Identification with Support for More Than 2000 Labels -- EMNLP 2023
Created 2023-09-26
13 commits to main branch, last one about a month ago
NLP pipelines for Tagalog using spaCy
Created 2022-10-17
220 commits to master branch, last one 4 days ago
Speech synthesis (TTS) in low-resource languages by training from scratch with Fastpitch and fine-tuning with HifiGan
Created 2022-12-02
95 commits to main branch, last one 6 months ago