6 results found Sort:

279
3.9k
apache-2.0
31
Python & Command-line tool to gather text and metadata on the Web: Crawling, scraping, extraction, output as CSV, JSON, HTML, MD, TXT, XML
Created 2019-04-08
1,591 commits to master branch, last one 2 days ago
Document Layout Analysis resources repos for development with PdfPig.
Created 2019-09-02
181 commits to master branch, last one about a year ago
hand-written dictionaries from the FreeDict project
Created 2015-08-05
1,725 commits to master branch, last one 4 months ago
The main TEI Publisher app
Created 2020-06-03
2,808 commits to master branch, last one 3 months ago
52
55
unknown
28
ParlaMint: Comparable Parliamentary Corpora
Created 2020-12-09
3,963 commits to main branch, last one about a month ago
Generation of bilingual dictionaries from Wiktionary/dbnary data for the WikDict project
Created 2019-11-07
234 commits to master branch, last one 3 months ago