9 results found

298
3.2k
mit
71
SymSpell: 1 million times faster spelling correction & fuzzy search through Symmetric Delete spelling correction algorithm
Created 2014-03-25
488 commits to master branch, last one about a month ago
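Text mining and preprocessing tools (text cleaning, new word discovery, sentiment analysis, entity recognition and linking, keyword extraction, knowledge extraction, syntactic parsing, etc.), using unsupervised or weakly supervised methods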
Created 2018-11-19
98 commits to master branch, last one 6 months ago
97
1.2k
apache-2.0
8
Desktop app for automatically translating comics (BDs, Manga, Manhwa, Fumetti and more) in a variety of formats (image, PDF, EPUB, CBR, CBZ, etc.) and in multiple languages.
Created 2024-01-25
282 commits to main branch, last one 2 days ago
Python port of SymSpell: 1 million times faster spelling correction & fuzzy search through Symmetric Delete spelling correction algorithm
Created 2018-08-13
292 commits to master branch, last one 15 days ago
Ekphrasis is a text processing tool, geared towards text from social networks, such as Twitter or Facebook. Ekphrasis performs tokenization, word normalization, word segmentation (for splitting hashta...
Created 2017-02-07
77 commits to master branch, last one 2 years ago
A collection of resources (including the papers and datasets) of OCR (Optical Character Recognition).
Created 2018-05-21
117 commits to master branch, last one about a year ago
Fast Word Segmentation with Triangular Matrix
Created 2018-04-21
33 commits to master branch, last one 3 years ago
14
59
apache-2.0
10
Emoji Segmenter
Created 2019-01-24
18 commits to master branch, last one 15 days ago