9 results found

281
3.1k
mit
71
SymSpell: 1 million times faster spelling correction & fuzzy search through Symmetric Delete spelling correction algorithm
Created 2014-03-25
485 commits to master branch, last one about a month ago
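Text mining and preprocessing tools (text cleaning, new word discovery, sentiment analysis, entity recognition and linking, keyword extraction, knowledge extraction, syntactic parsing, etc.), using unsupervised or weakly supervised methods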
Created 2018-11-19
98 commits to master branch, last one 18 days ago
Python port of SymSpell: 1 million times faster spelling correction & fuzzy search through Symmetric Delete spelling correction algorithm
Created 2018-08-13
261 commits to master branch, last one 2 months ago
Ekphrasis is a text processing tool, geared towards text from social networks, such as Twitter or Facebook. Ekphrasis performs tokenization, word normalization, word segmentation (for splitting hashta...
Created 2017-02-07
77 commits to master branch, last one about a year ago
A collection of OCR (Optical Character Recognition) resources, including papers and datasets.
Created 2018-05-21
117 commits to master branch, last one about a year ago
26
294
apache-2.0
5
Desktop app for automatically translating comics - BDs, manga, manhwa, fumetti, and more - in a variety of formats (image, PDF, EPUB, CBR, CBZ, etc.) and in multiple languages.
Created 2024-01-25
37 commits to main branch, last one 4 days ago
Fast Word Segmentation with Triangular Matrix
Created 2018-04-21
33 commits to master branch, last one 2 years ago
13
59
apache-2.0
11
Emoji Segmenter
Created 2019-01-24
7 commits to master branch, last one 3 months ago