8 results found

302
3.2k
mit
70
SymSpell: 1 million times faster spelling correction & fuzzy search through Symmetric Delete spelling correction algorithm
Created 2014-03-25
492 commits to master branch, last one 10 days ago
645
2.1k
unknown
162
Deep Learning Chinese Word Segment
Created 2016-11-25
88 commits to master branch, last one 7 years ago
260
1.3k
mit
56
"Jieba" (結巴, Chinese for "to stutter") Chinese text segmentation: built to be the best PHP Chinese word segmentation module.
Created 2015-04-23
177 commits to master branch, last one 2 years ago
212
922
apache-2.0
90
Jcseg is a lightweight NLP framework developed in Java. It provides CJK and English segmentation based on the MMSEG algorithm, along with keyword extraction, key sentence extraction, summary extraction imp...
Created 2014-03-31
680 commits to master branch, last one about a year ago
Python port of SymSpell: 1 million times faster spelling correction & fuzzy search through Symmetric Delete spelling correction algorithm
Created 2018-08-13
322 commits to master branch, last one 4 days ago
86
736
other
19
zhparser is a PostgreSQL extension for full-text search of the Chinese language
Created 2013-02-24
134 commits to master branch, last one about a month ago
50
341
apache-2.0
2
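Pytorch-NLU, a toolkit for Chinese text classification and sequence labeling. It supports multi-class and multi-label classification of long and short Chinese texts, as well as sequence labeling tasks such as Chinese named entity recognition, part-of-speech tagging, word segmentation, and extractive text summarization.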
Created 2021-08-29
13 commits to main branch, last one 8 months ago
74
203
lgpl-2.1
16
Tokenizer supporting Lucene 5/6/7/8/9+ versions, LTS
Created 2014-11-13
73 commits to master branch, last one about a year ago