8 results found Sort:

300
3.2k
mit
71
SymSpell: 1 million times faster spelling correction & fuzzy search through Symmetric Delete spelling correction algorithm
Created 2014-03-25
490 commits to master branch, last one 9 days ago
646
2.1k
unknown
164
Deep Learning Chinese Word Segment
Created 2016-11-25
88 commits to master branch, last one 7 years ago
260
1.3k
mit
56
"結巴"中文分詞:做最好的 PHP 中文分詞、中文斷詞組件。 / "Jieba" (Chinese for "to stutter") Chinese text segmentation: built to be the best PHP Chinese word segmentation module.
Created 2015-04-23
177 commits to master branch, last one 2 years ago
212
916
apache-2.0
91
Jcseg is a light weight NLP framework developed with Java. Provide CJK and English segmentation based on MMSEG algorithm, With also keywords extraction, key sentence extraction, summary extraction imp...
Created 2014-03-31
680 commits to master branch, last one about a year ago
Python port of SymSpell: 1 million times faster spelling correction & fuzzy search through Symmetric Delete spelling correction algorithm
Created 2018-08-13
294 commits to master branch, last one about a month ago
86
725
other
20
zhparser is a PostgreSQL extension for full-text search of Chinese language
Created 2013-02-24
134 commits to master branch, last one 8 days ago
50
341
apache-2.0
3
Pytorch-NLU,一个中文文本分类、序列标注工具包,支持中文长文本、短文本的多类、多标签分类任务,支持中文命名实体识别、词性标注、分词、抽取式文本摘要等序列标注任务。 Ptorch NLU, a Chinese text classification and sequence annotation toolkit, supports multi class and multi label c...
Created 2021-08-29
13 commits to main branch, last one 6 months ago
74
204
lgpl-2.1
16
Tokenizer support Lucene5/6/7/8/9+ version, LTS
Created 2014-11-13
73 commits to master branch, last one about a year ago