8 results found Sort:

303
3.2k
mit
70
SymSpell: 1 million times faster spelling correction & fuzzy search through Symmetric Delete spelling correction algorithm
Created 2014-03-25
493 commits to master branch, last one 24 days ago
645
2.1k
unknown
162
Deep Learning Chinese Word Segment
Created 2016-11-25
88 commits to master branch, last one 7 years ago
260
1.3k
mit
56
"結巴"中文分詞:做最好的 PHP 中文分詞、中文斷詞組件。 / "Jieba" (Chinese for "to stutter") Chinese text segmentation: built to be the best PHP Chinese word segmentation module.
Created 2015-04-23
178 commits to master branch, last one 11 days ago
212
922
apache-2.0
90
Jcseg is a light weight NLP framework developed with Java. Provide CJK and English segmentation based on MMSEG algorithm, With also keywords extraction, key sentence extraction, summary extraction imp...
Created 2014-03-31
680 commits to master branch, last one about a year ago
Python port of SymSpell: 1 million times faster spelling correction & fuzzy search through Symmetric Delete spelling correction algorithm
Created 2018-08-13
323 commits to master branch, last one 24 days ago
86
737
other
19
zhparser is a PostgreSQL extension for full-text search of Chinese language
Created 2013-02-24
134 commits to master branch, last one 2 months ago
50
344
apache-2.0
2
Pytorch-NLU,一个中文文本分类、序列标注工具包,支持中文长文本、短文本的多类、多标签分类任务,支持中文命名实体识别、词性标注、分词、抽取式文本摘要等序列标注任务。 Ptorch NLU, a Chinese text classification and sequence annotation toolkit, supports multi class and multi label c...
Created 2021-08-29
13 commits to main branch, last one 9 months ago
74
205
lgpl-2.1
16
Tokenizer support Lucene5/6/7/8/9+ version, LTS
Created 2014-11-13
73 commits to master branch, last one about a year ago