8 results found

299
3.2k
mit
71
SymSpell: 1 million times faster spelling correction & fuzzy search through Symmetric Delete spelling correction algorithm
Created 2014-03-25
491 commits to master branch, last one 17 days ago
645
2.1k
unknown
164
Deep learning Chinese word segmentation
Created 2016-11-25
88 commits to master branch, last one 7 years ago
260
1.3k
mit
56
"Jieba" (Chinese for "to stutter") Chinese text segmentation: built to be the best PHP Chinese word segmentation module.
Created 2015-04-23
177 commits to master branch, last one 2 years ago
211
916
apache-2.0
91
Jcseg is a lightweight NLP framework developed in Java. It provides CJK and English segmentation based on the MMSEG algorithm, along with keyword extraction, key sentence extraction, summary extraction imp...
Created 2014-03-31
680 commits to master branch, last one about a year ago
Python port of SymSpell: 1 million times faster spelling correction & fuzzy search through Symmetric Delete spelling correction algorithm
Created 2018-08-13
294 commits to master branch, last one 2 months ago
86
731
other
20
zhparser is a PostgreSQL extension for full-text search of Chinese language
Created 2013-02-24
134 commits to master branch, last one 26 days ago
50
340
apache-2.0
3
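Pytorch-NLU, a Chinese text classification and sequence labeling toolkit. It supports multi-class and multi-label classification of long and short Chinese texts, as well as sequence labeling tasks such as Chinese named entity recognition, part-of-speech tagging, word segmentation, and extractive text summarization.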
Created 2021-08-29
13 commits to main branch, last one 7 months ago
74
204
lgpl-2.1
16
Tokenizer supporting Lucene 5/6/7/8/9+ versions, LTS
Created 2014-11-13
73 commits to master branch, last one about a year ago