8 results found
SymSpell: 1 million times faster spelling correction & fuzzy search through Symmetric Delete spelling correction algorithm
Created
2014-03-25
490 commits to master branch, last one 9 days ago
Deep Learning Chinese Word Segment
Created
2016-11-25
88 commits to master branch, last one 7 years ago
"Jieba" (結巴, Chinese for "to stutter") Chinese text segmentation: built to be the best PHP Chinese word segmentation module.
Created
2015-04-23
177 commits to master branch, last one 2 years ago
Jcseg is a lightweight NLP framework developed in Java. It provides CJK and English segmentation based on the MMSEG algorithm, along with keywords extraction, key sentence extraction, summary extraction imp...
nlp
java
jcseg
mmseg
chinese-nlp
pos-tagging
solr-plugin
jcseg-analyzer
lucene-analyzer
lucene-tokenizer
keywords-extraction
opensearch-analyzer
opensearch-tokenizer
elasticsearch-analyzer
elasticsearch-tokenizer
nlp-keywords-extraction
chinese-text-segmentation
chinese-word-segmentation
natural-language-processing
Created
2014-03-31
680 commits to master branch, last one about a year ago
Python port of SymSpell: 1 million times faster spelling correction & fuzzy search through Symmetric Delete spelling correction algorithm
Created
2018-08-13
294 commits to master branch, last one about a month ago
zhparser is a PostgreSQL extension for full-text search of Chinese language
Created
2013-02-24
134 commits to master branch, last one 8 days ago
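Pytorch-NLU, a Chinese text classification and sequence labeling toolkit. It supports multi-class and multi-label classification of long and short Chinese texts, as well as sequence labeling tasks such as Chinese named entity recognition, part-of-speech tagging, word segmentation, and extractive text summarization.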
Created
2021-08-29
13 commits to main branch, last one 6 months ago
Tokenizer supporting Lucene 5/6/7/8/9+ versions, LTS
Created
2014-11-13
73 commits to master branch, last one about a year ago