Search Results - RepositoryStats

303

3.2k

mit

70

SymSpell: 1 million times faster spelling correction & fuzzy search through Symmetric Delete spelling correction algorithm

spelling symspell spellcheck levenshtein spell-check fuzzy-search edit-distance fuzzy-matching text-segmentation word-segmentation damerau-levenshtein spelling-correction levenshtein-distance chinese-text-segmentation chinese-word-segmentation approximate-string-matching

Created 2014-03-25

493 commits to master branch, last one 24 days ago

kcws koth

645

2.1k

unknown

162

Deep Learning Chinese Word Segment

nlp pos-tagger tensorflow deep-learning chinese-text-segmentation

Created 2016-11-25

88 commits to master branch, last one 7 years ago

jieba-php fukuball

260

1.3k

mit

56

"結巴"中文分詞：做最好的 PHP 中文分詞、中文斷詞組件。 / "Jieba" (Chinese for "to stutter") Chinese text segmentation: built to be the best PHP Chinese word segmentation module.

nlp machine-learning chinese-text-segmentation natural-language-processing

Created 2015-04-23

178 commits to master branch, last one 11 days ago

jcseg lionsoul2014

212

922

apache-2.0

90

Jcseg is a light weight NLP framework developed with Java. Provide CJK and English segmentation based on MMSEG algorithm, With also keywords extraction, key sentence extraction, summary extraction imp...

nlp java jcseg mmseg chinese-nlp pos-tagging solr-plugin jcseg-analyzer lucene-analyzer lucene-tokenizer keywords-extraction opensearch-analyzer opensearch-tokenizer elasticsearch-analyzer elasticsearch-tokenizer nlp-keywords-extraction chinese-text-segmentation chinese-word-segmentation natural-language-processing

Created 2014-03-31

680 commits to master branch, last one about a year ago

symspellpy mammothb

124

824

mit

15

Python port of SymSpell: 1 million times faster spelling correction & fuzzy search through Symmetric Delete spelling correction algorithm

python spelling symspell spellcheck levenshtein spell-check fuzzy-search edit-distance fuzzy-matching text-segmentation word-segmentation damerau-levenshtein spelling-correction levenshtein-distance chinese-text-segmentation chinese-word-segmentation approximate-string-matching

Created 2018-08-13

323 commits to master branch, last one 24 days ago

zhparser amutu

86

737

other

19

zhparser is a PostgreSQL extension for full-text search of Chinese language

scws chinese zhparser extension postgresql chinese-nlp chinese-text-segmentation

Created 2013-02-24

134 commits to master branch, last one 2 months ago

Pytorch-NLU yongzhuo

50

344

apache-2.0

2

Pytorch-NLU，一个中文文本分类、序列标注工具包，支持中文长文本、短文本的多类、多标签分类任务，支持中文命名实体识别、词性标注、分词、抽取式文本摘要等序列标注任务。 Ptorch NLU, a Chinese text classification and sequence annotation toolkit, supports multi class and multi label c...

bert python3 pytorch pos-tagging transformers pretrained-models sequence-labeling word-segmentation text-classification named-entity-recognition chinese-text-segmentation chinese-text-classification

Created 2021-08-29

13 commits to main branch, last one 9 months ago

ik-analyzer blueshen

74

205

lgpl-2.1

16

Tokenizer support Lucene5/6/7/8/9+ version, LTS

java solr lucene lucene9 solrcloud ik-analyzer elasticsearch search-engine chinese-text-segmentation

Created 2014-11-13

73 commits to master branch, last one about a year ago