Search Results - RepositoryStats

10.5k

34.9k

apache-2.0

1.1k

Natural Language Processing for the next decade. Tokenization, Part-of-Speech Tagging, Named Entity Recognition, Syntactic & Semantic Dependency Parsing, Document Classification

nlp hanlp pos-tagging semantic-parsing dependency-parser text-classification named-entity-recognition natural-language-processing

Created 2014-10-09

1,795 commits to master branch, last one 3 months ago

NLP-Models-Tensorflow mesolitica

726

1.8k

mit

95

Gathers machine learning and Tensorflow deep learning models for NLP problems, 1.13 < Tensorflow < 2.0

nlp lstm chatbot embedded attention luong-api dnc-seq2seq pos-tagging deep-learning summarization speech-to-text lstm-seq2seq-tf machine-learning language-detection neural-machine-translation optical-character-recognition

This repository has been archived

Created 2018-04-23

275 commits to master branch, last one 4 years ago

underthesea undertheseanlp

281

1.5k

gpl-3.0

80

Underthesea - Vietnamese NLP Toolkit

ner nlp vietnamese nlp-library pos-tagging vietnamese-nlp word-segmenter dependency-parser dependency-parsing vietnamese-tokenizer sentence-segmentation named-entity-recognition natural-language-processing

Created 2017-03-01

864 commits to main branch, last one about a month ago

hazm roshan-research

188

1.3k

mit

23

Persian NLP Toolkit

nlp farsi python persian tokenizer embeddings persian-nlp pos-tagging lemmatization normalization text-processing dependency-parser natural-language-processing

Created 2013-10-29

1,411 commits to master branch, last one 10 months ago

wink-nlp winkjs

59

1.3k

mit

15

Developer friendly Natural Language Processing ✨

Created 2018-12-15

309 commits to master branch, last one 4 months ago

jcseg lionsoul2014

212

922

apache-2.0

90

Jcseg is a lightweight NLP framework developed in Java. It provides CJK and English segmentation based on the MMSEG algorithm, along with keyword extraction, key sentence extraction, summary extraction imp...

nlp java jcseg mmseg chinese-nlp pos-tagging solr-plugin jcseg-analyzer lucene-analyzer lucene-tokenizer keywords-extraction opensearch-analyzer opensearch-tokenizer elasticsearch-analyzer elasticsearch-tokenizer nlp-keywords-extraction chinese-text-segmentation chinese-word-segmentation natural-language-processing

Created 2014-03-31

680 commits to master branch, last one about a year ago

kagome ikawaha

56

857

mit

22

Self-contained Japanese Morphological Analyzer written in pure Go

korean japanese tokenizer nlp-library pos-tagging segmentation hacktoberfest japanese-language morphological-analysis

Created 2014-06-26

821 commits to v2 branch, last one 14 days ago

Sudachi WorksApplications

72

840

unknown

43

A Japanese Tokenizer for Business

nlp-library pos-tagging segmentation morphological-analysis

Created 2017-08-21

871 commits to develop branch, last one 4 months ago

PhoBERT VinAIResearch

102

708

mit

22

PhoBERT: Pre-trained language models for Vietnamese (EMNLP-2020 Findings)

Created 2020-03-03

46 commits to master branch, last one 8 months ago

VnCoreNLP vncorenlp

149

610

other

30

A Vietnamese natural language processing toolkit (NAACL 2018)

ner nlp java parsing python3 vnmarmot vncorenlp pos-tagger vietnamese pos-tagging rdrsegmenter vietnamese-nlp word-segmenter word-segmentation dependency-parsing vietnamese-tokenizer sentence-segmentation named-entity-recognition natural-language-processing

Created 2017-12-30

46 commits to master branch, last one 2 years ago

malaya mesolitica

127

486

mit

27

Natural Language Toolkit for Malaysian language, https://malaya.readthedocs.io/

ner malay malay-nlp normalizer tensorflow pos-tagging bahasa-malaysia emotion-analysis entity-framework language-detection sentiment-analysis subjectivity-analysis natural-language-processing

Created 2018-03-12

905 commits to master branch, last one about a month ago

cogcomp-nlp CogComp

144

475

other

61

CogComp's Natural Language Processing Libraries and Demos: Modules include lemmatizer, ner, pos, prep-srl, quantifier, question type, relation-extraction, similarity, temporal normalizer, tokenizer, t...

ner nlp pos cogcomp big-data tokenizer lemmatizer similarity data-mining pos-tagging lemmatization transliteration dependency-parsing relation-extraction parts-of-speech-tagging named-entity-recognition natural-language-processing natural-language-understanding

Created 2015-10-18

2,621 commits to master branch, last one 2 years ago

camel_tools CAMeL-Lab

75

447

mit

18

A suite of Arabic natural language processing tools developed by the CAMeL Lab at New York University Abu Dhabi.

nlp arabic nlp-apis stemming nlp-library pos-tagging arabic-dialects sentiment-analysis dialect-identification morphological-analysis morphological-generation named-entity-recognition morphological-reinflection morphological-disambiguation

Created 2017-10-05

461 commits to master branch, last one 7 months ago

ArticutAPI Droidtown

39

411

mit

13

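API of Articut, a Chinese word segmenter with semantic part-of-speech tagging. Word segmentation is the foundation of Chinese information processing. Articut uses no machine learning and needs no data model; relying only on the grammar rules of modern vernacular Chinese, it achieves an F1-measure above 94% and a recall above 96% on SIGHAN 2005.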

cws nlp nlu pos-tagger pos-tagging part-of-speech-tagger artificial-intelligence part-of-speech-embdding natural-language-processing natural-language-understanding

Created 2019-04-26

431 commits to master branch, last one 13 days ago

nlpnet erickrf

104

407

mit

34

A neural network architecture for NLP tasks, using cython for fast performance. Currently, it can perform POS tagging, SRL and dependency parsing.

nlp parsing pos-tagging neural-network semantic-role-labeling natural-language-processing

Created 2013-02-26

279 commits to master branch, last one 3 years ago

SudachiPy WorksApplications

52

404

apache-2.0

23

Python version of Sudachi, a Japanese tokenizer.

nlp-library pos-tagging segmentation morphological-analysis

This repository has been archived

Created 2017-09-13

408 commits to develop branch, last one 2 years ago

nagisa taishi-i

23

397

mit

10

A Japanese tokenizer based on recurrent neural networks

nlp dynet japanese tokenizer nlp-library pos-tagging sequence-labeling word-segmentation natural-language-processing

Created 2018-02-14

194 commits to master branch, last one 10 months ago

jumanpp ku-nlp

44

388

apache-2.0

31

Juman++ (a Morphological Analyzer Toolkit)

cjk nlp juman japanese tokenizer pos-tagger pos-tagging word-segmentation part-of-speech-tagger morphological-analyser morphological-analysis

Created 2016-10-11

1,093 commits to master branch, last one 2 years ago

sudachi.rs WorksApplications

38

348

apache-2.0

7

Sudachi in Rust 🦀 and new generation of SudachiPy

rust python sudachi nlp-libary pos-tagging segmentation tokenization morphological-analysis

Created 2019-11-23

482 commits to develop branch, last one 3 months ago

Pytorch-NLU yongzhuo

50

343

apache-2.0

2

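Pytorch-NLU, a Chinese text classification and sequence labeling toolkit. It supports multi-class and multi-label classification of long and short Chinese texts, as well as sequence labeling tasks such as Chinese named entity recognition, part-of-speech tagging, word segmentation, and extractive text summarization.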

bert python3 pytorch pos-tagging transformers pretrained-models sequence-labeling word-segmentation text-classification named-entity-recognition chinese-text-segmentation chinese-text-classification

Created 2021-08-29

13 commits to main branch, last one 9 months ago

engtagger yohasebe

49

271

gpl-2.0

12

English Part-of-Speech Tagger Library; a Ruby port of Lingua::EN::Tagger

nlp ruby english rubynlp pos-tagging

Created 2012-06-05

60 commits to master branch, last one 2 months ago

bi-lstm-crf jidasheng

48

249

mit

8

A PyTorch implementation of the BI-LSTM-CRF model.

crf ner nlp bilstm pytorch lstm-crf crf-model bilstm-crf bi-lstm-crf pos-tagging sequence-tagging sequence-labeling word-segmentation

Created 2019-12-03

6 commits to master branch, last one 5 months ago

SudachiDict WorksApplications

19

248

unknown

12

A lexicon for Sudachi

pos-tagging segmentation nlp-resources morphological-analysis

Created 2019-04-01

122 commits to develop branch, last one 2 months ago

monpa monpa-team

25

246

other

22

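MONPA (罔拍) is a multi-task model that provides Traditional Chinese word segmentation, part-of-speech tagging, and named entity recognition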

ner nlp pos bert albert pos-tagging word-segmentation named-entity-recognition chinese-word-segmentation

Created 2019-07-23

55 commits to master branch, last one about a month ago

nlp-cheat-sheet-python janlukasschroeder

73

239

unknown

10

NLP Cheat Sheet, Python, spaCy, LexNLP, NLTK, tokenization, stemming, sentence detection, named entity recognition

nlp nltk spacy spans lexnlp python spacy-nlp cheat-sheet pos-tagging starter-kit introduction tokenization lemmatization machine-learning dependency-parsing sentence-similarity named-entity-recognition

Created 2019-09-07

38 commits to master branch, last one 2 years ago

vntk vunb

63

216

mit

21

Vietnamese NLP Toolkit for Node

tf-idf vietnamese pos-tagging vietnamese-nlp vietnamese-tokenizer language-identification named-entity-recognition natural-language-processing vietnamese-text-classification

Created 2016-09-01

140 commits to master branch, last one about a year ago

udpipe bnosac

33

215

mpl-2.0

14

R package for Tokenization, Parts of Speech Tagging, Lemmatization and Dependency Parsing Based on the UDPipe Natural Language Processing Toolkit

r nlp rcpp conll r-pkg udpipe r-package tokenizer pos-tagging text-mining lemmatization dependency-parser natural-language-processing

Created 2017-08-25

394 commits to master branch, last one 2 years ago

pytorch-pos-tagging bentrevett

27

180

mit

3

A tutorial on how to implement models for part-of-speech tagging using PyTorch and TorchText.

cnn pos rnn lstm pytorch tutorial torchtext tutorials pos-tagging pytorch-nlp pytorch-tutorial pytorch-tutorials part-of-speech-tagger pytorch-implementation recurrent-neural-networks natural-language-processing

Created 2019-09-18

30 commits to master branch, last one 3 years ago

PhoNLP VinAIResearch

19

141

bsd-3-clause

8

PhoNLP: A BERT-based multi-task learning model for part-of-speech tagging, named entity recognition and dependency parsing (NAACL 2021)

ner pos-tagging language-model vietnamese-nlp dependency-parsing multi-task-learning named-entity-recognition

Created 2020-12-17

74 commits to master branch, last one 3 months ago

Qutuf Qutuf

17

132

unknown

6

Qutuf (قُطُوْف): An Arabic Morphological analyzer and Part-Of-Speech tagger as an Expert System.

Created 2017-09-15

21 commits to master branch, last one 2 years ago