18 results found

jieba analysis plugin for Elasticsearch 7.0.0, 6.4.0, 6.0.0, 5.4.0, 5.3.0, 5.2.2, 5.2.1, 5.2, 5.1.2, 5.1.1
This repository has been archived
Created 2017-01-17
107 commits to master branch, last one about a year ago
A collection of language stemmers and stopwords for the Lunr JavaScript library
Created 2014-04-20
169 commits to master branch, last one about a year ago
All languages stopwords collection
Created 2016-10-04
55 commits to master branch, last one 4 years ago
243
324
cc-by-4.0
16
List of common stop words in various languages.
Created 2014-05-06
97 commits to master branch, last one 2 years ago
Largest list of Arabic stop words on GitHub. أكبر قائمة لمستبعدات الفهرسة العربية على جيت هاب
Created 2016-05-27
17 commits to master branch, last one 7 months ago
128
291
unknown
12
Default English stopword lists from many different sources
Created 2016-10-14
23 commits to master branch, last one 5 years ago
A keyword and phrase extraction library based on the Rapid Automatic Keyword Extraction algorithm (RAKE).
Created 2016-09-02
84 commits to master branch, last one 4 months ago
This repository consists of a complete guide on natural language processing (NLP) in Python where we'll learn various techniques for implementing NLP including parsing & text processing and understand...
Created 2021-11-08
19 commits to main branch, last one 2 years ago
Persian (Farsi) Stop Words List
Created 2015-10-22
16 commits to master branch, last one 3 years ago
25
142
other
8
Removes the most frequent words (stop words) from text content. Based on a curated list of language statistics.
Created 2015-10-16
24 commits to master branch, last one 6 years ago
84
127
other
20
🍊 📄 Text Mining add-on for Orange3
Created 2015-06-26
2,368 commits to master branch, last one 26 days ago
14
110
unknown
13
A data package containing lexicons and dictionaries for text analysis
Created 2016-03-15
123 commits to master branch, last one 3 years ago
PHP | A collection of stop words for e.g. search-functions.
Created 2017-05-04
20 commits to master branch, last one 2 years ago
The list of ~2000 Ukrainian stopwords (with numbers)
Created 2020-09-29
14 commits to master branch, last one 3 years ago
A collection of Persian stopwords - فهرست کلمات ایست فارسی
Created 2016-10-13
10 commits to master branch, last one 3 years ago
📒 An Aho-Corasick algorithm based string-searching utility for Go. It supports tokenization, ignoring case, replacing text. So you can use it to find keywords in an article, filter sensitive words, e...
Created 2022-04-21
18 commits to main branch, last one 2 years ago
This project employs emotion detection in textual data, specifically trained on Twitter data comprising tweets labeled with corresponding emotions. It seamlessly takes text inputs and provides the mos...
Created 2023-04-02
22 commits to main branch, last one 8 days ago
📒 An Aho-Corasick algorithm based string-searching utility for Java. It supports tokenization, ignoring case, replacing text. So you can use it to find keywords in an article, filter sensitive words,...
Created 2022-04-20
10 commits to main branch, last one 2 years ago