11 results found

104
668
mit
10
NeuSpell: A Neural Spelling Correction Toolkit
Created 2020-07-12
39 commits to master branch, last one 3 years ago
103
612
mit
11
Modern spell checking library - accurate, fast, multi-language
Created 2017-11-12
233 commits to master branch, last one 2 months ago
Covid-19 Twitter dataset for non-commercial research use and pre-processing scripts - under active development
Created 2020-03-23
332 commits to master branch, last one about a year ago
Next-token prediction in JavaScript — build fast language and diffusion models.
Created 2024-04-10
65 commits to master branch, last one about a month ago
20
128
mit
13
A C++ library providing fast language model queries in compressed space.
Created 2017-04-12
87 commits to master branch, last one about a year ago
20
124
gpl-3.0
12
Colibri core is an NLP tool as well as a C++ and Python library for working with basic linguistic constructions such as n-grams and skipgrams (i.e., patterns with one or more gaps, either of fixed or dy...
Created 2013-09-21
1,393 commits to master branch, last one 11 months ago
NLP Functions for amplifying negations, managing elisions, creating ngrams, stems, phonetic codes to tokens and more.
Created 2017-05-12
136 commits to master branch, last one 8 months ago
A fast and reliable PHP library for detecting languages
Created 2016-09-17
93 commits to master branch, last one 10 months ago
A flexible and general-purpose ngrams library written in Ruby. Raingrams supports ngram sizes greater than 1, text/non-text grams, multiple parsing styles and open/closed vocabulary models.
Created 2009-03-08
139 commits to master branch, last one 3 years ago
This repository contains solutions to the assignments of the Natural Language Processing Specialization from DeepLearning.AI on Coursera, taught by Younes Bensouda Mourri, Łukasz Kaiser, and Eddy Shyu.
Created 2023-03-17
318 commits to main branch, last one about a year ago
Word/n-gram frequency lists for the Google Books Ngram Corpus (v3, all languages) with Python code
Created 2022-08-20
34 commits to main branch, last one about a year ago