8 results found

64
1.1k
apache-2.0
11
The most accurate natural language detection library for Go, suitable for short text and mixed-language text
Created 2020-11-27
101 commits to main branch, last one 5 months ago
43
991
apache-2.0
12
The most accurate natural language detection library for Python, suitable for short text and mixed-language text
Created 2021-07-13
183 commits to main branch, last one 2 months ago
35
848
apache-2.0
8
The most accurate natural language detection library for Rust, suitable for short text and mixed-language text
Created 2020-06-17
402 commits to main branch, last one 2 months ago
60
665
apache-2.0
11
The most accurate natural language detection library for Java and the JVM, suitable for long and short text alike
Created 2018-11-15
512 commits to main branch, last one 4 months ago
14
154
apache-2.0
2
🕷️ The pipeline for the OSCAR corpus
Created 2021-02-15
419 commits to main branch, last one 7 months ago
6
85
apache-2.0
9
An asynchronous concurrent pipeline for classifying Common Crawl based on fastText's pipeline.
This repository has been archived
Created 2019-03-01
17 commits to master branch, last one 3 years ago
6
76
apache-2.0
4
GlotLID: Language Identification with Support for More Than 2000 Labels -- EMNLP 2023
Created 2023-09-26
13 commits to main branch, last one about a month ago
A language classifier powered by a recurrent neural network, implemented in Python without AI libraries. AI from scratch.
Created 2020-10-09
19 commits to master branch, last one 2 years ago