5 results found

785
7.3k
other
68
A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.
Created 2022-11-24
4,728 commits to main branch, last one a day ago
A bidirectional recurrent neural network model with attention mechanism for restoring missing punctuation in unsegmented text
Created 2016-03-23
46 commits to master branch, last one 3 years ago
A Chinese punctuation model that can add punctuation marks to text.
Created 2022-09-14
16 commits to master branch, last one about a year ago
13
66
gpl-3.0
13
Unicode tokeniser. Ucto tokenizes text files: it separates words from punctuation, and splits sentences. It offers several other basic preprocessing steps such as changing case that you can all use to...
Created 2013-03-26
1,584 commits to master branch, last one 2 days ago
A small seq2seq punctuator tool based on DistilBERT
Created 2020-11-19
64 commits to main branch, last one 2 years ago