5 results found

459
4.0k
other
49
A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.
Created 2022-11-24
4,267 commits to main branch, last one a day ago
A bidirectional recurrent neural network model with an attention mechanism for restoring missing punctuation in unsegmented text
Created 2016-03-23
46 commits to master branch, last one 2 years ago
A Chinese punctuation model that can add punctuation marks to text.
Created 2022-09-14
16 commits to master branch, last one 10 months ago
13
63
gpl-3.0
13
Unicode tokeniser. Ucto tokenizes text files: it separates words from punctuation, and splits sentences. It offers several other basic preprocessing steps such as changing case that you can all use to...
Created 2013-03-26
1,564 commits to master branch, last one 14 days ago
A small seq2seq punctuator tool based on DistilBERT
Created 2020-11-19
64 commits to main branch, last one about a year ago