25 results found Sort:

Implementation of SoundStorm, Efficient Parallel Audio Generation from Google Deepmind, in Pytorch
Created 2023-05-17
94 commits to main branch, last one about a month ago
StreamSpeech is an “All in One” seamless model for offline and simultaneous speech recognition, speech translation and speech synthesis.
Created 2024-06-04
25 commits to main branch, last one 3 months ago
[ICASSP 2024] 🍵 Matcha-TTS: A fast TTS architecture with conditional flow matching
Created 2023-09-05
115 commits to main branch, last one 19 days ago
PyTorch Implementation of PortaSpeech: Portable and High-Quality Generative Text-to-Speech
Created 2021-10-06
16 commits to main branch, last one 2 years ago
A Non-Autoregressive Transformer based Text-to-Speech, supporting a family of SOTA transformers with supervised and unsupervised duration modelings. This project grows with the research community, aim...
Created 2021-08-24
10 commits to main branch, last one 2 years ago
PyTorch Implementation of DiffGAN-TTS: High-Fidelity and Efficient Text-to-Speech with Denoising Diffusion GANs
Created 2022-02-14
3 commits to main branch, last one 2 years ago
PyTorch Implementation of Non-autoregressive Expressive (emotional, conversational) TTS based on FastSpeech2, supporting English, Korean, and your own languages.
Created 2021-04-26
9 commits to main branch, last one 3 years ago
PyTorch implementation of DiffSinger: Singing Voice Synthesis via Shallow Diffusion Mechanism (focused on DiffSpeech)
Created 2021-06-03
3 commits to main branch, last one 2 years ago
Official repository of DailyTalk: Spoken Dialogue Dataset for Conversational Text-to-Speech, ICASSP 2023
Created 2022-06-02
10 commits to main branch, last one about a year ago
PyTorch Implementation of Meta-StyleSpeech : Multi-Speaker Adaptive Text-to-Speech Generation
Created 2021-06-09
17 commits to main branch, last one 2 years ago
PyTorch Implementation of Google's Parallel Tacotron 2: A Non-Autoregressive Neural TTS Model with Differentiable Duration Modeling
Created 2021-04-30
5 commits to main branch, last one 3 years ago
PyTorch Implementation of ByteDance's Cross-speaker Emotion Transfer Based on Speaker Condition Layer Normalization and Semi-Supervised Training in Text-To-Speech
Created 2021-10-16
7 commits to main branch, last one 2 years ago
Paper Lists, Notes and Slides, Focus on NLP. For summarization, please refer to https://github.com/xcfcode/Summarization-Papers
This repository has been archived (exclude archived)
Created 2019-06-05
124 commits to master branch, last one 2 years ago
A Non-Autoregressive End-to-End Text-to-Speech (text-to-wav), supporting a family of SOTA unsupervised duration modelings. This project grows with the research community, aiming to achieve the ultimat...
Created 2022-03-30
4 commits to main branch, last one 2 years ago
Reparameterized Discrete Diffusion Models for Text Generation
Created 2023-02-09
2 commits to main branch, last one about a year ago
[NeurIPS 2024] Code for the paper "Diffusion of Thoughts: Chain-of-Thought Reasoning in Diffusion Language Models"
Created 2024-02-12
3 commits to main branch, last one 10 months ago
PyTorch Implementation of NCSOFT's FastPitchFormant: Source-filter based Decomposed Modeling for Speech Synthesis
Created 2021-06-30
3 commits to main branch, last one 3 years ago
PyTorch Implementation of VAENAR-TTS: Variational Auto-Encoder based Non-AutoRegressive Text-to-Speech Synthesis.
Created 2021-07-15
4 commits to main branch, last one 3 years ago
12
67
unknown
6
A length-controllable and non-autoregressive image captioning model.
Created 2020-03-07
8 commits to master branch, last one 4 years ago
PyTorch Implementation of Google Brain's WaveGrad 2: Iterative Refinement for Text-to-Speech Synthesis
Created 2021-06-18
6 commits to main branch, last one 3 years ago
4
62
unknown
5
A fast speech-to-speech & speech-to-text translation model that supports simultaneous decoding and offers 28× speedup.
Created 2024-06-03
40 commits to main branch, last one 2 months ago
PyTorch Implementation of Daft-Exprt: Robust Prosody Transfer Across Speakers for Expressive Speech Synthesis
Created 2021-08-09
6 commits to main branch, last one 3 years ago
1
34
unknown
2
Codes for our paper "Speculative Decoding: Exploiting Speculative Execution for Accelerating Seq2seq Generation" (EMNLP 2023 Findings)
Created 2022-03-31
41 commits to main branch, last one about a year ago