25 results found
Implementation of SoundStorm, Efficient Parallel Audio Generation from Google Deepmind, in Pytorch
Created
2023-05-17
94 commits to main branch, last one about a month ago
StreamSpeech is an “All in One” seamless model for offline and simultaneous speech recognition, speech translation and speech synthesis.
Created
2024-06-04
25 commits to main branch, last one 3 months ago
[ICASSP 2024] 🍵 Matcha-TTS: A fast TTS architecture with conditional flow matching
Created
2023-09-05
115 commits to main branch, last one 19 days ago
PyTorch Implementation of PortaSpeech: Portable and High-Quality Generative Text-to-Speech
Created
2021-10-06
16 commits to main branch, last one 2 years ago
A Non-Autoregressive Transformer based Text-to-Speech, supporting a family of SOTA transformers with supervised and unsupervised duration modelings. This project grows with the research community, aim...
Created
2021-08-24
10 commits to main branch, last one 2 years ago
PyTorch Implementation of DiffGAN-TTS: High-Fidelity and Efficient Text-to-Speech with Denoising Diffusion GANs
Created
2022-02-14
3 commits to main branch, last one 2 years ago
PyTorch Implementation of Non-autoregressive Expressive (emotional, conversational) TTS based on FastSpeech2, supporting English, Korean, and your own languages.
Created
2021-04-26
9 commits to main branch, last one 3 years ago
PyTorch implementation of DiffSinger: Singing Voice Synthesis via Shallow Diffusion Mechanism (focused on DiffSpeech)
Created
2021-06-03
3 commits to main branch, last one 2 years ago
Official repository of DailyTalk: Spoken Dialogue Dataset for Conversational Text-to-Speech, ICASSP 2023
Created
2022-06-02
10 commits to main branch, last one about a year ago
PyTorch Implementation of Meta-StyleSpeech: Multi-Speaker Adaptive Text-to-Speech Generation
Created
2021-06-09
17 commits to main branch, last one 2 years ago
PyTorch Implementation of Google's Parallel Tacotron 2: A Non-Autoregressive Neural TTS Model with Differentiable Duration Modeling
Created
2021-04-30
5 commits to main branch, last one 3 years ago
PyTorch Implementation of ByteDance's Cross-speaker Emotion Transfer Based on Speaker Condition Layer Normalization and Semi-Supervised Training in Text-To-Speech
Created
2021-10-16
7 commits to main branch, last one 2 years ago
Paper Lists, Notes and Slides, Focus on NLP. For summarization, please refer to https://github.com/xcfcode/Summarization-Papers
This repository has been archived
Created
2019-06-05
124 commits to master branch, last one 2 years ago
A Non-Autoregressive End-to-End Text-to-Speech (text-to-wav), supporting a family of SOTA unsupervised duration modelings. This project grows with the research community, aiming to achieve the ultimat...
Created
2022-03-30
4 commits to main branch, last one 2 years ago
Reparameterized Discrete Diffusion Models for Text Generation
Created
2023-02-09
2 commits to main branch, last one about a year ago
[NeurIPS 2024] Code for the paper "Diffusion of Thoughts: Chain-of-Thought Reasoning in Diffusion Language Models"
Created
2024-02-12
3 commits to main branch, last one 10 months ago
[AAAI 2024] GLOP: Learning Global Partition and Local Construction for Solving Large-scale Routing Problems in Real-time
Topics: transformer, divide-and-conquer, non-autoregressive, graph-neural-networks, reinforcement-learning, vehicle-routing-problem, deep-reinforcement-learning, travelling-salesman-problem, autoregressive-neural-networks, neural-combinatorial-optimization, capacitated-vehicle-routing-problem, hierarchical-reinforcement-learning, prize-collecting-travelling-salesman-problem
Created
2022-09-13
136 commits to master branch, last one about a month ago
PyTorch Implementation of NCSOFT's FastPitchFormant: Source-filter based Decomposed Modeling for Speech Synthesis
Created
2021-06-30
3 commits to main branch, last one 3 years ago
PyTorch Implementation of VAENAR-TTS: Variational Auto-Encoder based Non-AutoRegressive Text-to-Speech Synthesis.
Created
2021-07-15
4 commits to main branch, last one 3 years ago
A length-controllable and non-autoregressive image captioning model.
Created
2020-03-07
8 commits to master branch, last one 4 years ago
PyTorch Implementation of Google Brain's WaveGrad 2: Iterative Refinement for Text-to-Speech Synthesis
Created
2021-06-18
6 commits to main branch, last one 3 years ago
A fast speech-to-speech & speech-to-text translation model that supports simultaneous decoding and offers 28× speedup.
Created
2024-06-03
40 commits to main branch, last one 2 months ago
PyTorch Implementation of Daft-Exprt: Robust Prosody Transfer Across Speakers for Expressive Speech Synthesis
Created
2021-08-09
6 commits to main branch, last one 3 years ago
Code for our paper "Speculative Decoding: Exploiting Speculative Execution for Accelerating Seq2seq Generation" (EMNLP 2023 Findings)
Created
2022-03-31
41 commits to main branch, last one about a year ago
awesome-LLM-controlled-constrained-generation
Created
2024-06-20
47 commits to main branch, last one 4 months ago