10 results found Sort:
- Filter by Primary Language:
- Python (8)
- TeX (1)
- +
NISQA - Non-Intrusive Speech Quality and TTS Naturalness Assessment
Created
2019-02-14
61 commits to master branch, last one 21 days ago
ICASSP 2022: "Text2Video: text-driven talking-head video synthesis with phonetic dictionary".
Created
2021-04-05
50 commits to main branch, last one about a year ago
ICASSP 2023-2024 Papers: A complete collection of influential and exciting research papers from the ICASSP 2023-24 conferences. Explore the latest advancements in acoustics, speech and signal processi...
asr
vad
icassp
denoising
icassp2023
icassp2024
face-recognition
image-generation
keyword-spotting
music-generation
domain-adaptation
generative-models
language-modeling
signal-processing
signal-restoration
speech-recognition
multimodal-learning
semantic-segmentation
self-supervised-learning
spoken-language-understanding
Created
2023-08-01
975 commits to main branch, last one a day ago
Code & Data for "Tabular Transformers for Modeling Multivariate Time Series" (ICASSP, 2021)
Created
2020-10-20
21 commits to main branch, last one 2 years ago
Reading list for research topics in Sound AI
Created
2020-11-28
62 commits to main branch, last one 4 months ago
[ICASSP 2023] Official Tensorflow implementation of "Temporal Modeling Matters: A Novel Temporal Emotional Modeling Approach for Speech Emotion Recognition".
Created
2022-11-29
52 commits to main branch, last one 7 months ago
This repository contains code to replicate results from the ICASSP 2020 paper "StarGAN for Emotional Speech Conversion: Validated by Data Augmentation of End-to-End Emotion Recognition".
Created
2020-02-09
38 commits to master branch, last one 3 years ago
The repository provides links to collections of influential and interesting research papers from top AI conferences, with open-source code to promote reproducibility and provide detailed implementatio...
Created
2023-08-01
72 commits to main branch, last one 7 months ago
This repository is the implementation of the HiPAMA architecture, introduced in the paper, Hierarchical Pronunciation Assessment with Multi-Aspect Attention (ICASSP 2023).
Created
2023-09-23
10 commits to main branch, last one 7 months ago