9 results found Sort:

Audio Captioning datasets for PyTorch.
Created 2022-05-19
13 commits to main branch, last one 8 months ago
13
95
unknown
6
Using pretrained encoder and language models to generate captions from multimedia inputs.
Created 2022-01-23
507 commits to main branch, last one about a year ago
Source code for "MusCaps: Generating Captions for Music Audio" (IJCNN 2021)
Created 2021-04-19
14 commits to main branch, last one 2 years ago
Song Describer is a data collection platform for annotating music with textual descriptions.
Created 2022-11-21
27 commits to main branch, last one 5 months ago
Code base for WaveTransformer: A novel architecture for automated audio captioning
Created 2020-10-11
77 commits to main branch, last one 3 years ago
Metrics for evaluating Automated Audio Captioning systems, designed for PyTorch.
Created 2022-09-20
18 commits to main branch, last one 8 months ago
Tracking states of the arts and recent results (bibliography) on sound tasks.
Created 2022-12-16
11 commits to main branch, last one about a year ago
PyTorch implementation of the ICASSP-24 paper: "Improving Audio Captioning Models with Fine-grained Audio Features, Text Embedding Supervision, and LLM Mix-up Augmentation"
Created 2024-01-04
9 commits to main branch, last one 10 months ago