Search Results - RepositoryStats

24

420

unknown

10

PyTorch implementation of Audio Flamingo 2: An Audio-Language Model with Long-Audio Understanding and Expert Reasoning Abilities.

audio-reasoning audio-captioning audio-language-models audio-question-answering multimodal-large-language-models

Created 2024-05-20

19 commits to main branch, last one 14 days ago

awesome-sound_event_detection soham97

10

179

mit

8

Reading list for research topics in Sound AI

icassp interspeech audio-retrieval audio-captioning audio-generation audio-processing zero-shot-learning sound-event-detection representation-learning acoustic-scene-classification

Created 2020-11-28

62 commits to main branch, last one 7 months ago

aac-datasets Labbeti

6

115

mit

2

Audio Captioning datasets for PyTorch.

audio caption dataset pytorch datasets captioning deep-learning audio-captioning

Created 2022-05-19

13 commits to main branch, last one about a year ago

ClipCap TheoCoombes

13

94

unknown

5

Using pretrained encoder and language models to generate captions from multimedia inputs.

vqa language-model encoder-decoder audio-captioning image-captioning vision-transformer

Created 2022-01-23

507 commits to main branch, last one 2 years ago

muscaps ilaria-manco

7

80

gpl-3.0

5

Source code for "MusCaps: Generating Captions for Music Audio" (IJCNN 2021)

mir audio-captioning multimodal-deep-learning music-information-retrieval

Created 2021-04-19

16 commits to main branch, last one 3 months ago

song-describer ilaria-manco

5

57

mit

4

Song Describer is a data collection platform for annotating music with textual descriptions.

annotations music-dataset data-collection audio-captioning

Created 2022-11-21

29 commits to main branch, last one 3 months ago

aac-metrics Labbeti

3

43

mit

2

Metrics for evaluating Automated Audio Captioning systems, designed for PyTorch.

text audio metrics captioning audio-captioning

Created 2022-09-20

19 commits to main branch, last one 2 months ago

wavetransformer an-tran528

9

43

other

1

Code base for WaveTransformer: A novel architecture for automated audio captioning

audio-captioning

Created 2020-10-11

77 commits to main branch, last one 4 years ago

beats-conformer-bart-audio-captioner slSeanWU

1

36

apache-2.0

2

PyTorch implementation of the ICASSP-24 paper: "Improving Audio Captioning Models with Fine-grained Audio Features, Text Embedding Supervision, and LLM Mix-up Augmentation"

pytorch transformers clotho-dataset dcase-challenge audio-captioning

Created 2024-01-04

9 commits to main branch, last one about a year ago

sound_ai_progress soham97

1

32

unknown

5

Tracking states of the arts and recent results (bibliography) on sound tasks.

audio-retrieval audio-captioning audio-generation audio-processing music-classification sound-event-detection acoustic-scene-classification

Created 2022-12-16

11 commits to main branch, last one 2 years ago