PyTorch implementation of the ICASSP-24 paper: "Improving Audio Captioning Models with Fine-grained Audio Features, Text Embedding Supervision, and LLM Mix-up Augmentation"
Created 2024-01-04
9 commits to main branch, last one 5 months ago