Clotho Analysis Set
Beskrivning
This dataset is derived from the evaluation subset of Clotho dataset (https://zenodo.org/doi/10.5281/zenodo.3490683). It is designed to analyze the behavior of the captioning system under certain perturbation in order to try and identify some open challenges in automated audio captioning. The original audio clips are transformed with audio_degrader. The transformations applied are the following: Microphone response simulation Mixup with another clip from the dataset (ratio -6dB, -3dB and 0dB) Additive noise from DESED (ratio -12dB, -6dB, 0dB)
Visa merPubliceringsår
2022
Typ av data
Upphovspersoner
Huang Xie - Upphovsperson
Konstantinos Drossos - Upphovsperson
Samuel Lipping - Upphovsperson
Tuomas Virtanen - Upphovsperson
Unknown organization
Felix Gontier - Upphovsperson
Romain Serizel - Upphovsperson
Zenodo - Utgivare
Projekt
Övriga uppgifter
Vetenskapsområden
Data- och informationsvetenskap
Språk
engelska
Öppen tillgång
Öppet