Clotho Analysis Set

Beskrivning

This dataset is derived from the evaluation subset of Clotho dataset (https://zenodo.org/doi/10.5281/zenodo.3490683). It is designed to analyze the behavior of the captioning system under certain perturbation in order to try and identify some open challenges in automated audio captioning. The original audio clips are transformed with audio_degrader. The transformations applied are the following: Microphone response simulation Mixup with another clip from the dataset (ratio -6dB, -3dB and 0dB) Additive noise from DESED (ratio -12dB, -6dB, 0dB)
Visa mer

Publiceringsår

2022

Typ av data

Upphovspersoner

Huang Xie - Upphovsperson

Konstantinos Drossos - Upphovsperson

Samuel Lipping - Upphovsperson

Tuomas Virtanen - Upphovsperson

Unknown organization

Felix Gontier - Upphovsperson

Romain Serizel - Upphovsperson

Zenodo - Utgivare

Projekt

Övriga uppgifter

Vetenskapsområden

Data- och informationsvetenskap

Språk

engelska

Öppen tillgång

Öppet

Licens

Creative Commons Attribution 4.0 International (CC BY 4.0)

Nyckelord

Computer and information sciences

Ämnesord

Temporal täckning

undefined

Relaterade till denna forskningsdata