AVID: Aalto Vocal Intensity Database
Description
Data description: AVID includes speech and electroglottography (EGG) signals produced by 50 speakers (25 male, 25 female) who varied their vocal intensity across four categories (soft, normal, loud, and very loud). Recordings were made at a constant mouth-to-microphone distance, and a calibration tone was also recorded. The speech data was labeled sentence-wise using a total of 19 labels, which support the use of the data in supervised machine learning studies of vocal intensity. Further information can be found in the 'readme.docx' file included in the upload. Data collection: The data was collected in 2021.
Citation: P. Alku, M. Kodali, L. Laaksonen, S.R. Kadiri, AVID: A speech database for machine learning studies on vocal intensity, Speech Communication, Vol. 157, Article 103039, 2024. https://doi.org/10.1016/j.specom.2024.103039
Publication year
2024
Type of data
Creators
Zenodo - Publisher
Project
Other information
Fields of science
Language
Open access
Open