Christmas Gospel text-to-speech in four Uralic languages, source

Beskrivning

This resource is available for download in Kielipankki – the Language Bank of Finland. This resource consists of .txt and .wav files in four languages pertaining to the Finnish Christmas Gospel verses Luke 2. 1–20 The four languages include Komi-Zyrian (kpv), Erzya (myv), Karelian (krl) and Olonets-Karelian (olo, aka Livvi), whose texts are all included in the Parallel Biblical Verses for Uralic Studies (PaBiVUS) corpus (LUK.2.1–20). The .wav files have been produced by Aleksei Ivanov, Niko Partanen and Jack Rueter, who have made use of the Facebook MMS (https://ai.meta.com/blog/multilingual-model-speech-recognition/) with language support for hundreds of languages (https://huggingface.co/facebook/mms-tts#supported-languages). While the Komi-Zyrian (kpv), Erzya (myv), Karelian (krl) are reported as having coverage for ASR, TTS and LID, Olonets-Karelian (olo, aka Livvi) is not mentioned at all. The Olonets-Karelian .wav has been produced using the krl TTS, as both language forms use writing systems with basically the same character to sound correlation.
Visa mer

Publiceringsår

2023

Typ av data

Upphovspersoner

Raamatunkäännösinstituutti ry - Upphovsperson

Projekt

Övriga uppgifter

Vetenskapsområden

Språkvetenskaper

Språk

karelska, Komi-Yodzyak language, erjya, Livvi-Karelian language, Kompane

Öppen tillgång

Öppet

Licens

Creative Commons Attribution NonCommercial 4.0 International (CC BY NC 4.0)

Nyckelord

Ämnesord

Temporal täckning

undefined

Relaterade till denna forskningsdata