Plenary Sessions of the Parliament of Finland, Kielipankki Korp Version 1.5

Beskrivning

This corpus version is available in Kielipankki, the Language Bank of Finland (Korp service), see Access location. The corpus contains a slightly modified version of the transcriptions of of the plenary sessions of the Parliament of Finland from 10.09.2008 to 1.7.2016. The transcripts have been aligned with the audio tracks of the video recordings of the original sessions. Each speaker's speech has been separately aligned. The alignment is based on the output of a set of automatic tools and it was provided by Aalto University. In this updated version, links have been added from each utterance in the transcript to the corresponding portion of the video recording. Video links can be found from the search results in Korp. In addition, for backward compatibility, some search results in Korp have a link to the LAT version of the session in case it exists. Please note that the LAT version may not be available in future versions of the corpus. However, the original transcription files will continue to be available in the downloadable version of the corpus (see Relation for the link to the download version). Please note that the aligned transcript may contain errors and some superfluous tags may have been inserted in the text due to the automatic alignment and speech recognition process. For portions where the original audio track did not have matching text in the transcript, the speech signal was recognized automatically using a Finnish language model, and such portions may contain strange or erroneous content. The text in the transcripts has been parsed automatically using a Finnish language model. This is why the part-of-speech of word tokens in the Swedish portions within the transcripts has usually been marked as 'foreign word'. In the search results of this corpus version in Korp, there are links to the original authoritative session transcripts as well as to the original video streams that are provided by the Parliament of Finland.
Visa mer

Publiceringsår

2019

Typ av data

Upphovspersoner

The Parliament of Finland - Upphovsperson

University of Helsinki - Kurator

Projekt

Övriga uppgifter

Vetenskapsområden

Språkvetenskaper

Språk

finska, svenska

Öppen tillgång

Öppet

Licens

CLARIN PUB (Public) End User License 1.0

Nyckelord

Ämnesord

Temporal täckning

undefined

Relaterade till denna forskningsdata