The Suomi 24 Sentences Corpus 2001-2020, Korp version 1.1 (release candidate)
Beskrivning
This content is available in Kielipankki.
Please note that the corpus is a release candidate, so it may still change.
This collection contains two sets of Suomi24 data: "The Suomi24 Sentences Corpus 2001-2017, Korp version" and "The Suomi24 Sentences Corpus 2018-2020, Korp version".
Together, the two corpora cover all the discussion forums of the Suomi24 online social networking website from 1st January 2001 to 31st December 2020.
Updates:
2025-04-11: For version 1.1 the data has been updated with annotations of names recognized with FiNER 1.6 and languages of sentences identified with HeLI-OTS 2.0.
For further details on various versions of Suomi 24, see the resource group page at http://urn.fi/urn:nbn:fi:lb-2017021630.
Visa merPubliceringsår
2021
Typ av data
Upphovspersoner
City Digital Group - Upphovsperson
Helsingfors universitet - Utgivare
User support FIN-CLARIN - Kurator
Projekt
Övriga uppgifter
Vetenskapsområden
Språkvetenskaper
Språk
finska
Öppen tillgång
Begränsad tillgång