The Suomi24 Sentences Corpus 2001-2023, Korp version

Beskrivning

This resource is available via Korp in Kielipankki – the Language Bank of Finland. This collection contains two sets of Suomi24 data: "The Suomi24 Sentences Corpus 2001-2020, Korp version 1.1" and "The Suomi24 Sentences Corpus 2021-2023, Korp version". Together, the two corpora cover all the discussion forums of the Suomi24 online social networking website from 1st January 2001 to 31st December 2023. The data is enriched with annotations of names recognized with FiNER 1.6 and languages of sentences identified with HeLI-OTS 2.0. For further details on various versions of Suomi 24, see the resource group page at http://urn.fi/urn:nbn:fi:lb-2017021630.
Visa mer

Publiceringsår

2025

Typ av data

Upphovspersoner

City Digital Group - Upphovsperson

User support FIN-CLARIN - Kurator

Projekt

Övriga uppgifter

Vetenskapsområden

Språkvetenskaper

Språk

finska

Öppen tillgång

Begränsad tillgång

Licens

Creative Commons Attribution NonCommercial 2.0 Generic (CC BY NC 2.0

Nyckelord

Ämnesord

Temporal täckning

undefined

Relaterade till denna forskningsdata