Joensuu Corpus of Swedish Compounds

Beskrivning

Computer corpus (list) of Swedish compounds in Göteborgs-Posten (a Swedish newspaper) data-base of 24.2 million word tokens originally collected by Elisabeth Ahlsén (Linguistics, Göteborg University) and eventually morphologically tagged by Matti Laine’s and Patrick Virtanen’s WordMill Lexical Search program (Center for Cognitive Neuroscience, U. Turku). about 3800 compound tokens, with their WordMill variables (incl. frequency of use in the Göteborgs-Posten), about 3 person months Relevant publication(s) using the corpus: S. Niemi: Compounds in Swedish. Lingue e Linguaggio 8: 257-269. Part of cross-linguistic study of compounds, co-ordinated by Sergio Scalise (Linguistics, U. Bologna), see http://morbo.lingue.unibo.it/mmm/enlm.php log 25.11.2018 link http://islrn.org/resources/128-829-996-277-5 removed
Visa mer

Publiceringsår

2018

Typ av data

Upphovspersoner

University of Eastern Finland - Upphovsperson

Sinikka Niemi - Kurator

Projekt

Övriga uppgifter

Vetenskapsområden

Språkvetenskaper

Språk

svenska

Öppen tillgång

Begränsad tillgång

Licens

CLARIN RES (Restricted) End User License 1.0

Nyckelord

Ämnesord

Temporal täckning

undefined

Relaterade till denna forskningsdata