Corpus of Contemporary American English - Kielipankki VRT version 2020

Description

The resource is available for download from Kielipankki, the Language Bank of Finland. This most recent version of the Corpus of Contemporary American English (COCA), released in March 2020, contains 1 billion words in 485,000 texts from the years 1990-2019. The corpus is evenly divided among eight genres: spoken, fiction, magazine, newspaper, academic, blogs, web pages and TV/movie subtitles (~125 million words each). It is related to many other corpora of English, formerly known as the "BYU Corpora". This version of the resource is in the VRT format. License details: Researchers in the FIN-CLARIN member organizations can obtain access to the full data set by submitting an application and a research plan via Language Bank Rights, https://lbr.csc.fi. General terms and conditions: please see https://www.corpusdata.org/restrictions.asp.

Year of publication

2023

Data type

Creators

FIN-CLARIN - Curator

Project

Other information

Fields of science

Linguistics

Language

English

Open access

Restricted access

License

CLARIN RES (Restricted) End User License 1.0

Keywords

Subject headings

Temporal coverage


Related to this research data