Turku Dependency Treebank

Beskrivning

The Turku Dependency Treebank team is building a broad-coverage dependency-annotated treebank of general Finnish. The treebank is annotated in a minor revision of the Stanford dependency scheme (de Marneffe et al. [1,2]). The primary purpose of the treebank is to support Finnish NLP. The release currently available for download (as of July 2013) comprises 678 documents in the publicly available set and 76 in the held-out test set. The syntax annotation is complete with this release. PropBank-style annotation of TDT is currently in progress. The treebank can be downloaded at http://bionlp.utu.fi/fintreebank.html in an XML format as well as the CoNLL-X format. The complete list of IPR holders is available at http://bionlp.utu.fi/static/fintreebank-online/index.html. Download location: http://bionlp.utu.fi/fintreebank-download.html. log 26.11.2018 link http://islrn.org/resources/530-139-472-864-3 removed
Visa mer

Publiceringsår

2018

Typ av data

Upphovspersoner

University of Turku

Filip Ginter - Kurator, Upphovsperson

Katri Haverinen - Kurator, Upphovsperson, Utgivare

Jenna Nyblom - Upphovsperson

Samuel Kohonen - Upphovsperson

Timo Viljanen - Upphovsperson

Veronika Laippala - Upphovsperson

Projekt

Övriga uppgifter

Vetenskapsområden

Språkvetenskaper

Språk

finska

Öppen tillgång

Öppet

Licens

Creative Commons Attribution ShareAlike 4.0 International (CC BY SA 4.0)

Nyckelord

Ämnesord

Temporal täckning

undefined

Relaterade till denna forskningsdata