Behind the words: Deep neural models of language meaning for industry-grade applications

Akronym

BehindTheWords

Bidragets beskrivning

Many language technology applications require the ability to model the meaning of language statements, regardless of their exact wording. The sentences “Can you corroborate that?” and “I need proofs.” have no common words, yet their meaning is almost the same. Being able to model this relation would enable considerable improvements in document search, clustering and analysis, language generation, and other applications which deal with the meaning, rather than the surface form. While easy for a human, the computational models of paraphrase are still weak, and mostly unavailable for Finnish, preventing many advanced language technology applications in the Finnish industry. We aim to address this gap and develop the deep learning models and, in collaboration with the industry, pilot several applications. To achieve this goal, the project will also develop a unique paraphrase dataset. The project is a collaboration between the natural language processing groups in Turku and Helsinki.
Visa mer

Startår

2021

Slutår

2024

Beviljade finansiering

Filip Ginter Orcid -palvelun logo
307 584 €


Rollen i Finlands Akademis konsortium

Övriga parter i konsortiet

Partner
Helsingfors universitet (335967)
307 688 €

Finansiär

Finlands Akademi

Typ av finansiering

Akademiprojekt med särskild inriktning

Övriga uppgifter

Finansieringsbeslutets nummer

335966

Vetenskapsområden

Data- och informationsvetenskap

Forskningsområden

Laskennallinen data-analyysi

Identifierade teman

languages, speech, linguistics