Behind the words: Deep neural models of language meaning for industry-grade applications
Acronym
BehindTheWords
Description of the grant
Many language technology applications need to model the meaning of statements independently of their exact wording. The sentences “Can you corroborate that?” and “I need proofs.” share no words, yet they mean almost the same thing. Being able to model this relation would enable considerable improvements in document search, clustering and analysis, language generation, and other applications that deal with meaning rather than surface form. Although the task is easy for a human, computational models of paraphrase are still weak and mostly unavailable for Finnish, which holds back many advanced language technology applications in Finnish industry. We aim to address this gap by developing the necessary deep learning models and, in collaboration with industry, piloting several applications. To achieve this goal, the project will also develop a unique paraphrase dataset. The project is a collaboration between the natural language processing groups in Turku and Helsinki.
Start year
2021
End year
2024
Granted funding
Role in the Academy of Finland consortium
Other parties in the consortium
Funder
Academy of Finland
Type of funding
Targeted Academy project
Other information
Funding decision number
335966
Fields of science
Computer and information sciences
Research fields
Computational data analysis
Identified themes
languages, speech, linguistics