Structural correctness in metagenomics assembly

Bidragets beskrivning

The amount of sequencing data has increased enormously in the last decade. To analyse the data efficiently, it needs to be assembled to genomes or represented in a compact manner. However, current tools for assembly and compaction of sequencing data only output sequences with no estimates of their correctness which severely hampers accurate estimation of the correctness of downstream analysis. We will develop models to estimate the structural correctness of sequence reconstructions. We will provide for each substring of the sequences the probability that it occurs in the underlying sample. We will consider assembling one genome or a mixture of genomes and compacting sequencing data. Our methods will enable assessing the correctness of downstream genomic analysis accurately and to direct validation efforts to uncertain regions of reconstructed sequences.
Visa mer

Startår

2025

Slutår

2029

Beviljade finansiering

Leena Salmela Orcid -palvelun logo
599 999 €

Finansiär

Finlands Akademi

Typ av finansiering

Akademiprojekt

Beslutfattare

Forskningsrådet för naturvetenskap och teknik
12.06.2025

Övriga uppgifter

Finansieringsbeslutets nummer

370538

Vetenskapsområden

Biomedicinska vetenskaper

Forskningsområden

Systeemibiologia, bioinformatiikka

Identifierade teman

genes, genetics