NanoBaseLib: A Multi-Task Benchmark Dataset for Nanopore Sequencing

Beskrivning

NanoBaseLib is a multi-task benchmark dataset for Nanopore Sequencing. We compile and preprocess publicly available datasets using a unified pipeline to ensure consistency and quality across all tasks. The dataset is benchmarked for four key Nanopore sequencing tasks: base calling, polyA detection, segmentation and event alignment, and RNA modification detection. NanoBaseLib is available at https://nanobaselib.github.io.
Visa mer

Publiceringsår

2024

Typ av data

Upphovspersoner

Department of Computer Science

Chengbo Fu - Upphovsperson

Guangzhao Cheng Orcid -palvelun logo - Upphovsperson

Lu Cheng Orcid -palvelun logo - Upphovsperson

University of Eastern Finland - Medarbetare

Zenodo - Utgivare

Projekt

Övriga uppgifter

Vetenskapsområden

Data- och informationsvetenskap

Språk

Öppen tillgång

Öppet

Licens

Creative Commons Attribution 4.0 International (CC BY 4.0)

Nyckelord

Ämnesord

Temporal täckning

undefined

Relaterade till denna forskningsdata