Projects per year
Abstract
MOTIVATION: Despite recent advancements in sequencing technologies and assembly methods, obtaining high-quality microbial genomes from metagenomic samples is still not a trivial task. Current metagenomic binners do not take full advantage of assembly graphs and are not optimized for long-read assemblies. Deep graph learning algorithms have been proposed in other fields to deal with complex graph data structures. The graph structure generated during the assembly process could be integrated with contig features to obtain better bins with deep learning.
RESULTS: We propose GraphMB, which uses graph neural networks to incorporate the assembly graph into the binning process. We test GraphMB on long-read datasets of different complexities, and compare the performance with other binners in terms of the number of High Quality (HQ) genome bins obtained. With our approach, we were able to obtain unique bins on all real datasets, and obtain more bins on most datasets. In particular, we obtained on average 17.5% more HQ bins when compared with state-of-the-art binners and 13.7% when aggregating the results of our binner with the others. These results indicate that a deep learning model can integrate contig-specific and graph-structure information to improve metagenomic binning.
AVAILABILITY AND IMPLEMENTATION: GraphMB is available from https://github.com/MicrobialDarkMatter/GraphMB.
SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
Original language | English |
---|---|
Journal | Bioinformatics |
Volume | 38 |
Issue number | 19 |
Pages (from-to) | 4481-4487 |
Number of pages | 7 |
ISSN | 1367-4803 |
DOIs | |
Publication status | Published - Oct 2022 |
Bibliographical note
© The Author(s) 2022. Published by Oxford University Press.Keywords
- Algorithms
- Genome, Microbial
- Metagenome
- Metagenomics/methods
- Sequence Analysis, DNA/methods
Fingerprint
Dive into the research topics of 'Metagenomic binning with assembly graph embeddings'. Together they form a unique fingerprint.-
Poul Due Jensen Professorate in Big Data and Artificial Intelligence
Hose, K. (PI), Jendal, T. E. (Project Participant) & Hansen, E. R. (Project Participant)
01/11/2019 → 31/12/2025
Project: Research
-
Microflora Danica: The Microbiome of Denmark
Albertsen, M. (PI), Nielsen, P. H. (PI), Jensen, T. B. N. (Project Participant), Sørensen, E. A. (Project Participant), Sereika, M. (Project Participant), Dottorini, G. (Project Participant), Petriglieri, F. (Project Participant), Jørgensen, V. R. (Project Participant), Kirkegaard, R. H. (Project Participant), Yang, Y. (Project Participant), Karst, S. M. (Project Participant), Giguere, A. T. (Project Participant), Knutsson, S. (Project Participant), Singleton, C. M. (Project Participant), Dueholm, M. K. D. (Project Participant), Mølvang Dall, S. (Project Participant), Kristensen, J. M. (Project Participant) & Delogu, F. (Project Participant)
01/01/2019 → 31/12/2025
Project: Research
-
Data Science meets Microbial Dark Matter
Albertsen, M. (PI), Hose, K. (PI), Nielsen, T. D. (PI), Lamurias, A. (Project Participant) & Mølvang Dall, S. (Project Participant)
Danish E-infrastructure Cooperation, Villum Foundation
01/01/2021 → 31/12/2023
Project: Research