[en] The reconstruction of genomes is a critical step in genome-resolved metagenomics as well as for multi-omic data integration from microbial communities. Here, we present binny, a binning tool that produces high-quality metagenome-assembled genomes from both contiguous and highly fragmented genomes. Based on established metrics, binny outperforms existing state-of-the-art binning methods and finds unique genomes that could not be detected by other methods.binny uses k-mer-composition and coverage by metagenomic reads for iterative, non-linear dimension reduction of genomic signatures as well as subsequent automated contig clustering with cluster assessment using lineage-specific marker gene sets.When compared to five widely used binning algorithms, binny recovers the most near-complete (\>95 pure, \>90 complete) and high-quality (\>90 pure, \>70 complete) genomes from simulated data sets from the Critical Assessment of Metagenome Interpretation (CAMI) initiative, as well as from a real-world benchmark comprised of metagenomes from various environments. binny is implemented as Snakemake workflow and available from https://github.com/a-h-b/binny.Competing Interest StatementThe authors have declared no competing interest.
Centre de recherche :
- Luxembourg Centre for Systems Biomedicine (LCSB): Bioinformatics Core (R. Schneider Group) - Luxembourg Centre for Systems Biomedicine (LCSB): Eco-Systems Biology (Wilmes Group)
Disciplines :
Génétique & processus génétiques Sciences de l’environnement & écologie
Auteur, co-auteur :
HICKL, Oskar ; University of Luxembourg > Luxembourg Centre for Systems Biomedicine (LCSB) > Bioinformatics Core
TEIXEIRA QUEIROS, Pedro ; University of Luxembourg > Luxembourg Centre for Systems Biomedicine (LCSB) > Systems Ecology
WILMES, Paul ; University of Luxembourg > Luxembourg Centre for Systems Biomedicine (LCSB) > Systems Ecology
MAY, Patrick ; University of Luxembourg > Luxembourg Centre for Systems Biomedicine (LCSB) > Bioinformatics Core
Heintz-Buschart, Anna
Langue du document :
Anglais
Titre :
binny: an automated binning algorithm to recover high-quality genomes from complex metagenomic datasets 2021.12.22.473795