![]() Kunath, Benoît ![]() ![]() ![]() in Microbiome (2022) Background: Alterations to the gut microbiome have been linked to multiple chronic diseases. However, the drivers of such changes remain largely unknown. The oral cavity acts as a major route of exposure ... [more ▼] Background: Alterations to the gut microbiome have been linked to multiple chronic diseases. However, the drivers of such changes remain largely unknown. The oral cavity acts as a major route of exposure to exogenous factors including pathogens, and processes therein may affect the communities in the subsequent compartments of the gastrointestinal tract. Here, we perform strain‑resolved, integrated meta‑genomic, transcriptomic, and proteomic analyses of paired saliva and stool samples collected from 35 individuals from eight families with multiple cases of type 1 diabetes mellitus (T1DM). Results: We identified distinct oral microbiota mostly reflecting competition between streptococcal species. More specifically, we found a decreased abundance of the commensal Streptococcus salivarius in the oral cavity of T1DM individuals, which is linked to its apparent competition with the pathobiont Streptococcus mutans. The decrease in S. salivarius in the oral cavity was also associated with its decrease in the gut as well as higher abundances in facultative anaerobes including Enterobacteria. In addition, we found evidence of gut inflammation in T1DM as reflected in the expression profiles of the Enterobacteria as well as in the human gut proteome. Finally, we were able to follow transmitted strain‑variants from the oral cavity to the gut at the individual omic levels, highlighting not only the transfer, but also the activity of the transmitted taxa along the gastrointestinal tract. Conclusions: Alterations of the oral microbiome in the context of T1DM impact the microbial communities in the lower gut, in particular through the reduction of “mouth‑to‑gut” transfer of Streptococcus salivarius. Our results indicate that the observed oral‑cavity‑driven gut microbiome changes may contribute towards the inflammatory processes involved in T1DM. Through the integration of multi‑omic analyses, we resolve strain‑variant “mouth‑to‑gut” transfer in a disease context. [less ▲] Detailed reference viewed: 25 (1 UL)![]() Teixeira Queiros, Pedro ![]() Doctoral thesis (2022) Due to technological advances across all scientific domains, data is generated at an extremely fast pace. This is especially true in biology, where advances in computational and sequencing technologies ... [more ▼] Due to technological advances across all scientific domains, data is generated at an extremely fast pace. This is especially true in biology, where advances in computational and sequencing technologies led to the necessity to develop automated methods for data analysis; thus the field of bioinformatics was born. This thesis focuses on one specific field within bioinformatics - functional genomics. To be precise, in the development of techniques and software for the integration of data to generate novel insights. Indeed, as the amount of knowledge increases, so does the need to integrate it systematically. In this context, the work described herein relates to the integration of multiple resources to improve the functional annotation of proteins, which led to the development of two bioinformatic tools - Mantis and UniFunc. For the downstream integration and analysis of functional predictions, a network annotation tool was developed - UniFuncNet, which, together with the previous tools, enables the efficient functional characterisation of individual organisms or communities. [less ▲] Detailed reference viewed: 85 (14 UL)![]() Hickl, Oskar ![]() ![]() ![]() E-print/Working paper (2021) The reconstruction of genomes is a critical step in genome-resolved metagenomics as well as for multi-omic data integration from microbial communities. Here, we present binny, a binning tool that produces ... [more ▼] The reconstruction of genomes is a critical step in genome-resolved metagenomics as well as for multi-omic data integration from microbial communities. Here, we present binny, a binning tool that produces high-quality metagenome-assembled genomes from both contiguous and highly fragmented genomes. Based on established metrics, binny outperforms existing state-of-the-art binning methods and finds unique genomes that could not be detected by other methods.binny uses k-mer-composition and coverage by metagenomic reads for iterative, non-linear dimension reduction of genomic signatures as well as subsequent automated contig clustering with cluster assessment using lineage-specific marker gene sets.When compared to five widely used binning algorithms, binny recovers the most near-complete (\>95 pure, \>90 complete) and high-quality (\>90 pure, \>70 complete) genomes from simulated data sets from the Critical Assessment of Metagenome Interpretation (CAMI) initiative, as well as from a real-world benchmark comprised of metagenomes from various environments. binny is implemented as Snakemake workflow and available from https://github.com/a-h-b/binny.Competing Interest StatementThe authors have declared no competing interest. [less ▲] Detailed reference viewed: 133 (13 UL)![]() Teixeira Queiros, Pedro ![]() ![]() ![]() in GigaScience (2021), 10(6), The rapid development of the (meta-)omics fields has produced an unprecedented amount of high-resolution and high-fidelity data. Through the use of these datasets we can infer the role of previously ... [more ▼] The rapid development of the (meta-)omics fields has produced an unprecedented amount of high-resolution and high-fidelity data. Through the use of these datasets we can infer the role of previously functionally unannotated proteins from single organisms and consortia. In this context, protein function annotation can be described as the identification of regions of interest (i.e., domains) in protein sequences and the assignment of biological functions. Despite the existence of numerous tools, challenges remain in terms of speed, flexibility, and reproducibility. In the big data era, it is also increasingly important to cease limiting our findings to a single reference, coalescing knowledge from different data sources, and thus overcoming some limitations in overly relying on computationally generated data from single sources.We implemented a protein annotation tool, Mantis, which uses database identifiers intersection and text mining to integrate knowledge from multiple reference data sources into a single consensus-driven output. Mantis is flexible, allowing for the customization of reference data and execution parameters, and is reproducible across different research goals and user environments. We implemented a depth-first search algorithm for domain-specific annotation, which significantly improved annotation performance compared to sequence-wide annotation. The parallelized implementation of Mantis results in short runtimes while also outputting high coverage and high-quality protein function annotations.Mantis is a protein function annotation tool that produces high-quality consensus-driven protein annotations. It is easy to set up, customize, and use, scaling from single genomes to large metagenomes. Mantis is available under the MIT license at https://github.com/PedroMTQ/mantis. [less ▲] Detailed reference viewed: 104 (12 UL)![]() Teixeira Queiros, Pedro ![]() ![]() ![]() in Biological Chemistry (2021) A common approach to genome annotation involves the use of homology-based tools for the prediction of the functional role of proteins. The quality of functional annotations is dependent on the reference ... [more ▼] A common approach to genome annotation involves the use of homology-based tools for the prediction of the functional role of proteins. The quality of functional annotations is dependent on the reference data used, as such, choosing the appropriate sources is crucial. Unfortunately, no single reference data source can be universally considered the gold standard, thus using multiple references could potentially increase annotation quality and coverage. However, this comes with challenges, particularly due to the introduction of redundant and exclusive annotations. Through text mining it is possible to identify highly similar functional descriptions, thus strengthening the confidence of the final protein functional annotation and providing a redundancy-free output. Here we present UniFunc, a text mining approach that is able to detect similar functional descriptions with high precision. UniFunc was built as a small module and can be independently used or integrated into protein function annotation pipelines. By removing the need to individually analyse and compare annotation results, UniFunc streamlines the complementary use of multiple reference datasets. [less ▲] Detailed reference viewed: 186 (3 UL)![]() ; Kunath, Benoît ![]() in Nature Communications (2021), 12(1), 7305 Abstract Metaproteomics has matured into a powerful tool to assess functional interactions in microbial communities. While many metaproteomic workflows are available, the impact of method choice on ... [more ▼] Abstract Metaproteomics has matured into a powerful tool to assess functional interactions in microbial communities. While many metaproteomic workflows are available, the impact of method choice on results remains unclear. Here, we carry out a community-driven, multi-laboratory comparison in metaproteomics: the critical assessment of metaproteome investigation study (CAMPI). Based on well-established workflows, we evaluate the effect of sample preparation, mass spectrometry, and bioinformatic analysis using two samples: a simplified, laboratory-assembled human intestinal model and a human fecal sample. We observe that variability at the peptide level is predominantly due to sample processing workflows, with a smaller contribution of bioinformatic pipelines. These peptide-level differences largely disappear at the protein group level. While differences are observed for predicted community composition, similar functional profiles are obtained across workflows. CAMPI demonstrates the robustness of present-day metaproteomics research, serves as a template for multi-laboratory studies in metaproteomics, and provides publicly available data sets for benchmarking future developments. [less ▲] Detailed reference viewed: 66 (0 UL) |
||