Eprint already available on another site (E-prints, Working papers and Research blog)
An approach to integrate metagenomics, metatranscriptomics and metaproteomics data in public resources
Wang, Shengbo; Kaur, Satwant; Kumath, Benoit J. et al.
2025
 

Files


Full Text
1258069.pdf
Publisher postprint (837.6 kB) Creative Commons License - Public Domain Dedication
Download

All documents in ORBilu are protected by a user license.

Send to



Details



Keywords :
data integration; microbiome research; multi-omics
Abstract :
[en] The availability of public metaproteomics, metagenomics and metatranscriptomics data in public resources such as MGnify (for metagenomics/metatranscriptomics) and the PRIDE database (for metaproteomics), continues to increase. When these omics techniques are applied to the same samples, their integration offers new opportunities to understand the structure (metagenome) and functional expression (metatranscriptome and metaproteome) of the microbiome. Here, we describe a pilot study aimed at integrating public multi-meta-omics datasets from studies based on human gut and marine hatchery samples. Reference search databases (search DBs) were built using assembled metagenomic (and metatranscriptomic, where available) sequence data followed by de novo gene calling, using both data from the same sampling event and from independent samples. The resulting protein sets were evaluated for their utility in metaproteomics analysis. In agreement with previous studies, the highest number of peptide identifications was generally obtained when using search DBs created from the same samples. Data integration of the multi-omics results was performed in MGnify. For that purpose, the MGnify website was extended to enable the visualisation of the resulting peptide/protein information from three reanalysed metaproteomics datasets. A workflow (https://github.com/PRIDE-reanalysis/MetaPUF) has been developed allowing researchers to perform equivalent data integration, using paired multi-omics datasets. This is the first time that a data integration approach for multi-omics datasets has been implemented from public data available in the world-leading MGnify and PRIDE databases.
Research center :
Luxembourg Centre for Systems Biomedicine (LCSB): Bioinformatics Core (R. Schneider Group)
Luxembourg Centre for Systems Biomedicine (LCSB): Eco-Systems Biology (Wilmes Group)
Disciplines :
Environmental sciences & ecology
Genetics & genetic processes
Author, co-author :
Wang, Shengbo;  European Bioinformatics Institute
Kaur, Satwant;  European Bioinformatics Institute
Kumath, Benoit J.;  University of Luxembourg
MAY, Patrick  ;  University of Luxembourg > Luxembourg Centre for Systems Biomedicine (LCSB) > Bioinformatics Core
Richardson, Lorna;  European Bioinformatics Institute
WILMES, Paul ;  University of Luxembourg > Luxembourg Centre for Systems Biomedicine (LCSB) > Systems Ecology
Finn, Robert D.;  European Bioinformatics Institute
Vizcaino, Juan Antonio ;  European Bioinformatics Institute
Language :
English
Title :
An approach to integrate metagenomics, metatranscriptomics and metaproteomics data in public resources
Publication date :
09 January 2025
Focus Area :
Systems Biomedicine
Development Goals :
3. Good health and well-being
FnR Project :
FNR13684739 - The Dark Metaproteome: Identifying Proteins Of Unknown Function In The Human Gut Microbiome, 2019 (01/04/2020-31/03/2022) - Paul Wilmes
Name of the research project :
R-AGR-3717 - C19/BM/13684739/MetaPUF - WILMES Paul
Funders :
FNR - Fonds National de la Recherche
Funding text :
The authors would like to acknowledge funding from the National Research Fund Luxembourg (FNR) [grant number C19/BM/13684739] and EMBL core funding. We would also like to thank the original researchers who made the datasets available in the public domain.
Available on ORBilu :
since 09 January 2025

Statistics


Number of views
449 (2 by Unilu)
Number of downloads
229 (1 by Unilu)

OpenCitations
 
0
OpenAlex citations
 
0

Bibliography


Similar publications



Contact ORBilu