Article (Scientific journals)
An Approach to Integrate Metagenomics, Metatranscriptomics and Metaproteomics Data in Public Data Resources.
Wang, Shengbo; Kaur, Satwant; Kunath, Benoit J et al.
2025, in Proteomics, p. e202500002
Peer Reviewed verified by ORBi
 

Files


Full Text
Proteomics - 2025 - Wang - An Approach to Integrate Metagenomics Metatranscriptomics and Metaproteomics Data in Public.pdf
Publisher postprint (1.82 MB); Creative Commons License - Public Domain Dedication

Details



Keywords :
data integration; data workflow; metagenomics; metaproteomics; metatranscriptomics
Abstract :
The availability of metaproteomics, metagenomics and metatranscriptomics data in public resources such as MGnify (for metagenomics/metatranscriptomics) and the PRIDE database (for metaproteomics) continues to increase. When these omics techniques are applied to the same samples, their integration offers new opportunities to understand the structure (metagenome) and functional expression (metatranscriptome and metaproteome) of the microbiome. Here, we describe a pilot study aimed at integrating public multi-meta-omics datasets from studies based on human gut and marine hatchery samples. Reference search databases (search DBs) were built using assembled metagenomic (and metatranscriptomic, where available) sequence data followed by de novo gene calling, using data both from the same sampling event and from independent samples. The resulting protein sets were evaluated for their utility in metaproteomics analysis. In agreement with previous studies, the highest number of peptide identifications was generally obtained when using search DBs created from the same samples. Data integration of the multi-omics results was performed in MGnify. For that purpose, the MGnify website was extended to enable the visualisation of the resulting peptide/protein information from three reanalysed metaproteomics datasets. A workflow (https://github.com/PRIDE-reanalysis/MetaPUF) has been developed allowing researchers to perform equivalent data integration using paired multi-omics datasets. This is the first time that a data integration approach for multi-omics datasets has been implemented from public data available in the world-leading MGnify and PRIDE resources.
Research center :
Luxembourg Centre for Systems Biomedicine (LCSB): Bioinformatics Core (R. Schneider Group)
Luxembourg Centre for Systems Biomedicine (LCSB): Eco-Systems Biology (Wilmes Group)
Disciplines :
Microbiology
Environmental sciences & ecology
Author, co-author :
Wang, Shengbo; European Molecular Biology Laboratory - European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, Cambridge, UK
Kaur, Satwant; European Molecular Biology Laboratory - European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, Cambridge, UK
Kunath, Benoit J; Systems Ecology Group, Luxembourg Centre for Systems Biomedicine, University of Luxembourg, Esch-sur-Alzette, Luxembourg; Department of Life Sciences and Medicine, Faculty of Science, Technology and Medicine, University of Luxembourg, Esch-sur-Alzette, Luxembourg
MAY, Patrick; University of Luxembourg > Luxembourg Centre for Systems Biomedicine (LCSB) > Bioinformatics Core
Richardson, Lorna; European Molecular Biology Laboratory - European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, Cambridge, UK
Rogers, Alexander B; European Molecular Biology Laboratory - European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, Cambridge, UK
WILMES, Paul; University of Luxembourg > Luxembourg Centre for Systems Biomedicine (LCSB) > Systems Ecology
Finn, Robert D; European Molecular Biology Laboratory - European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, Cambridge, UK
Vizcaíno, Juan Antonio; European Molecular Biology Laboratory - European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, Cambridge, UK
External co-authors :
yes
Language :
English
Title :
An Approach to Integrate Metagenomics, Metatranscriptomics and Metaproteomics Data in Public Data Resources.
Publication date :
28 April 2025
Journal title :
Proteomics
ISSN :
1615-9853
eISSN :
1615-9861
Publisher :
Wiley, Weinheim, Germany
Pages :
e202500002
Peer reviewed :
Peer Reviewed verified by ORBi
Focus Area :
Systems Biomedicine
Development Goals :
3. Good health and well-being
FNR Project :
FNR13684739 - metaPUF - The Dark Metaproteome: Identifying Proteins Of Unknown Function In The Human Gut Microbiome, 2019 (01/04/2020-31/03/2022) - Paul Wilmes
Name of the research project :
R-AGR-3717 - C19/BM/13684739/MetaPUF - WILMES Paul
Funders :
FNR - Fonds National de la Recherche
Funding number :
FNR13684739
Funding text :
The authors would like to acknowledge funding from the National Research Fund Luxembourg (FNR) [grant number C19/BM/13684739], Wellcome [grant number 223745/Z/21/Z] and EMBL core funding. We would also like to thank the original researchers who made the datasets available in the public domain.
Available on ORBilu :
since 29 April 2025

Statistics


Number of views
90 (1 by Unilu)
Number of downloads
82 (0 by Unilu)

Scopus citations®
3
Scopus citations® without self-citations
3
OpenCitations
0
OpenAlex citations
3
