Article (Scientific journals)
Empowering large chemical knowledge bases for exposomics: PubChemLite meets MetFrag
SCHYMANSKI, Emma; KONDIC, Todor; Neumann, Steffen et al.
2021, in Journal of Cheminformatics, 13 (1), p. 19
Peer reviewed, verified by ORBi
 

Documents


Full text
Schymanski_etal_2021_PubChemLite_s13321-021-00489-0.pdf
Publisher postprint (3.27 MB)

Details



Abstract:
Compound (or chemical) databases are an invaluable resource for many scientific disciplines. Exposomics researchers need to find and identify relevant chemicals that cover the entirety of potential (chemical and other) exposures over entire lifetimes. This daunting task, with over 100 million chemicals in the largest chemical databases, coupled with broadly acknowledged knowledge gaps in these resources, leaves researchers faced with too much, yet not enough, information at the same time to perform comprehensive exposomics research. Furthermore, the improvements in analytical technologies and computational mass spectrometry workflows coupled with the rapid growth in databases and increasing demand for high throughput “big data” services from the research community present significant challenges for both data hosts and workflow developers. This article explores how to reduce candidate search spaces in non-target small molecule identification workflows, while increasing content usability in the context of environmental and exposomics analyses, so as to profit from the increasing size and information content of large compound databases, while increasing efficiency at the same time. In this article, these methods are explored using PubChem, the NORMAN Network Suspect List Exchange and the in silico fragmentation approach MetFrag. A subset of the PubChem database relevant for exposomics, PubChemLite, is presented as a database resource that can be (and has been) integrated into current workflows for high resolution mass spectrometry. Benchmarking datasets from earlier publications are used to show how experimental knowledge and existing datasets can be used to detect and fill gaps in compound databases to progressively improve large resources such as PubChem, and topic-specific subsets such as PubChemLite.
PubChemLite is a living collection, updating as annotation content in PubChem is updated, and exported to allow direct integration into existing workflows such as MetFrag. The source code and files necessary to recreate or adjust this are jointly hosted between the research parties (see data availability statement). This effort shows that enhancing the FAIRness (Findability, Accessibility, Interoperability and Reusability) of open resources can mutually enhance several resources for whole community benefit. The authors explicitly welcome additional community input on ideas for future developments.
Disciplines:
Environmental sciences & ecology
Author, co-author:
SCHYMANSKI, Emma; University of Luxembourg > Luxembourg Centre for Systems Biomedicine (LCSB)
KONDIC, Todor; University of Luxembourg > Luxembourg Centre for Systems Biomedicine (LCSB) > Environmental Cheminformatics
Neumann, Steffen
Thiessen, Paul A.
Zhang, Jian
Bolton, Evan E.
External co-authors:
yes
Document language:
English
Title:
Empowering large chemical knowledge bases for exposomics: PubChemLite meets MetFrag
Publication date:
March 2021
Journal title:
Journal of Cheminformatics
eISSN:
1758-2946
Publisher:
Springer, Germany
Volume:
13
Issue:
1
Pagination:
19
Peer reviewed:
Peer reviewed, verified by ORBi
Focus Area:
Systems Biomedicine
FnR Project:
FNR12341006 - Environmental Cheminformatics To Identify Unknown Chemicals And Their Effects, 2018 (01/10/2018-30/09/2023) - Emma Schymanski
Funding body:
FNR - Fonds National de la Recherche
Available on ORBilu:
since 24 April 2021

Statistics

Number of views
172 (including 4 from Unilu)
Number of downloads
109 (including 3 from Unilu)

Scopus® citations
78
Scopus® citations, excluding self-citations
55
OpenCitations
31
OpenAlex citations
113
WoS citations
78
