The copyright holder for this preprint is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under a CC-BY 4.0 International license.
[en] Abstract Despite the increasing availability of tandem mass spectrometry (MS/MS) community spectral libraries for untargeted metabolomics over the past decade, the majority of acquired MS/MS spectra remain uninterpreted. To further aid in interpreting unannotated spectra, we created a nearest neighbor suspect spectral library, consisting of 87,916 annotated MS/MS spectra derived from hundreds of millions of public MS/MS spectra. Annotations were propagated based on structural relationships to reference molecules using MS/MS-based spectrum alignment. We demonstrate the broad relevance of the nearest neighbor suspect spectral library through representative examples of propagation-based annotation of acylcarnitines, bacterial and plant natural products, and drug metabolism. Our results also highlight how the library can help to better understand an Alzheimer’s brain phenotype. The nearest neighbor suspect spectral library is openly available through the GNPS platform to help investigators hypothesize candidate structures for unknown MS/MS spectra in untargeted metabolomics data.
Disciplines :
Life sciences: Multidisciplinary, general & others
Author, co-author :
Bittremieux, Wout
Avalon, Nicole E.
Thomas, Sydney P.
Kakhkhorov, Sarvar A.
Aksenov, Alexander A.
Gomes, Paulo Wender P.
Aceves, Christine M.
Caraballo Rodríguez, Andrés Mauricio
Gauglitz, Julia M.
Gerwick, William H.
Jarmusch, Alan K.
Kaddurah-Daouk, Rima F.
Kang, Kyo Bin
Kim, Hyun Woo
Kondic, Todor ; University of Luxembourg > Luxembourg Centre for Systems Biomedicine (LCSB) > Environmental Cheminformatics
Mannochio-Russo, Helena
Meehan, Michael J.
Melnik, Alexey V.
Nothias, Louis-Felix
O’Donovan, Claire
Panitchpakdi, Morgan
Petras, Daniel
Schmid, Robin
Schymanski, Emma ; University of Luxembourg > Luxembourg Centre for Systems Biomedicine (LCSB)