curation; disease mechanisms; pathway biology; systems biology; translational research; Biochemistry; Biotechnology; Computational Mathematics; Statistics and Probability; Structural Biology; Environmental Engineering
Abstract :
[en] As a conceptual model of disease mechanisms, a disease map integrates available knowledge and is applied for data interpretation, predictions and hypothesis generation. Disease mechanisms can be modelled at different levels of granularity, and the approach can be adjusted to the goals of a particular project. This rich environment, together with the requirements for high-quality network reconstruction, makes it challenging for new curators and groups to be introduced quickly to the development methods. In this review, we offer a step-by-step guide for developing a disease map within its mainstream pipeline, which involves using the CellDesigner tool for creating and editing diagrams and the MINERVA Platform for online visualisation and exploration. We also describe how the Neo4j graph database environment can be used to manage and query such a resource efficiently. To assess interoperability and reproducibility, we apply the FAIR principles.
Disciplines :
Life sciences: Multidisciplinary, general & others
Author, co-author :
MAZEIN, Alexander ; University of Luxembourg > Luxembourg Centre for Systems Biomedicine (LCSB) > Bioinformatics Core
ACENCIO, Marcio Luis ; University of Luxembourg > Luxembourg Centre for Systems Biomedicine (LCSB) > Bioinformatics Core
BALAUR, Irina-Afrodita ; University of Luxembourg > Luxembourg Centre for Systems Biomedicine (LCSB) > Bioinformatics Core
Rougny, Adrien; Independent Researcher, Massy, France
WELTER, Danielle ; University of Luxembourg > Luxembourg Centre for Systems Biomedicine > Bioinformatics Core > Translational Informatics
Niarakis, Anna; Université Paris-Saclay, Laboratoire Européen de Recherche Pour la Polyarthrite Rhumatoïde-Genhotel, University Evry, Evry, France ; Lifeware Group, Inria Saclay-Ile de France, Palaiseau, France
Ramirez Ardila, Diana; ITTM Information Technology for Translational Medicine, Esch-sur-Alzette, Luxembourg
Dogrusoz, Ugur; Computer Engineering Department, Bilkent University, Ankara, Türkiye
GAWRON, Piotr ; University of Luxembourg > Luxembourg Centre for Systems Biomedicine > Bioinformatics Core > Visualisation
SATAGOPAM, Venkata ; University of Luxembourg > Luxembourg Centre for Systems Biomedicine (LCSB) > Bioinformatics Core ; ELIXIR Luxembourg, Belvaux, Luxembourg
GU, Wei ; University of Luxembourg ; ELIXIR Luxembourg, Belvaux, Luxembourg
KREMER, Andreas ; University of Luxembourg ; ITTM Information Technology for Translational Medicine, Esch-sur-Alzette, Luxembourg
SCHNEIDER, Reinhard ; University of Luxembourg > Luxembourg Centre for Systems Biomedicine (LCSB) > Bioinformatics Core ; ELIXIR Luxembourg, Belvaux, Luxembourg
OSTASZEWSKI, Marek ; University of Luxembourg > Luxembourg Centre for Systems Biomedicine (LCSB) > Bioinformatics Core ; ELIXIR Luxembourg, Belvaux, Luxembourg