Databases, Factual; Language; Documentation; Systems Biology/methods; Software; Statistics and Probability; Biochemistry; Molecular Biology; Computer Science Applications; Computational Theory and Mathematics; Computational Mathematics
Abstract :
[en] [en] SUMMARY: The systems biology graphical notation (SBGN) has become the de facto standard for the graphical representation of molecular maps. Having rapid and easy access to the content of large collections of maps is necessary to perform semantic or graph-based analysis of these resources. To this end, we propose StonPy, a new tool to store and query SBGN maps in a Neo4j graph database. StonPy notably includes a data model that takes into account all three SBGN languages and a completion module to automatically build valid SBGN maps from query results. StonPy is built as a library that can be integrated into other software and offers a command-line interface that allows users to easily perform all operations.
AVAILABILITY AND IMPLEMENTATION: StonPy is implemented in Python 3 under a GPLv3 license. Its code and complete documentation are freely available from https://github.com/adrienrougny/stonpy.
SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
Disciplines :
Biotechnology
Author, co-author :
Rougny, Adrien ; Biotechnology Research Institute for Drug Discovery, National Institute of Advanced Industrial Science and Technology (AIST), Tokyo 135-0064, Japan ; Com. Bio Big Data Open Innovation Lab. (CBBD-OIL), AIST, Tokyo 169-8555, Japan
BALAUR, Irina-Afrodita ; University of Luxembourg > Luxembourg Centre for Systems Biomedicine (LCSB) > Bioinformatics Core
Luna, Augustin; Department of Systems Biology, Harvard Medical School, Boston, MA, USA ; Department of Data Science, Dana-Farber Cancer Institute, Boston, MA, USA
MAZEIN, Alexander ; University of Luxembourg > Luxembourg Centre for Systems Biomedicine (LCSB) > Bioinformatics Core
External co-authors :
yes
Language :
English
Title :
StonPy: a tool to parse and query collections of SBGN maps in a graph database.
Kuperstein,I. et al. (2015) Atlas of cancer signalling network: a systems biology resource for integrative analysis of cancer data with google maps. Oncogenesis, 4, e160.
Le Novere,N. et al. (2009) The systems biology graphical notation. Nat. Biotechnol., 27, 735. 741.
Mazein,A. et al. (2021) Reusability and composability in process description maps: RAS-RAF-MEK-ERK signalling. Brief. Bioinform., 22, bbab103.
Mazein,A. et al. (2022). High-quality reconstruction of disease mechanisms: planning, development and maintenance. pp. 2022120209.
Mi,H. et al. (2015) Systems biology graphical notation: activity flow language level 1 version 1.2. J. Integr. Bioinform., 12, 340. 381.
Mi,H. et al. (2019) Panther version 14: more genomes, a new panther go-slim and improvements in enrichment analysis tools. Nucleic Acids Res., 47, D419. D426.
Niarakis,A. et al.; The COVID-19 Disease Map Community (2022) A versatile and interoperable computational framework for the analysis and modeling of COVID-19 disease mechanisms. bioRxiv. https://doi.org/10.1101/2022.12.17.520865.
Rougny,A. et al. (2019) Systems biology graphical notation: process description language level 1 version 2.0. J. Integr. Bioinform., 16, 20190022.
Rougny,A. et al. (2021) SBGN bricks ontology as a tool to describe recurring concepts in molecular networks. Brief. Bioinform., 22, bbab049.
Sorokin,A. et al. (2015) Systems biology graphical notation: entity relationship language level 1 version 2. J. Integr. Bioinform., 12, 281.
Touré,V. et al. (2016) STON: exploring biological pathways using the SBGN standard and graph databases. BMC Bioinform., 17, 1.