![]() Rauschenberger, Armin ![]() ![]() in Bioinformatics (2021), 37(21), 38893895 Motivation: Multivariate (multi-target) regression has the potential to outperform univariate (single-target) regression at predicting correlated outcomes, which frequently occur in biomedical and ... [more ▼] Motivation: Multivariate (multi-target) regression has the potential to outperform univariate (single-target) regression at predicting correlated outcomes, which frequently occur in biomedical and clinical research. Here we implement multivariate lasso and ridge regression using stacked generalisation. Results: Our flexible approach leads to predictive and interpretable models in high-dimensional settings, with a single estimate for each input-output effect. In the simulation, we compare the predictive performance of several state-of-the-art methods for multivariate regression. In the application, we use clinical and genomic data to predict multiple motor and non-motor symptoms in Parkinson’s disease patients. We conclude that stacked multivariate regression, with our adaptations, is a competitive method for predicting correlated outcomes. Availability and Implementation: The R package joinet is available on GitHub (https://github.com/rauschenberger/joinet) and CRAN (https://CRAN.R-project.org/package=joinet). [less ▲] Detailed reference viewed: 84 (3 UL)![]() ; Pang, Jun ![]() in Bioinformatics (2021), 36(6), 879-881 Detailed reference viewed: 92 (4 UL)![]() Kratochvil, Miroslav ![]() ![]() in Bioinformatics (2021) COBREXA.jl is a Julia package for scalable, high-performance constraint-based reconstruction and analysis of very large-scale biological models. Its primary purpose is to facilitate the integration of ... [more ▼] COBREXA.jl is a Julia package for scalable, high-performance constraint-based reconstruction and analysis of very large-scale biological models. Its primary purpose is to facilitate the integration of modern high performance computing environments with the processing and analysis of large-scale metabolic models of challenging complexity. We report the architecture of the package, and demonstrate how the design promotes analysis scalability on several use-cases with multi-organism community models.https://doi.org/10.17881/ZKCR-BT30.Supplementary data are available at Bioinformatics online. [less ▲] Detailed reference viewed: 67 (8 UL)![]() Rauschenberger, Armin ![]() ![]() in Bioinformatics (2021), 37(14), 20122016 Motivation: Machine learning in the biomedical sciences should ideally provide predictive and interpretable models. When predicting outcomes from clinical or molecular features, applied researchers often ... [more ▼] Motivation: Machine learning in the biomedical sciences should ideally provide predictive and interpretable models. When predicting outcomes from clinical or molecular features, applied researchers often want to know which features have effects, whether these effects are positive or negative, and how strong these effects are. Regression analysis includes this information in the coefficients but typically renders less predictive models than more advanced machine learning techniques. Results: Here we propose an interpretable meta-learning approach for high-dimensional regression. The elastic net provides a compromise between estimating weak effects for many features and strong effects for some features. It has a mixing parameter to weight between ridge and lasso regularisation. Instead of selecting one weighting by tuning, we combine multiple weightings by stacking. We do this in a way that increases predictivity without sacrificing interpretability. Availability and Implementation: The R package starnet is available on GitHub: https://github.com/rauschenberger/starnet. [less ▲] Detailed reference viewed: 285 (27 UL)![]() del Sol Mesa, Antonio ![]() in Bioinformatics (2019) Detailed reference viewed: 182 (10 UL)![]() ; ; et al in Bioinformatics (2019) The correct classification of missense variants as benign or pathogenic remains challenging. Pathogenic variants are expected to have higher deleterious prediction scores than benign variants in the same ... [more ▼] The correct classification of missense variants as benign or pathogenic remains challenging. Pathogenic variants are expected to have higher deleterious prediction scores than benign variants in the same gene. However, most of the existing variant annotation tools do not reference the score range of benign population variants on gene level. Here, we present a web-application, Variant Score Ranker, which enables users to rapidly annotate variants and perform gene-specific variant score ranking on the population level. We also provide an intuitive example of how gene- and population-calibrated variant ranking scores can improve epilepsy variant prioritization. [less ▲] Detailed reference viewed: 88 (4 UL)![]() Hoksza, David ![]() ![]() ![]() in Bioinformatics (2019) SUMMARY: The complexity of molecular networks makes them difficult to navigate and interpret, creating a need for specialized software. MINERVA is a web platform for visualization, exploration and ... [more ▼] SUMMARY: The complexity of molecular networks makes them difficult to navigate and interpret, creating a need for specialized software. MINERVA is a web platform for visualization, exploration and management of molecular networks. Here, we introduce an extension to MINERVA architecture that greatly facilitates the access and use of the stored molecular network data. It allows to incorporate such data in analytical pipelines via a programmatic access interface, and to extend the platform's visual exploration and analytics functionality via plugin architecture. This is possible for any molecular network hosted by the MINERVA platform encoded in well-recognized systems biology formats. To showcase the possibilities of the plugin architecture, we have developed several plugins extending the MINERVA core functionalities. In the article, we demonstrate the plugins for interactive tree traversal of molecular networks, for enrichment analysis and for mapping and visualization of known disease variants or known adverse drug reactions to molecules in the network. AVAILABILITY AND IMPLEMENTATION: Plugins developed and maintained by the MINERVA team are available under the AGPL v3 license at https://git-r3lab.uni.lu/minerva/plugins/. The MINERVA API and plugin documentation is available at https://minerva-web.lcsb.uni.lu. [less ▲] Detailed reference viewed: 182 (6 UL)![]() ; Paul, Soumya ![]() in Bioinformatics (2019), 35(14), 558-567 Detailed reference viewed: 169 (3 UL)![]() Ravichandran, Srikanth ![]() ![]() in Bioinformatics (2019) SUMMARY: Single-cell RNA-sequencing is increasingly employed to characterize disease or ageing cell subpopulation phenotypes. Despite exponential increase in data generation, systematic identification of ... [more ▼] SUMMARY: Single-cell RNA-sequencing is increasingly employed to characterize disease or ageing cell subpopulation phenotypes. Despite exponential increase in data generation, systematic identification of key regulatory factors for controlling cellular phenotype to enable cell rejuvenation in disease or ageing remains a challenge. Here, we present SigHotSpotter, a computational tool to predict hotspots of signaling pathways responsible for the stable maintenance of cell subpopulation phenotypes, by integrating signaling and transcriptional networks. Targeted perturbation of these signaling hotspots can enable precise control of cell subpopulation phenotypes. SigHotSpotter correctly predicts the signaling hotspots with known experimental validations in different cellular systems. The tool is simple, user-friendly and is available as web-server or as stand-alone software. We believe SigHotSpotter will serve as a general purpose tool for the systematic prediction of signaling hotspots based on single-cell RNA-seq data, and potentiate novel cell rejuvenation strategies in the context of disease and ageing. AVAILABILITY AND IMPLEMENTATION: SigHotSpotter is at https://SigHotSpotter.lcsb.uni.lu as a web tool. Source code, example datasets and other information are available at https://gitlab.com/srikanth.ravichandran/sighotspotter. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online. [less ▲] Detailed reference viewed: 68 (2 UL)![]() Baldini, Federico ![]() ![]() ![]() in Bioinformatics (2018) The application of constraint-based modeling to functionally analyze metagenomic data has been limited so far, partially due to the absence of suitable toolboxes. To address this gap, we created a ... [more ▼] The application of constraint-based modeling to functionally analyze metagenomic data has been limited so far, partially due to the absence of suitable toolboxes. To address this gap, we created a comprehensive toolbox to model i) microbe-microbe and host-microbe metabolic interactions, and ii) microbial communities using microbial genome-scale metabolic reconstructions and metagenomic data. The Microbiome Modeling Toolbox extends the functionality of the COBRA Toolbox. The Microbiome Modeling Toolbox and the tutorials at https://git.io/microbiomeModelingToolbox. [less ▲] Detailed reference viewed: 320 (8 UL)![]() Hoksza, David ![]() ![]() ![]() in Bioinformatics (2018) Summary MolArt fills the gap between sequence and structure visualization by providing a light-weight, interactive environment enabling exploration of sequence annotations in the context of available ... [more ▼] Summary MolArt fills the gap between sequence and structure visualization by providing a light-weight, interactive environment enabling exploration of sequence annotations in the context of available experimental or predicted protein structures. Provided a UniProt ID, MolArt downloads and displays sequence annotations, sequence-structure mapping and relevant structures. The sequence and structure views are interlinked, enabling sequence annotations being color overlaid over the mapped structures, thus providing an enhanced understanding and interpretation of the available molecular data. Availability and implementation MolArt is released under the Apache 2 license and is available at https://github.com/davidhoksza/MolArt. The project web page https://davidhoksza.github.io/MolArt/ features examples and applications of the tool. [less ▲] Detailed reference viewed: 165 (17 UL)![]() ; ; et al in Bioinformatics (2018) Detailed reference viewed: 204 (35 UL)![]() Moein, Mahsa ![]() ![]() in Bioinformatics (2018), 1 Ca2þ is a central second messenger in eukaryotic cells that regulates many cellular proc- esses. Recently, we have indicated that typical Ca2þ signals are not purely oscillatory as widely assumed, but ... [more ▼] Ca2þ is a central second messenger in eukaryotic cells that regulates many cellular proc- esses. Recently, we have indicated that typical Ca2þ signals are not purely oscillatory as widely assumed, but exhibit stochastic spiking with cell type and pathway specific characteristics. Here, we present the Calcium Signaling Analyzer (CaSiAn), an open source software tool that allows for quantifying these signal characteristics including individual spike properties and time course statis- tics in a semi-automated manner. CaSiAn provides an intuitive graphical user interface allowing experimentalists to easily process a large amount of Ca2þ signals, interactively tune peak detection, revise statistical measures and access the quantified signal properties as excel or text files. [less ▲] Detailed reference viewed: 153 (16 UL)![]() ; ; Gu, Wei ![]() in Bioinformatics (2018) Motivation Standardization and semantic alignment have been considered one of the major challenges for data integration in clinical research. The inclusion of the CDISC SDTM clinical data standard into ... [more ▼] Motivation Standardization and semantic alignment have been considered one of the major challenges for data integration in clinical research. The inclusion of the CDISC SDTM clinical data standard into the tranSMART i2b2 via a guiding master ontology tree positively impacts and supports the efficacy of data sharing, visualization and exploration across datasets. Results We present here a schema for the organization of SDTM variables into the tranSMART i2b2 tree along with a script and test dataset to exemplify the mapping strategy. The eTRIKS master tree concept is demonstrated by making use of fictitious data generated for four patients, including 16 SDTM clinical domains. We describe how the usage of correct visit names and data labels can help to integrate multiple readouts per patient and avoid ETL crashes when running a tranSMART loading routine. Availability The eTRIKS Master Tree package and test datasets are publicly available at https://doi.org/10.5281/zenodo.1009098 and a functional demo installation at https://public.etriks.org/transmart/datasetExplorer/ under eTRIKS - Master Tree branch, where the discussed examples can be visualized. [less ▲] Detailed reference viewed: 194 (16 UL)![]() De Landtsheer, Sébastien ![]() ![]() ![]() in Bioinformatics (2017) Motivation: Mathematical modelling of regulatory networks allows for the discovery of knowledge at the system level. However, existing modelling tools are often computation-heavy and do not offer ... [more ▼] Motivation: Mathematical modelling of regulatory networks allows for the discovery of knowledge at the system level. However, existing modelling tools are often computation-heavy and do not offer intuitive ways to explore the model, to test hypotheses or to interpret the results biologically. Results: We have developed a computational approach to contextualise logical models of regulatory networks with biological measurements based on a probabilistic description of rule-based interactions between the different molecules. Here, we propose a Matlab toolbox, FALCON, to automatically and efficiently build and contextualise networks, which includes a pipeline for conducting parameter analysis, knockouts, and easy and fast model investigation. The contextualised models could then provide qualitative and quantitative information about the network and suggest hypotheses about biological processes. Availability and implementation: FALCON is freely available for non-commercial users on GitHub under the GPLv3 licence. The toolbox, installation instructions, full documentation and test datasets are available at https://github.com/sysbiolux/FALCON . FALCON runs under Matlab (MathWorks) and requires the Optimization Toolbox. Contact: thomas.sauter@uni.lu. Supplementary information: Supplementary data are available at Bioinformatics online. [less ▲] Detailed reference viewed: 212 (31 UL)![]() Herzinger, Sascha ![]() ![]() ![]() in Bioinformatics (2017) In translational research, efficient knowledge exchange between the different fields of expertise is crucial. An open platform that is capable of storing a multitude of data types such as clinical, pre ... [more ▼] In translational research, efficient knowledge exchange between the different fields of expertise is crucial. An open platform that is capable of storing a multitude of data types such as clinical, pre-clinical, or OMICS data combined with strong visual analytical capabilities will significantly accelerate the scientific progress by making data more accessible and hypothesis generation easier. The open data warehouse tranSMART is capable of storing a variety of data types and has a growing user community including both academic institutions and pharmaceutical companies. tranSMART, however, currently lacks interactive and dynamic visual analytics and does not permit any post-processing interaction or exploration. For this reason, we developed SmartR , a plugin for tranSMART, that equips the platform not only with several dynamic visual analytical workflows, but also provides its own framework for the addition of new custom workflows. Modern web technologies such as D3.js or AngularJS were used to build a set of standard visualizations that were heavily improved with dynamic elements. Contact: reinhard.schneider@uni.lu. Supplementary information: Supplementary data are available at Bioinformatics online. Availability: : The source code is licensed under the Apache 2.0 License and is freely available on GitHub: https://github.com/transmart/SmartR. [less ▲] Detailed reference viewed: 310 (20 UL)![]() del Sol Mesa, Antonio ![]() ![]() ![]() in Bioinformatics (2017) Detailed reference viewed: 246 (33 UL)![]() Heirendt, Laurent ![]() ![]() ![]() in Bioinformatics (2017) MOTIVATION: Flux balance analysis, and its variants, are widely used methods for predicting steady-state reaction rates in biochemical reaction networks. The exploration of high dimensional networks with ... [more ▼] MOTIVATION: Flux balance analysis, and its variants, are widely used methods for predicting steady-state reaction rates in biochemical reaction networks. The exploration of high dimensional networks with such methods is currently hampered by software performance limitations. RESULTS: DistributedFBA.jl is a high-level, high-performance, open-source implementation of flux balance analysis in Julia. It is tailored to solve multiple flux balance analyses on a subset or all the reactions of large and huge-scale networks, on any number of threads or nodes. AVAILABILITY: The code is freely available on github.com/opencobra/COBRA.jl. The documentation can be found at opencobra.github.io/COBRA.jl. [less ▲] Detailed reference viewed: 261 (12 UL)![]() Haraldsdottir, Hulda ![]() ![]() in Bioinformatics (2017) Detailed reference viewed: 180 (9 UL)![]() ; ; et al in Bioinformatics (2016), 32(3), 474-6 Summary: We present a new, extended version of the Protein Topology Graph Library (PTGL) web server. The PTGL describes the protein topology on the super-secondary structure level. It allows to compute ... [more ▼] Summary: We present a new, extended version of the Protein Topology Graph Library (PTGL) web server. The PTGL describes the protein topology on the super-secondary structure level. It allows to compute and visualize protein ligand graphs and search for protein structural motifs. The new server features additional information on ligand binding to secondary structure elements (SSEs), increased usability, and an application programming interface (API) to retrieve data, allowing for an automated analysis of protein topology. Availability: The PTGL server is freely available on the web at http://ptgl.uni-frankfurt.de. The website is implemented in PHP, JavaScript, PostgreSQL and Apache. It is supported by all major browsers. The VPLG software that was used to compute the protein ligand graphs and all other data in the database is available under the GNU public license 2.0 from http://vplg.sourceforge.net. [less ▲] Detailed reference viewed: 372 (8 UL) |
||