Article (Scientific journals)
Clustering approaches for visual knowledge exploration in molecular interaction networks.
Ostaszewski, Marek; Kieffer, Emmanuel; Danoy, Gregoire et al.
2018In BMC Bioinformatics, 19 (1), p. 308
Peer Reviewed verified by ORBi
 

Files


Full Text
s12859-018-2314-z.pdf
Publisher postprint (1.09 MB)
Request a copy

All documents in ORBilu are protected by a user license.

Send to



Details



Keywords :
Algorithms; Cluster Analysis; Computational Biology/methods; Computer Graphics; Data Mining/methods; Databases, Factual; Gene Regulatory Networks; Humans; Bi-level optimization; Clustering; Evolutionary algorithms; Knowledge discovery; Molecular diagrams; Ontology
Abstract :
[en] BACKGROUND: Biomedical knowledge grows in complexity, and becomes encoded in network-based repositories, which include focused, expert-drawn diagrams, networks of evidence-based associations and established ontologies. Combining these structured information sources is an important computational challenge, as large graphs are difficult to analyze visually. RESULTS: We investigate knowledge discovery in manually curated and annotated molecular interaction diagrams. To evaluate similarity of content we use: i) Euclidean distance in expert-drawn diagrams, ii) shortest path distance using the underlying network and iii) ontology-based distance. We employ clustering with these metrics used separately and in pairwise combinations. We propose a novel bi-level optimization approach together with an evolutionary algorithm for informative combination of distance metrics. We compare the enrichment of the obtained clusters between the solutions and with expert knowledge. We calculate the number of Gene and Disease Ontology terms discovered by different solutions as a measure of cluster quality. Our results show that combining distance metrics can improve clustering accuracy, based on the comparison with expert-provided clusters. Also, the performance of specific combinations of distance functions depends on the clustering depth (number of clusters). By employing bi-level optimization approach we evaluated relative importance of distance functions and we found that indeed the order by which they are combined affects clustering performance. Next, with the enrichment analysis of clustering results we found that both hierarchical and bi-level clustering schemes discovered more Gene and Disease Ontology terms than expert-provided clusters for the same knowledge repository. Moreover, bi-level clustering found more enriched terms than the best hierarchical clustering solution for three distinct distance metric combinations in three different instances of disease maps. CONCLUSIONS: In this work we examined the impact of different distance functions on clustering of a visual biomedical knowledge repository. We found that combining distance functions may be beneficial for clustering, and improve exploration of such repositories. We proposed bi-level optimization to evaluate the importance of order by which the distance functions are combined. Both combination and order of these functions affected clustering quality and knowledge recognition in the considered benchmarks. We propose that multiple dimensions can be utilized simultaneously for visual knowledge exploration.
Disciplines :
Computer science
Author, co-author :
Ostaszewski, Marek  ;  University of Luxembourg > Luxembourg Centre for Systems Biomedicine (LCSB)
Kieffer, Emmanuel ;  University of Luxembourg > Interdisciplinary Centre for Security, Reliability and Trust (SNT)
Danoy, Gregoire
Schneider, Reinhard ;  University of Luxembourg > Luxembourg Centre for Systems Biomedicine (LCSB)
Bouvry, Pascal ;  University of Luxembourg > Faculty of Science, Technology and Communication (FSTC) > Computer Science and Communications Research Unit (CSC)
External co-authors :
no
Language :
English
Title :
Clustering approaches for visual knowledge exploration in molecular interaction networks.
Publication date :
2018
Journal title :
BMC Bioinformatics
ISSN :
1471-2105
Publisher :
BioMed Central, United Kingdom
Volume :
19
Issue :
1
Pages :
308
Peer reviewed :
Peer Reviewed verified by ORBi
Focus Area :
Computational Sciences
Available on ORBilu :
since 27 October 2018

Statistics


Number of views
207 (34 by Unilu)
Number of downloads
6 (0 by Unilu)

Scopus citations®
 
4
Scopus citations®
without self-citations
2
OpenCitations
 
2
WoS citations
 
4

Bibliography


Similar publications



Contact ORBilu