[en] The Affinity Propagation (AP) clustering algorithm proposed by Frey and Dueck (2007) provides an understandable, nearly optimal summary of a data set. However, it suffers two major shortcomings: i) the number of clusters is vague with the user-defined parameter called self-confidence, and ii) the quadratic computational complexity. When aiming at a given number of clusters due to prior knowledge, AP has to be launched many times until an appropriate setting of self-confidence is found. The re-launched AP increases the computational cost by one order of magnitude. In this paper, we propose an algorithm, called K-AP, to exploit the immediate results of K clusters by introducing a constraint in the process of message passing. Through theoretical analysis and experimental validation, K-AP was shown to be able to directly generate K clusters as user defined, with a negligible increase of computational cost compared to AP. In the meanwhile, KAP preserves the clustering quality as AP in terms of the distortion. K-AP is more effective than k-medoids w.r.t. the distortion minimization and higher clustering purity.
Disciplines :
Computer science
Identifiers :
UNILU:UL-CONFERENCE-2010-448
Author, co-author :
Zhang, Xiangliang; MCSE, King Abdullah University of Science and Technology (KAUST), Saudi Arabia
WANG, Wei ; University of Luxembourg > Faculty of Science, Technology and Communication (FSTC) > Computer Science and Communications Research Unit (CSC) ; University of Luxembourg > Interdisciplinary Centre for Security, Reliability and Trust (SNT)
Nørvåg, Kjetil; Norwegian University of Science and Technology, Norway
Sebag, Michèle; Université Paris-Sud 11, France
Language :
English
Title :
K-AP: Generating Specified K Clusters by Efficient Affinity Propagation
Publication date :
2010
Event name :
IEEE International Conference on Data Mining (ICDM)
Event place :
Sydney, Australia
Event date :
13-17 December 2010
Main work title :
ICDM '10 Proceedings of the 2010 IEEE International Conference on Data Mining
Publisher :
IEEE Computer Society, Washington, United States - District of Columbia