TY - GEN
T1 - Data streaming with affinity propagation
AU - Zhang, Xiangliang
AU - Furtlehner, Cyril
AU - Sebag, Michèle
N1 - Generated from Scopus record by KAUST IRTS on 2023-09-20
PY - 2008/11/19
Y1 - 2008/11/19
N2 - This paper proposed StrAP (Streaming AP), extending Affinity Propagation (AP) to data steaming. AP, a new clustering algorithm, extracts the data items, or exemplars, that best represent the dataset using a message passing method. Several steps are made to build StrAP. The first one (Weighted AP) extends AP to weighted items with no loss of generality. The second one (Hierarchical WAP) is concerned with reducing the quadratic AP complexity, by applying AP on data subsets and further applying Weighted AP on the exemplars extracted from all subsets. Finally StrAP extends Hierarchical WAP to deal with changes in the data distribution. Experiments on artificial datasets, on the Intrusion Detection benchmark (KDD99) and on a real-world problem, clustering the stream of jobs submitted to the EGEE grid system, provide a comparative validation of the approach. © 2008 Springer-Verlag Berlin Heidelberg.
AB - This paper proposed StrAP (Streaming AP), extending Affinity Propagation (AP) to data steaming. AP, a new clustering algorithm, extracts the data items, or exemplars, that best represent the dataset using a message passing method. Several steps are made to build StrAP. The first one (Weighted AP) extends AP to weighted items with no loss of generality. The second one (Hierarchical WAP) is concerned with reducing the quadratic AP complexity, by applying AP on data subsets and further applying Weighted AP on the exemplars extracted from all subsets. Finally StrAP extends Hierarchical WAP to deal with changes in the data distribution. Experiments on artificial datasets, on the Intrusion Detection benchmark (KDD99) and on a real-world problem, clustering the stream of jobs submitted to the EGEE grid system, provide a comparative validation of the approach. © 2008 Springer-Verlag Berlin Heidelberg.
UR - http://link.springer.com/10.1007/978-3-540-87481-2_41
UR - http://www.scopus.com/inward/record.url?scp=56049108623&partnerID=8YFLogxK
U2 - 10.1007/978-3-540-87481-2_41
DO - 10.1007/978-3-540-87481-2_41
M3 - Conference contribution
AN - SCOPUS:56049108623
SN - 3540874801
SN - 9783540874805
T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
SP - 628
EP - 643
BT - Machine Learning and Knowledge Discovery in Databases - European Conference, ECML PKDD 2008, Proceedings
T2 - European Conference on Machine Learning and Knowledge Discovery in Databases, ECML PKDD 2008
Y2 - 15 September 2008 through 19 September 2008
ER -