TY - JOUR
T1 - Efficient nonparametric and asymptotic Bayesian model selection methods for attributed graph clustering
AU - Xu, Zhiqiang
AU - Cheng, James
AU - Xiao, Xiaokui
AU - Fujimaki, Ryohei
AU - Muraoka, Yusuke
N1 - KAUST Repository Item: Exported on 2020-10-01
Acknowledgements: The authors would like to thank the anonymous reviewers of the paper for their valuable comments that help significantly improve the quality of the paper.
PY - 2017/2/16
Y1 - 2017/2/16
N2 - Attributed graph clustering, also known as community detection on attributed graphs, attracts much interests recently due to the ubiquity of attributed graphs in real life. Many existing algorithms have been proposed for this problem, which are either distance based or model based. However, model selection in attributed graph clustering has not been well addressed, that is, most existing algorithms assume the cluster number to be known a priori. In this paper, we propose two efficient approaches for attributed graph clustering with automatic model selection. The first approach is a popular Bayesian nonparametric method, while the second approach is an asymptotic method based on a recently proposed model selection criterion, factorized information criterion. Experimental results on both synthetic and real datasets demonstrate that our approaches for attributed graph clustering with automatic model selection significantly outperform the state-of-the-art algorithm.
AB - Attributed graph clustering, also known as community detection on attributed graphs, attracts much interests recently due to the ubiquity of attributed graphs in real life. Many existing algorithms have been proposed for this problem, which are either distance based or model based. However, model selection in attributed graph clustering has not been well addressed, that is, most existing algorithms assume the cluster number to be known a priori. In this paper, we propose two efficient approaches for attributed graph clustering with automatic model selection. The first approach is a popular Bayesian nonparametric method, while the second approach is an asymptotic method based on a recently proposed model selection criterion, factorized information criterion. Experimental results on both synthetic and real datasets demonstrate that our approaches for attributed graph clustering with automatic model selection significantly outperform the state-of-the-art algorithm.
UR - http://hdl.handle.net/10754/622931
UR - http://link.springer.com/article/10.1007/s10115-017-1030-8
UR - http://www.scopus.com/inward/record.url?scp=85013076675&partnerID=8YFLogxK
U2 - 10.1007/s10115-017-1030-8
DO - 10.1007/s10115-017-1030-8
M3 - Article
SN - 0219-1377
VL - 53
SP - 239
EP - 268
JO - Knowledge and Information Systems
JF - Knowledge and Information Systems
IS - 1
ER -