TY - JOUR
T1 - A Diversified Attention Model for Interpretable Multiple Clusterings
AU - Ren, Liangrui
AU - Yu, Guoxian
AU - Wang, Jun
AU - Liu, Lei
AU - Domeniconi, Carlotta
AU - Zhang, Xiangliang
N1 - Generated from Scopus record by KAUST IRTS on 2023-09-20
PY - 2023/9/1
Y1 - 2023/9/1
N2 - Multiple clusterings can explore the same set of data from different perspectives by discovering different and meaningful clusterings. However, most, if not all, of the existing approaches overwhelmingly focus on the diversity between clustering subspaces, and pay much less attention on the salience of the subspaces. As a consequence, the quality of the produced clusterings is an understudied aspect of the problem. Furthermore, existing methods cannot explain the unique internal subspace structure of each clustering, and cannot incorporate multi-facet knowledge to generate different clusterings. In this paper, we propose a solution named iMClusts (interpretable Multiple Clusterings by diversified attention). iMClusts makes use of the expressive representational power of deep autoencoders and multi-head attention to generate multiple salient embedding matrices, and multiple clusterings therein. In addition, it leverages multi-facet knowledge and enhances the diversity between heads to boost the quality and diversity of multiple clusterings. Experimental results on benchmark datasets show that iMClusts can generate multiple clusterings with quality, interpretability, and diversity.
AB - Multiple clusterings can explore the same set of data from different perspectives by discovering different and meaningful clusterings. However, most, if not all, of the existing approaches overwhelmingly focus on the diversity between clustering subspaces, and pay much less attention on the salience of the subspaces. As a consequence, the quality of the produced clusterings is an understudied aspect of the problem. Furthermore, existing methods cannot explain the unique internal subspace structure of each clustering, and cannot incorporate multi-facet knowledge to generate different clusterings. In this paper, we propose a solution named iMClusts (interpretable Multiple Clusterings by diversified attention). iMClusts makes use of the expressive representational power of deep autoencoders and multi-head attention to generate multiple salient embedding matrices, and multiple clusterings therein. In addition, it leverages multi-facet knowledge and enhances the diversity between heads to boost the quality and diversity of multiple clusterings. Experimental results on benchmark datasets show that iMClusts can generate multiple clusterings with quality, interpretability, and diversity.
UR - https://ieeexplore.ieee.org/document/9935321/
UR - http://www.scopus.com/inward/record.url?scp=85141571944&partnerID=8YFLogxK
U2 - 10.1109/TKDE.2022.3218693
DO - 10.1109/TKDE.2022.3218693
M3 - Article
SN - 1558-2191
VL - 35
SP - 8852
EP - 8864
JO - IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING
JF - IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING
IS - 9
ER -