TY - GEN
T1 - Random Matrix Asymptotics of Inner Product Kernel Spectral Clustering
AU - Ali, Hafiz Tiomoko
AU - Kammoun, Abla
AU - Couillet, Romain
N1 - KAUST Repository Item: Exported on 2020-10-01
Acknowledgements: The work of R. Couillet and H. Tiomoko Ali is supported by the ANR Project RMT4GRAPH (ANR-14-CE28-0006)
PY - 2018/9/21
Y1 - 2018/9/21
N2 - We study in this article the asymptotic performance of spectral clustering with inner product kernel for Gaussian mixture models of high dimension with numerous samples. As is now classical in large dimensional spectral analysis, we establish a phase transition phenomenon by which a minimum distance between the class means and covariances is required for clustering to be possible from the dominant eigenvectors. Beyond this phase transition, we evaluate the asymptotic content of the dominant eigenvectors thus allowing for a full characterization of clustering performance. However, a surprising finding is that in some particular scenarios, the phase transition does not occur and clustering can be achieved irrespective of the class means and covariances. This is evidenced here in the case of the mixture of two Gaussian datasets having the same means and arbitrary difference between covariances.
AB - We study in this article the asymptotic performance of spectral clustering with inner product kernel for Gaussian mixture models of high dimension with numerous samples. As is now classical in large dimensional spectral analysis, we establish a phase transition phenomenon by which a minimum distance between the class means and covariances is required for clustering to be possible from the dominant eigenvectors. Beyond this phase transition, we evaluate the asymptotic content of the dominant eigenvectors thus allowing for a full characterization of clustering performance. However, a surprising finding is that in some particular scenarios, the phase transition does not occur and clustering can be achieved irrespective of the class means and covariances. This is evidenced here in the case of the mixture of two Gaussian datasets having the same means and arbitrary difference between covariances.
UR - http://hdl.handle.net/10754/631603
UR - https://ieeexplore.ieee.org/document/8462052
UR - http://www.scopus.com/inward/record.url?scp=85054228627&partnerID=8YFLogxK
U2 - 10.1109/ICASSP.2018.8462052
DO - 10.1109/ICASSP.2018.8462052
M3 - Conference contribution
SN - 9781538646588
SP - 2441
EP - 2445
BT - 2018 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)
PB - Institute of Electrical and Electronics Engineers (IEEE)
ER -