Abstract
Classical solvers for the dense symmetric eigenvalue problem suffer from the first step, which involves a reduction to tridiagonal form that is dominated by the cost of accessing memory during the panel factorization. The solution is to reduce the matrix to a banded form, which then requires the eigenvalues of the banded matrix to be computed. The standard divide and conquer algorithm can be modified for this purpose. The paper combines this insight with tile algorithms that can be scheduled via a dynamic runtime system to multicore architectures. A detailed analysis of performance and accuracy is included. Performance improvements of 14-fold and 4-fold speedups are reported relative to LAPACK and Intel's Math Kernel Library.
Original language | English (US) |
---|---|
Pages (from-to) | C249-C274 |
Number of pages | 1 |
Journal | SIAM Journal on Scientific Computing |
Volume | 34 |
Issue number | 6 |
DOIs | |
State | Published - Jan 2012 |