High throughput multidimensional tridiagonal system solvers on FPGAs

Kamalavasan Kamalakkannan, Gihan R. Mudalige, Istvan Z. Reguly, Suhaib A. Fahmy

Research output: Chapter in Book/Report/Conference proceedingConference contribution


We present a high performance tridiagonal solver library for Xilinx FPGAs optimized for multiple multi-dimensional systems common in real-world applications. An analytical performance model is developed and used to explore the design space and obtain rapid performance estimates that are over 85% accurate. This library achieves an order of magnitude better performance when solving large batches of systems than previous FPGA work. A detailed comparison with a current state-of-the-art GPU library for multi-dimensional tridiagonal systems on an Nvidia V100 GPU shows the FPGA achieving competitive or better runtime and significant energy savings of over 30%. Through this design, we learn lessons about the types of applications where FPGAs can challenge the current dominance of GPUs.
Original languageEnglish (US)
Title of host publicationProceedings of the 36th ACM International Conference on Supercomputing
StatePublished - Jun 2022


Dive into the research topics of 'High throughput multidimensional tridiagonal system solvers on FPGAs'. Together they form a unique fingerprint.

Cite this