Exploring FPGA optimizations to compute sparse numerical linear algebra kernels.

Favaro, Federico - Dufrechou, Ernesto - Ezzatti, Pablo - Oliver, Juan Pablo

Resumen:

The solution of sparse triangular linear systems (sptrsv) is the bottleneck of many numerical methods. Thus, it is crucial to count with efficient implementations of such kernel, at least for commonly used platforms. In this sense, Field–Programmable Gate Arrays (FPGAs) have evolved greatly in the last years, entering the HPC hardware ecosystem largely due to their superior energy–efficiency relative to more established accelerators. Up until recently, the design for FPGAs implied the use of low–level Hardware Description Languages (HDL) such as VHDL or Verilog. Nowadays, manufacturers are making a large effort to adopt High–Level Synthesis languages like C/C++ or OpenCL, but the gap between their performance and that of HDLs is not yet fully studied. This work focuses on the performance offered by FPGAs to compute the sptrsv using OpenCL. For this purpose, we implement different parallel variants of this kernel and experimentally evaluate several setups, varying among others the work–group size, the number of compute units, the unroll–factor and the vectorization–factor.


Detalles Bibliográficos
2020
FPGAs
Sparse linear algebra
SPTRSV
Power consumption
Inglés
Universidad de la República
COLIBRI
https://link.springer.com/chapter/10.1007/978-3-030-44534-8_20
https://hdl.handle.net/20.500.12008/31512
Acceso abierto
Licencia Creative Commons Atribución - No Comercial - Sin Derivadas (CC - By-NC-ND 4.0)
Resumen:
Sumario:Publicado en: Applied Reconfigurable Computing. Architectures, Tools, and Applications. ARC 2020. Lecture Notes in Computer Science, (LNCS, volume 12083), Springer, Cham, pp. 258-268