An approximate model SpMV on FPGA assisting HLS optimizations for low power and high performance

Alden C. Shaji

Zainab Aizaz

Kavita Khare

International Journal of Reconfigurable and Embedded Systems

Abstract

High-performance computing (HPC) in embedded systems is particularly relevant with the rise of artificial intelligence (AI) and machine learning at the edge. Deep learning models require substantial computational power, and running them on embedded systems with limited resources poses significant challenges. The energy efficiency of field-programmable gate arrays (FPGAs), coupled with their adaptability, makes them compelling choices for optimizing sparse matrix-vector multiplication (SpMV), which plays a significant role in various computational tasks within these fields. This article first analyzes high-level synthesis (HLS) optimizations to identify a power- and delay-efficient SpMV kernel. The optimizations considered are loop pipelining, varied memory access patterns, and data partitioning strategies, all of which influence the underlying hardware architecture. After identifying the model with minimum resource utilization, we propose an approximate model algorithm for the SpMV kernel to reduce execution time on a Xilinx Zynq-7000 FPGA. The experimental results show that FPGA power consumption was reduced by 50% compared with a previously implemented streaming dataflow engine (SDE) flow, and that the proposed approximate model improved performance by 2× relative to the original compressed sparse row (CSR) sparse matrix.
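For readers unfamiliar with the kernel being optimized, the following is a minimal sketch of a baseline SpMV over a CSR matrix in plain C. It is not the authors' kernel and does not include their approximate model, which the abstract does not specify. The function name `spmv_csr` and the array names `row_ptr`, `col_idx`, and `val` are illustrative assumptions. The comment in the inner loop marks where a loop-pipelining directive would typically be placed in an HLS flow.

```c
#include <stddef.h>

/* Illustrative baseline only: computes y = A * x for a matrix A stored in
 * compressed sparse row (CSR) form. All names here are assumptions.
 *   row_ptr: n_rows + 1 entries; row i occupies [row_ptr[i], row_ptr[i+1])
 *   col_idx: column index of each stored nonzero
 *   val:     value of each stored nonzero */
void spmv_csr(size_t n_rows,
              const int *row_ptr,
              const int *col_idx,
              const float *val,
              const float *x,
              float *y)
{
    for (size_t i = 0; i < n_rows; ++i) {
        float sum = 0.0f;
        for (int k = row_ptr[i]; k < row_ptr[i + 1]; ++k) {
            /* In an HLS flow, a directive such as "#pragma HLS PIPELINE"
             * would typically be placed in this loop. The initiation
             * interval actually achieved depends on the loop-carried
             * dependency on sum and on the irregular reads of x. */
            sum += val[k] * x[col_idx[k]];
        }
        y[i] = sum;
    }
}
```

As a usage example, the 3×3 matrix with rows (1, 0, 2), (0, 3, 0), (4, 0, 5) is stored as `row_ptr = {0, 2, 3, 5}`, `col_idx = {0, 2, 1, 0, 2}`, and `val = {1, 2, 3, 4, 5}`. Multiplying it by `x = {1, 2, 3}` gives `y = {7, 6, 19}`. The indirect access `x[col_idx[k]]` is the irregular memory pattern that memory access and data partitioning choices are meant to address.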


DOI: 10.11591/ijres.v14.i2.pp375-387

ISSN: 2089-4864

Pages: 375-387

Volume 14, Issue 2

Published: 2025-07-01
