A data locality-aware design framework for reconfigurable sparse matrix-vector multiplication kernel

Publication, Conference

Li, S; Wang, Y; Wen, W; Chen, Y; Li, H

Published in: IEEE/ACM International Conference on Computer-Aided Design, Digest of Technical Papers, ICCAD

November 7, 2016

Sparse matrix-vector multiplication (SpMV) is an important computational kernel in many applications. To improve performance, software libraries dedicated to SpMV computation have been introduced, e.g., the MKL library for CPUs and the cuSPARSE library for GPUs. However, the computational throughput of these libraries is far below the peak floating-point performance offered by the hardware platforms, because the efficiency of the SpMV kernel is tightly constrained by limited memory bandwidth and irregular data access patterns. In this work, we propose a data locality-aware design framework for FPGA-based SpMV acceleration. We first incorporate the hardware constraints into sparse matrix compression at the software level to regularize memory allocation and accesses. In addition, a distributed architecture composed of processing elements is developed to improve computation parallelism. We implement the reconfigurable SpMV kernel on a Convey HC-2ex and evaluate it using the University of Florida sparse matrix collection. The experiments demonstrate an average computational efficiency of 48.2%, which is substantially higher than that of the CPU and GPU implementations. Our FPGA-based kernel has a runtime comparable to that of the GPU and achieves a 2.1x runtime reduction relative to the CPU. Moreover, our design obtains substantial savings in energy consumption: 9.3x and 5.6x better than the implementations on CPU and GPU, respectively.
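
For readers unfamiliar with the kernel, the following is a minimal sketch of SpMV over the widely used compressed sparse row (CSR) format. It is not the hardware-aware compression scheme or the processing-element architecture proposed in the paper; the function name `spmv_csr` and the small example matrix are assumptions made purely for illustration. The indirect read `x[col_idx[k]]` is the kind of irregular data access the abstract refers to.

```python
def spmv_csr(values, col_idx, row_ptr, x):
    """Compute y = A @ x for a sparse matrix A stored in CSR form.

    values  : nonzero entries of A, stored row by row
    col_idx : column index of each entry in `values`
    row_ptr : row_ptr[i]..row_ptr[i+1] delimits row i inside `values`
    x       : dense input vector
    """
    n_rows = len(row_ptr) - 1
    y = [0.0] * n_rows
    for row in range(n_rows):
        acc = 0.0
        for k in range(row_ptr[row], row_ptr[row + 1]):
            # Indirect, data-dependent read of x: the source of the
            # irregular memory access pattern in SpMV.
            acc += values[k] * x[col_idx[k]]
        y[row] = acc
    return y


# Illustrative 3x3 matrix (hypothetical data, not from the paper):
# A = [[4, 0, 0],
#      [0, 0, 2],
#      [1, 3, 0]]
values = [4.0, 2.0, 1.0, 3.0]
col_idx = [0, 2, 0, 1]
row_ptr = [0, 1, 2, 4]
x = [1.0, 2.0, 3.0]

y = spmv_csr(values, col_idx, row_ptr, x)
print(y)
```

Each nonzero contributes only one multiply and one add while requiring several memory reads (the value, its column index, and the matching entry of `x`), which is one way to see why the kernel tends to be limited by memory bandwidth rather than by floating-point throughput.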

Duke Scholars

Author Yiran Chen Electrical and Computer Engineering

Author Hai "Helen" Li Electrical and Computer Engineering

Published In

IEEE/ACM International Conference on Computer-Aided Design, Digest of Technical Papers, ICCAD

DOI

10.1145/2966986.2966987

ISSN

1092-3152

Publication Date

November 7, 2016

Volume

07-10-November-2016

Citation

APA: Li, S., Wang, Y., Wen, W., Chen, Y., & Li, H. (2016). A data locality-aware design framework for reconfigurable sparse matrix-vector multiplication kernel. In IEEE/ACM International Conference on Computer-Aided Design, Digest of Technical Papers, ICCAD (Vol. 07-10-November-2016). https://doi.org/10.1145/2966986.2966987

Chicago: Li, S., Y. Wang, W. Wen, Y. Chen, and H. Li. “A data locality-aware design framework for reconfigurable sparse matrix-vector multiplication kernel.” In IEEE/ACM International Conference on Computer-Aided Design, Digest of Technical Papers, ICCAD, Vol. 07-10-November-2016, 2016. https://doi.org/10.1145/2966986.2966987.

ICMJE: Li S, Wang Y, Wen W, Chen Y, Li H. A data locality-aware design framework for reconfigurable sparse matrix-vector multiplication kernel. In: IEEE/ACM International Conference on Computer-Aided Design, Digest of Technical Papers, ICCAD. 2016.

MLA: Li, S., et al. “A data locality-aware design framework for reconfigurable sparse matrix-vector multiplication kernel.” IEEE/ACM International Conference on Computer-Aided Design, Digest of Technical Papers, ICCAD, vol. 07-10-November-2016, 2016. Scopus, doi:10.1145/2966986.2966987.

NLM: Li S, Wang Y, Wen W, Chen Y, Li H. A data locality-aware design framework for reconfigurable sparse matrix-vector multiplication kernel. IEEE/ACM International Conference on Computer-Aided Design, Digest of Technical Papers, ICCAD. 2016.
