REREC: In-ReRAM Acceleration with Access-Aware Mapping for Personalized Recommendation
Personalized recommendation systems are widely used in many Internet services. The sparse embedding lookup in recommendation models dominates the computational cost of inference due to its intensive, irregular memory accesses. Applying a resistive random-access memory (ReRAM) based processing-in-memory (PIM) architecture to accelerate recommendation processing can avoid the data movement caused by off-chip memory accesses. However, naïvely adopting ReRAM-based DNN accelerators leads to low computational parallelism and severe under-utilization of computing resources, because feature interaction relies on fine-grained inner products. In this paper, we propose REREC, an architecture-algorithm co-designed accelerator that combines fine-grained ReRAM-based inner-product engines with an access-aware mapping algorithm for recommendation inference. At the architecture level, we reduce the size of the crossbars and increase their number. Within each inner-product engine, the crossbars are fully connected to the Analog-to-Digital Converters (ADCs), which adapts to the fine-grained, irregular computation pattern and improves processing parallelism. We further explore the trade-offs between (i) crossbar size and hardware utilization, and (ii) ADC implementation and area/energy efficiency, to optimize the design. At the algorithm level, we propose a novel access-aware mapping (AAM) algorithm to optimize resource allocation. Our AAM algorithm tackles (i) the workload imbalance and (ii) the long recommendation inference latency caused by the large variance in the access frequencies of embedding vectors. Experimental results show that REREC achieves a 7.69× speedup over a ReRAM-based baseline design. Compared with a CPU and a state-of-the-art recommendation accelerator, REREC demonstrates 29.26× and 3.48× performance improvements, respectively.
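To make the access-aware mapping idea concrete, the minimal Python sketch below illustrates one way embedding vectors with skewed access frequencies could be balanced across inner-product engines: vectors are assigned greedily to the least-loaded engine in decreasing order of access frequency. The function name, data layout, and the greedy heuristic itself are illustrative assumptions for exposition, not the paper's actual AAM algorithm.

```python
# Hypothetical sketch of an access-aware mapping heuristic; the concrete AAM
# algorithm in the paper may differ. This only illustrates balancing expected
# embedding-lookup load across inner-product engines.
import heapq
from typing import Dict, List


def access_aware_mapping(access_freq: Dict[int, float], num_engines: int) -> List[List[int]]:
    """Greedily assign embedding vectors to engines so that the accumulated
    access frequency per engine stays balanced (longest-processing-time-first
    style heuristic)."""
    # Sort embedding vectors from most to least frequently accessed.
    vectors = sorted(access_freq, key=access_freq.get, reverse=True)
    # Min-heap of (accumulated load, engine id).
    heap = [(0.0, e) for e in range(num_engines)]
    heapq.heapify(heap)
    mapping: List[List[int]] = [[] for _ in range(num_engines)]
    for v in vectors:
        load, engine = heapq.heappop(heap)  # pick the currently least-loaded engine
        mapping[engine].append(v)
        heapq.heappush(heap, (load + access_freq[v], engine))
    return mapping


if __name__ == "__main__":
    # Toy example: 8 embedding vectors with skewed access frequencies.
    freqs = {0: 0.40, 1: 0.25, 2: 0.12, 3: 0.08, 4: 0.06, 5: 0.05, 6: 0.03, 7: 0.01}
    print(access_aware_mapping(freqs, num_engines=4))
```

In this toy run, the hottest vectors end up on different engines, so no single engine serves a disproportionate share of the lookups; this captures the load-balancing goal that motivates AAM without claiming to reproduce its actual mapping policy.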