Skip to main content

REREC: In-ReRAM Acceleration with Access-Aware Mapping for Personalized Recommendation

Publication ,  Conference
Wang, Y; Zhu, Z; Chen, F; Ma, M; Dai, G; Li, H; Chen, Y
Published in: IEEE/ACM International Conference on Computer-Aided Design, Digest of Technical Papers, ICCAD
January 1, 2021

Personalized recommendation systems are widely used in many Internet services. The sparse embedding lookup in recommendation models dominates the computational cost of inference due to its intensive irregular memory accesses. Applying resistive random access memory (ReRAM) based process-in-memory (PIM) architecture to accelerate recommendation processing can avoid data movements caused by off-chip memory accesses. However, naïve adoption of ReRAM-based DNN accelerators leads to low computation parallelism and severe under-utilization of computing resources, which is caused by the fine-grained inner-product in feature interaction. In this paper, we propose REREC, an architecture-algorithm co-designed accelerator, which specializes in fine-grained ReRAM-based inner-product engines with access-aware mapping algorithm for recommendation inference. At the architecture level, we reduce the size and increase the amount of crossbars. The crossbars are fully-connected by Analog-to-Digital Converters (ADCs) in one inner-product engine, which can adapt to the fine-grained and irregular computational patterns and improve the processing parallelism. We further explore trade-offs of (i) crossbar size vs. hardware utilization, and (ii) ADC implementation vs. area/energy efficiency to optimize the design. At the algorithm level, we propose a novel access-aware mapping (AAM) algorithm to optimize resource allocations. Our AAM algorithm tackles the problems of (i) the workload imbalance and (ii) the long recommendation inference latency induced by the great variance of access frequency of embedding vectors. Experimental results show that REREC achieves 7.69× speedup compared with a ReRAM-based baseline design. Compared to CPU and the state-of-the-art recommendation accelerator, REREC demonstrates 29.26× and 3.48× performance improvement, respectively.

Duke Scholars

Published In

IEEE/ACM International Conference on Computer-Aided Design, Digest of Technical Papers, ICCAD

DOI

ISSN

1092-3152

Publication Date

January 1, 2021

Volume

2021-November
 

Citation

APA
Chicago
ICMJE
MLA
NLM
Wang, Y., Zhu, Z., Chen, F., Ma, M., Dai, G., Li, H., & Chen, Y. (2021). REREC: In-ReRAM Acceleration with Access-Aware Mapping for Personalized Recommendation. In IEEE/ACM International Conference on Computer-Aided Design, Digest of Technical Papers, ICCAD (Vol. 2021-November). https://doi.org/10.1109/ICCAD51958.2021.9643573
Wang, Y., Z. Zhu, F. Chen, M. Ma, G. Dai, H. Li, and Y. Chen. “REREC: In-ReRAM Acceleration with Access-Aware Mapping for Personalized Recommendation.” In IEEE/ACM International Conference on Computer-Aided Design, Digest of Technical Papers, ICCAD, Vol. 2021-November, 2021. https://doi.org/10.1109/ICCAD51958.2021.9643573.
Wang Y, Zhu Z, Chen F, Ma M, Dai G, Li H, et al. REREC: In-ReRAM Acceleration with Access-Aware Mapping for Personalized Recommendation. In: IEEE/ACM International Conference on Computer-Aided Design, Digest of Technical Papers, ICCAD. 2021.
Wang, Y., et al. “REREC: In-ReRAM Acceleration with Access-Aware Mapping for Personalized Recommendation.” IEEE/ACM International Conference on Computer-Aided Design, Digest of Technical Papers, ICCAD, vol. 2021-November, 2021. Scopus, doi:10.1109/ICCAD51958.2021.9643573.
Wang Y, Zhu Z, Chen F, Ma M, Dai G, Li H, Chen Y. REREC: In-ReRAM Acceleration with Access-Aware Mapping for Personalized Recommendation. IEEE/ACM International Conference on Computer-Aided Design, Digest of Technical Papers, ICCAD. 2021.

Published In

IEEE/ACM International Conference on Computer-Aided Design, Digest of Technical Papers, ICCAD

DOI

ISSN

1092-3152

Publication Date

January 1, 2021

Volume

2021-November