Skip to main content

Understanding RDMA Microarchitecture Resources for Performance Isolation

Publication ,  Conference
Kong, X; Chen, J; Bai, W; Xu, Y; Elhaddad, M; Raindel, S; Padhye, J; Lebeck, AR; Zhuo, D
Published in: Proceedings of the 20th USENIX Symposium on Networked Systems Design and Implementation, NSDI 2023
January 1, 2023

Recent years have witnessed the wide adoption of RDMA in the cloud to accelerate first-party workloads and achieve cost savings by freeing up CPU cycles. Now cloud providers are working towards supporting RDMA in general-purpose guest VMs to benefit third-party workloads. To this end, cloud providers must provide strong performance isolation so that the RDMA workloads of one tenant do not adversely impact the RDMA performance of another tenant. Despite many efforts on network performance isolation in the public cloud, we find that RDMA brings unique challenges due to its complex NIC microarchitecture resources (e.g., the NIC cache). In this paper, we aim to systematically understand the impact of RNIC microarchitecture resources on performance isolation. We present a model that represents how RDMA operations use RNIC resources. Using this model, we develop a test suite to evaluate RDMA performance isolation solutions. Our test suite can break all existing solutions in various scenarios. Our results are acknowledged and reproduced by one of the largest RDMA NIC vendors. Finally, based on the test results, we summarize new insights on designing future RDMA performance isolation solutions.

Duke Scholars

Published In

Proceedings of the 20th USENIX Symposium on Networked Systems Design and Implementation, NSDI 2023

Publication Date

January 1, 2023

Start / End Page

31 / 48
 

Citation

APA
Chicago
ICMJE
MLA
NLM
Kong, X., Chen, J., Bai, W., Xu, Y., Elhaddad, M., Raindel, S., … Zhuo, D. (2023). Understanding RDMA Microarchitecture Resources for Performance Isolation. In Proceedings of the 20th USENIX Symposium on Networked Systems Design and Implementation, NSDI 2023 (pp. 31–48).
Kong, X., J. Chen, W. Bai, Y. Xu, M. Elhaddad, S. Raindel, J. Padhye, A. R. Lebeck, and D. Zhuo. “Understanding RDMA Microarchitecture Resources for Performance Isolation.” In Proceedings of the 20th USENIX Symposium on Networked Systems Design and Implementation, NSDI 2023, 31–48, 2023.
Kong X, Chen J, Bai W, Xu Y, Elhaddad M, Raindel S, et al. Understanding RDMA Microarchitecture Resources for Performance Isolation. In: Proceedings of the 20th USENIX Symposium on Networked Systems Design and Implementation, NSDI 2023. 2023. p. 31–48.
Kong, X., et al. “Understanding RDMA Microarchitecture Resources for Performance Isolation.” Proceedings of the 20th USENIX Symposium on Networked Systems Design and Implementation, NSDI 2023, 2023, pp. 31–48.
Kong X, Chen J, Bai W, Xu Y, Elhaddad M, Raindel S, Padhye J, Lebeck AR, Zhuo D. Understanding RDMA Microarchitecture Resources for Performance Isolation. Proceedings of the 20th USENIX Symposium on Networked Systems Design and Implementation, NSDI 2023. 2023. p. 31–48.

Published In

Proceedings of the 20th USENIX Symposium on Networked Systems Design and Implementation, NSDI 2023

Publication Date

January 1, 2023

Start / End Page

31 / 48