Scholars@Duke publication: Adaptive simultaneous multi-tenancy for GPUs

Adaptive simultaneous multi-tenancy for GPUs

Publication , Conference

Bashizade, R; Li, Y; Lebeck, AR

Published in: Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)

January 1, 2019

Published version (DOI)

Graphics Processing Units (GPUs) are energy-efficient massively parallel accelerators that are increasingly deployed in multi-tenant environments such as data-centers for general-purpose computing as well as graphics applications. Using GPUs in multi-tenant setups requires an efficient and low-overhead method for sharing the device among multiple users that improves system throughput while adapting to the changes in workload. This requires mechanisms to control the resources allocated to each kernel, and an efficient policy to make decisions about this allocation. In this paper, we propose adaptive simultaneous multi-tenancy to address these issues. Adaptive simultaneous multi-tenancy allows for sharing the GPU among multiple kernels, as opposed to single kernel multi-tenancy that only runs one kernel on the GPU at any given time and static simultaneous multi-tenancy that does not adapt to events in the system. Our proposed system dynamically adjusts the kernels’ parameters at run-time when a new kernel arrives or a running kernel ends. Evaluations using our prototype implementation show that, compared to sequentially executing the kernels, system throughput is improved by an average of 9.8% (and up to 22.4%) for combinations of kernels that include at least one low-utilization kernel.

Duke Scholars

Author Alvin R. Lebeck Computer Science

Published In

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)

DOI

10.1007/978-3-030-10632-4_5

EISSN

1611-3349

ISSN

0302-9743

ISBN

9783030106317

Publication Date

January 1, 2019

Volume

11332 LNCS

Start / End Page

83 / 106

Related Subject Headings

Artificial Intelligence & Image Processing
46 Information and computing sciences

Citation

APA

Chicago

ICMJE

MLA

NLM

Bashizade, R., Li, Y., & Lebeck, A. R. (2019). Adaptive simultaneous multi-tenancy for GPUs. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 11332 LNCS, pp. 83–106). https://doi.org/10.1007/978-3-030-10632-4_5

Bashizade, R., Y. Li, and A. R. Lebeck. “Adaptive simultaneous multi-tenancy for GPUs.” In Lecture Notes in Computer Science (Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), 11332 LNCS:83–106, 2019. https://doi.org/10.1007/978-3-030-10632-4_5.

Bashizade R, Li Y, Lebeck AR. Adaptive simultaneous multi-tenancy for GPUs. In: Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). 2019. p. 83–106.

Bashizade, R., et al. “Adaptive simultaneous multi-tenancy for GPUs.” Lecture Notes in Computer Science (Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), vol. 11332 LNCS, 2019, pp. 83–106. Scopus, doi:10.1007/978-3-030-10632-4_5.

Bashizade R, Li Y, Lebeck AR. Adaptive simultaneous multi-tenancy for GPUs. Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). 2019. p. 83–106.

Published In

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)

DOI

10.1007/978-3-030-10632-4_5

EISSN

1611-3349

ISSN

0302-9743

ISBN

9783030106317

Publication Date

January 1, 2019

Volume

11332 LNCS

Start / End Page

83 / 106

Related Subject Headings

Artificial Intelligence & Image Processing
46 Information and computing sciences