Skip to main content

Characterizing output bottlenecks in a supercomputer

Publication ,  Journal Article
Xie, B; Chase, J; Dillow, D; Drokin, O; Klasky, S; Oral, S; Podhorszki, N
Published in: International Conference for High Performance Computing Networking Storage and Analysis Sc
December 1, 2012

Supercomputer I/O loads are often dominated by writes. HPC (High Performance Computing) file systems are designed to absorb these bursty outputs at high bandwidth through massive parallelism. However, the delivered write bandwidth often falls well below the peak. This paper characterizes the data absorption behavior of a center-wide shared Lustre parallel file system on the Jaguar supercomputer. We use a statistical methodology to address the challenges of accurately measuring a shared machine under production load and to obtain the distribution of bandwidth across samples of compute nodes, storage targets, and time intervals. We observe and quantify limitations from competing traffic, contention on storage servers and I/O routers, concurrency limitations in the client compute node operating systems, and the impact of variance (stragglers) on coupled output such as striping. We then examine the implications of our results for application performance and the design of I/O middleware systems on shared supercomputers. © 2012 IEEE.

Duke Scholars

Published In

International Conference for High Performance Computing Networking Storage and Analysis Sc

DOI

EISSN

2167-4337

ISSN

2167-4329

Publication Date

December 1, 2012
 

Citation

APA
Chicago
ICMJE
MLA
NLM
Xie, B., Chase, J., Dillow, D., Drokin, O., Klasky, S., Oral, S., & Podhorszki, N. (2012). Characterizing output bottlenecks in a supercomputer. International Conference for High Performance Computing Networking Storage and Analysis Sc. https://doi.org/10.1109/SC.2012.28
Xie, B., J. Chase, D. Dillow, O. Drokin, S. Klasky, S. Oral, and N. Podhorszki. “Characterizing output bottlenecks in a supercomputer.” International Conference for High Performance Computing Networking Storage and Analysis Sc, December 1, 2012. https://doi.org/10.1109/SC.2012.28.
Xie B, Chase J, Dillow D, Drokin O, Klasky S, Oral S, et al. Characterizing output bottlenecks in a supercomputer. International Conference for High Performance Computing Networking Storage and Analysis Sc. 2012 Dec 1;
Xie, B., et al. “Characterizing output bottlenecks in a supercomputer.” International Conference for High Performance Computing Networking Storage and Analysis Sc, Dec. 2012. Scopus, doi:10.1109/SC.2012.28.
Xie B, Chase J, Dillow D, Drokin O, Klasky S, Oral S, Podhorszki N. Characterizing output bottlenecks in a supercomputer. International Conference for High Performance Computing Networking Storage and Analysis Sc. 2012 Dec 1;

Published In

International Conference for High Performance Computing Networking Storage and Analysis Sc

DOI

EISSN

2167-4337

ISSN

2167-4329

Publication Date

December 1, 2012