CPR: Composable performance regression for scalable multiprocessor models

Journal Article

Uniprocessor simulators track resource utilization cycle by cycle to estimate performance. Multiprocessor simulators, however, must account for synchronization events that increase the cost of every cycle simulated and shared resource contention that increases the total number of cycles simulated. These effects cause multiprocessor simulation times to scale superlinearly with the number of cores. Composable performance regression (CPR) fundamentally addresses these intractable multiprocessor simulation times, estimating multiprocessor performance with a combination of uniprocessor, contention, and penalty models. The uniprocessor model predicts baseline performance of each core while the contention models predict interfering accesses from other cores. Uniprocessor and contention model outputs are composed by a penalty model to produce the final multiprocessor performance estimate. Trained with a production quality simulator, CPR is accurate with median errors of 6.63, 4.83 percent for dual-, quad-core multiprocessors. Furthermore, composable regression is scalable, requiring 0.33x the simulations required by prior regression strategies. © 2008 IEEE.

Full Text

Duke Authors

Cited Authors

  • Lee, BC; Collins, J; Wang, H; Brooks, D

Published Date

  • December 1, 2008

Published In

Start / End Page

  • 270 - 281

International Standard Serial Number (ISSN)

  • 1072-4451

Digital Object Identifier (DOI)

  • 10.1109/MICRO.2008.4771797

Citation Source

  • Scopus