Skip to main content

BigLittleMCA: A Spatially-Optimal Tiled Hardware Accelerator for MCMC Image Processing

Publication ,  Journal Article
Kjellqvist, C; Wills, L; Lebeck, A
Published in: ACM Transactions on Architecture and Code Optimization
May 20, 2025

Markov-Chain Monte-Carlo (MCMC) algorithms offer a general framework for performing interpretable inference but have high overheads due to the computational complexity of the sampling process and the large number of samples required to produce an accurate result. Computer Vision is a common class of workloads that can be performed using MCMC methods. As computer vision workloads trend toward high-resolution real-time inference, it becomes challenging to perform inference in contexts such as edge computing, which operates under strict power and area budgets. Previous work explores hardware techniques for efficient sampling; however, MCMC algorithms still require many samples. We reduce the overheads of Gibbs Sampling, an MCMC algorithm, using an approach we call mixed-resolution sampling. This approach uses low-resolution inference to provide a starting point for full-resolution sampling. We evaluate this approach on three important computer vision tasks: stereo matching, optical flow, and blind source separation. Mixed-resolution sampling reduces root mean square error (RMSE) by an average of 19.6% for stereo-matching tasks, 13% for optical flow tasks, and 6.3% for blind source separation relative to traditional Gibbs Sampling. To enable real-time, explainable MCMC inference under edge power constraints, we exploit the structure of mixed-resolution sampling to architect and implement a hardware-software co-designed accelerator architecture, BigLittleMCA ( - MC ccelerator). BigLittleMCA is a tiled MCMC accelerator architecture that uses a small sampler for low-resolution sampling and a large sampler for full-resolution sampling. Our results show that the architecture sustains real-time 720p inference at 30 FPS (frames per second) using 48.5% less power than prior work.

Duke Scholars

Published In

ACM Transactions on Architecture and Code Optimization

DOI

EISSN

1544-3973

ISSN

1544-3566

Publication Date

May 20, 2025

Publisher

Association for Computing Machinery (ACM)

Related Subject Headings

  • 4606 Distributed computing and systems software
  • 4009 Electronics, sensors and digital hardware
  • 0906 Electrical and Electronic Engineering
  • 0803 Computer Software
 

Citation

APA
Chicago
ICMJE
MLA
NLM
Kjellqvist, C., Wills, L., & Lebeck, A. (2025). BigLittleMCA: A Spatially-Optimal Tiled Hardware Accelerator for MCMC Image Processing. ACM Transactions on Architecture and Code Optimization. https://doi.org/10.1145/3736171
Kjellqvist, Chris, Lisa Wills, and Alvin Lebeck. “BigLittleMCA: A Spatially-Optimal Tiled Hardware Accelerator for MCMC Image Processing.” ACM Transactions on Architecture and Code Optimization, May 20, 2025. https://doi.org/10.1145/3736171.
Kjellqvist C, Wills L, Lebeck A. BigLittleMCA: A Spatially-Optimal Tiled Hardware Accelerator for MCMC Image Processing. ACM Transactions on Architecture and Code Optimization. 2025 May 20;
Kjellqvist, Chris, et al. “BigLittleMCA: A Spatially-Optimal Tiled Hardware Accelerator for MCMC Image Processing.” ACM Transactions on Architecture and Code Optimization, Association for Computing Machinery (ACM), May 2025. Crossref, doi:10.1145/3736171.
Kjellqvist C, Wills L, Lebeck A. BigLittleMCA: A Spatially-Optimal Tiled Hardware Accelerator for MCMC Image Processing. ACM Transactions on Architecture and Code Optimization. Association for Computing Machinery (ACM); 2025 May 20;

Published In

ACM Transactions on Architecture and Code Optimization

DOI

EISSN

1544-3973

ISSN

1544-3566

Publication Date

May 20, 2025

Publisher

Association for Computing Machinery (ACM)

Related Subject Headings

  • 4606 Distributed computing and systems software
  • 4009 Electronics, sensors and digital hardware
  • 0906 Electrical and Electronic Engineering
  • 0803 Computer Software