Skip to main content

BigLittleMCA: A Spatially-Optimal Tiled Hardware Accelerator for MCMC Image Processing

Publication ,  Journal Article
Kjellqvist, C; Wills, L; Lebeck, A
Published in: ACM Transactions on Architecture and Code Optimization
September 17, 2025

Markov-Chain Monte-Carlo (MCMC) algorithms offer a general framework for performing interpretable inference but have high overheads due to the computational complexity of the sampling process and the large number of samples required to produce an accurate result. Computer Vision is a common class of workloads that can be performed using MCMC methods. As computer vision workloads trend toward high-resolution real-time inference, it becomes challenging to perform inference in contexts such as edge computing, which operates under strict power and area budgets. Previous work explores hardware techniques for efficient sampling; however, MCMC algorithms still require many samples. We reduce the overheads of Gibbs Sampling, an MCMC algorithm, using an approach we call mixed-resolution sampling. This approach uses low-resolution inference to provide a starting point for full-resolution sampling. We evaluate this approach on three important computer vision tasks: stereo matching, optical flow, and blind source separation. Mixed-resolution sampling reduces root mean square error (RMSE) by an average of 19.6% for stereo-matching tasks, 13% for optical flow tasks, and 6.3% for blind source separation relative to traditional Gibbs Sampling. To enable real-time, explainable MCMC inference under edge power constraints, we exploit the structure of mixed-resolution sampling to architect and implement a hardware-software co-designed accelerator architecture, BigLittleMCA (Big-Little MCMC Accelerator). BigLittleMCA is a tiled MCMC accelerator architecture that uses a small sampler for low-resolution sampling and a large sampler for full-resolution sampling. Our results show that the architecture sustains real-time 720p inference at 30 FPS (frames per second) using 48.5% less power than prior work.

Duke Scholars

Published In

ACM Transactions on Architecture and Code Optimization

DOI

EISSN

1544-3973

ISSN

1544-3566

Publication Date

September 17, 2025

Volume

22

Issue

3

Related Subject Headings

  • 4606 Distributed computing and systems software
  • 4009 Electronics, sensors and digital hardware
  • 0906 Electrical and Electronic Engineering
  • 0803 Computer Software
 

Citation

APA
Chicago
ICMJE
MLA
NLM
Kjellqvist, C., Wills, L., & Lebeck, A. (2025). BigLittleMCA: A Spatially-Optimal Tiled Hardware Accelerator for MCMC Image Processing. ACM Transactions on Architecture and Code Optimization, 22(3). https://doi.org/10.1145/3736171
Kjellqvist, C., L. Wills, and A. Lebeck. “BigLittleMCA: A Spatially-Optimal Tiled Hardware Accelerator for MCMC Image Processing.” ACM Transactions on Architecture and Code Optimization 22, no. 3 (September 17, 2025). https://doi.org/10.1145/3736171.
Kjellqvist C, Wills L, Lebeck A. BigLittleMCA: A Spatially-Optimal Tiled Hardware Accelerator for MCMC Image Processing. ACM Transactions on Architecture and Code Optimization. 2025 Sep 17;22(3).
Kjellqvist, C., et al. “BigLittleMCA: A Spatially-Optimal Tiled Hardware Accelerator for MCMC Image Processing.” ACM Transactions on Architecture and Code Optimization, vol. 22, no. 3, Sept. 2025. Scopus, doi:10.1145/3736171.
Kjellqvist C, Wills L, Lebeck A. BigLittleMCA: A Spatially-Optimal Tiled Hardware Accelerator for MCMC Image Processing. ACM Transactions on Architecture and Code Optimization. 2025 Sep 17;22(3).

Published In

ACM Transactions on Architecture and Code Optimization

DOI

EISSN

1544-3973

ISSN

1544-3566

Publication Date

September 17, 2025

Volume

22

Issue

3

Related Subject Headings

  • 4606 Distributed computing and systems software
  • 4009 Electronics, sensors and digital hardware
  • 0906 Electrical and Electronic Engineering
  • 0803 Computer Software