Skip to main content
construction release_alert
Profile editing is temporarily unavailable from June 11-24, 2026 while manual profile data entry transitions to Elements. Learn More.
cancel

CAMformer: Binary Associative Memory Is All You Need

Publication ,  Journal Article
Molom-Ochir, T; Morris, BF; Horton, M; Wei, C; Guo, C; Taylor, B; Liu, P; Wang, SX; Fan, D; Li, H; Chen, Y
Published in: IEEE Transactions on Circuits and Systems I Regular Papers
January 1, 2026

Transformer attention mechanisms pose significant scalability challenges due to quadratic complexity in sequence length, and existing accelerators remain bottlenecked by dense arithmetic and data movement. This paper proposes CAMformer, a hardware accelerator that reinterprets attention as an associative memory operation, contributing at three levels. At the circuit level, a voltage-domain Binary Attention CAM (BA-CAM) computes Hamming similarity through analog charge sharing, achieving 1.12% mean error under PVT variation— 7x lower than time-domain approaches. At the architecture level, a three-stage pipeline with hierarchical two-stage top- k filtering reduces score storage by 8x while hiding DRAM latency. At the algorithm level, this top- k mechanism, co-designed with Hamming Attention Distillation (HAD), maintains <0.4% accuracy degradation on GLUE benchmarks. Implemented in 65 nm CMOS and evaluated on BERT-Large, Vision Transformer, and GPT-2 decoder workloads via HSPICE simulation and Design Compiler synthesis, CAMformer achieves 9,045 queries/mJ (10x), 191 queries/ms (4x), and 0.26 mm2 (6– 8x reduction) for attention computation compared to state-of-the-art accelerators. These results demonstrate that reconceptualizing attention as associative memory retrieval enables order-of-magnitude efficiency gains for edge Transformer inference.

Duke Scholars

Published In

IEEE Transactions on Circuits and Systems I Regular Papers

DOI

EISSN

1558-0806

ISSN

1549-8328

Publication Date

January 1, 2026

Related Subject Headings

  • Electrical & Electronic Engineering
  • 4009 Electronics, sensors and digital hardware
 

Citation

APA
Chicago
ICMJE
MLA
NLM
Molom-Ochir, T., Morris, B. F., Horton, M., Wei, C., Guo, C., Taylor, B., … Chen, Y. (2026). CAMformer: Binary Associative Memory Is All You Need. IEEE Transactions on Circuits and Systems I Regular Papers. https://doi.org/10.1109/TCSI.2026.3692014
Molom-Ochir, T., B. F. Morris, M. Horton, C. Wei, C. Guo, B. Taylor, P. Liu, et al. “CAMformer: Binary Associative Memory Is All You Need.” IEEE Transactions on Circuits and Systems I Regular Papers, January 1, 2026. https://doi.org/10.1109/TCSI.2026.3692014.
Molom-Ochir T, Morris BF, Horton M, Wei C, Guo C, Taylor B, et al. CAMformer: Binary Associative Memory Is All You Need. IEEE Transactions on Circuits and Systems I Regular Papers. 2026 Jan 1;
Molom-Ochir, T., et al. “CAMformer: Binary Associative Memory Is All You Need.” IEEE Transactions on Circuits and Systems I Regular Papers, Jan. 2026. Scopus, doi:10.1109/TCSI.2026.3692014.
Molom-Ochir T, Morris BF, Horton M, Wei C, Guo C, Taylor B, Liu P, Wang SX, Fan D, Li H, Chen Y. CAMformer: Binary Associative Memory Is All You Need. IEEE Transactions on Circuits and Systems I Regular Papers. 2026 Jan 1;

Published In

IEEE Transactions on Circuits and Systems I Regular Papers

DOI

EISSN

1558-0806

ISSN

1549-8328

Publication Date

January 1, 2026

Related Subject Headings

  • Electrical & Electronic Engineering
  • 4009 Electronics, sensors and digital hardware