Skip to main content

Affinitention nets: Kernel perspective on attention architectures for set classification with applications to medical text and images

Publication ,  Conference
Dov, D; Assaad, S; Si, S; Wang, R; Xu, H; Kovalsky, SZ; Bell, J; Range, DE; Cohen, J; Henao, R; Carin, L
Published in: ACM CHIL 2021 - Proceedings of the 2021 ACM Conference on Health, Inference, and Learning
April 8, 2021

Set classification is the task of predicting a single label from a set comprising multiple instances. The examples we consider are pathology slides represented by sets of patches and medical text data represented by sets of word embeddings. State-of-the-art methods, such as the transformer network, typically use attention mechanisms to learn representations of set data, by modeling interactions between instances of the set. These methods, however, have complex heuristic architectures comprising multiple heads and layers. The complexity of attention architectures hampers their training when only a small number of labeled sets is available, as is often the case in medical applications. To address this problem, we present a kernel-based representation learning framework that links learning affinity kernels to learning representations from attention architectures. We show that learning a combination of the sum and the product of kernels is equivalent to learning representations from multi-head multi-layer attention architectures. From our framework, we devise a simplified attention architecture which we term affinitention (affinity-attention) nets. We demonstrate the application of affinitention nets to the classification of the Set-Cifar10 dataset, thyroid malignancy prediction from pathology slides, as well as patient text-message triage. We show that affinitention nets provide competitive results compared to heuristic attention architectures and outperform other competing methods.

Duke Scholars

Published In

ACM CHIL 2021 - Proceedings of the 2021 ACM Conference on Health, Inference, and Learning

DOI

ISBN

9781450383592

Publication Date

April 8, 2021

Start / End Page

14 / 24
 

Citation

APA
Chicago
ICMJE
MLA
NLM
Dov, D., Assaad, S., Si, S., Wang, R., Xu, H., Kovalsky, S. Z., … Carin, L. (2021). Affinitention nets: Kernel perspective on attention architectures for set classification with applications to medical text and images. In ACM CHIL 2021 - Proceedings of the 2021 ACM Conference on Health, Inference, and Learning (pp. 14–24). https://doi.org/10.1145/3450439.3451856
Dov, D., S. Assaad, S. Si, R. Wang, H. Xu, S. Z. Kovalsky, J. Bell, et al. “Affinitention nets: Kernel perspective on attention architectures for set classification with applications to medical text and images.” In ACM CHIL 2021 - Proceedings of the 2021 ACM Conference on Health, Inference, and Learning, 14–24, 2021. https://doi.org/10.1145/3450439.3451856.
Dov D, Assaad S, Si S, Wang R, Xu H, Kovalsky SZ, et al. Affinitention nets: Kernel perspective on attention architectures for set classification with applications to medical text and images. In: ACM CHIL 2021 - Proceedings of the 2021 ACM Conference on Health, Inference, and Learning. 2021. p. 14–24.
Dov, D., et al. “Affinitention nets: Kernel perspective on attention architectures for set classification with applications to medical text and images.” ACM CHIL 2021 - Proceedings of the 2021 ACM Conference on Health, Inference, and Learning, 2021, pp. 14–24. Scopus, doi:10.1145/3450439.3451856.
Dov D, Assaad S, Si S, Wang R, Xu H, Kovalsky SZ, Bell J, Range DE, Cohen J, Henao R, Carin L. Affinitention nets: Kernel perspective on attention architectures for set classification with applications to medical text and images. ACM CHIL 2021 - Proceedings of the 2021 ACM Conference on Health, Inference, and Learning. 2021. p. 14–24.

Published In

ACM CHIL 2021 - Proceedings of the 2021 ACM Conference on Health, Inference, and Learning

DOI

ISBN

9781450383592

Publication Date

April 8, 2021

Start / End Page

14 / 24