Scholars@Duke publication: Efficient Distributed Sequence Parallelism for Transformer-based Image Segmentation

Efficient Distributed Sequence Parallelism for Transformer-based Image Segmentation

Publication , Conference

Lyngaas, I; Meena, MG; Calabrese, E; Wahib, M; Chen, P; Igarashi, J; Huo, Y; Wang, X

Published in: IS and T International Symposium on Electronic Imaging Science and Technology

January 1, 2024

We introduce an efficient distributed sequence parallel approach for training transformer-based deep learning image segmentation models. The neural network models are comprised of a combination of a Vision Transformer encoder with a convolutional decoder to provide image segmentation mappings. The utility of the distributed sequence parallel approach is especially useful in cases where the tokenized embedding representation of image data are too large to fit into standard computing hardware memory. To demonstrate the performance and characteristics of our models trained in sequence parallel fashion compared to standard models, we evaluate our approach using a 3D MRI brain tumor segmentation dataset. We show that training with a sequence parallel approach can match standard sequential model training in terms of convergence. Furthermore, we show that our sequence parallel approach has the capability to support training of models that would not be possible on standard computing resources.

Duke Scholars

Author Evan Calabrese Radiology, Neuroradiology

Published In

IS and T International Symposium on Electronic Imaging Science and Technology

DOI

10.2352/EI.2024.36.12.HPCI-199

EISSN

2470-1173

Publication Date

January 1, 2024

Volume

Issue

Start / End Page

1991 / 1997

Citation

APA

Chicago

ICMJE

MLA

NLM

Lyngaas, I., Meena, M. G., Calabrese, E., Wahib, M., Chen, P., Igarashi, J., … Wang, X. (2024). Efficient Distributed Sequence Parallelism for Transformer-based Image Segmentation. In IS and T International Symposium on Electronic Imaging Science and Technology (Vol. 36, pp. 1991–1997). https://doi.org/10.2352/EI.2024.36.12.HPCI-199

Lyngaas, I., M. G. Meena, E. Calabrese, M. Wahib, P. Chen, J. Igarashi, Y. Huo, and X. Wang. “Efficient Distributed Sequence Parallelism for Transformer-based Image Segmentation.” In IS and T International Symposium on Electronic Imaging Science and Technology, 36:1991–97, 2024. https://doi.org/10.2352/EI.2024.36.12.HPCI-199.

Lyngaas I, Meena MG, Calabrese E, Wahib M, Chen P, Igarashi J, et al. Efficient Distributed Sequence Parallelism for Transformer-based Image Segmentation. In: IS and T International Symposium on Electronic Imaging Science and Technology. 2024. p. 1991–7.

Lyngaas, I., et al. “Efficient Distributed Sequence Parallelism for Transformer-based Image Segmentation.” IS and T International Symposium on Electronic Imaging Science and Technology, vol. 36, no. 12, 2024, pp. 1991–97. Scopus, doi:10.2352/EI.2024.36.12.HPCI-199.

Lyngaas I, Meena MG, Calabrese E, Wahib M, Chen P, Igarashi J, Huo Y, Wang X. Efficient Distributed Sequence Parallelism for Transformer-based Image Segmentation. IS and T International Symposium on Electronic Imaging Science and Technology. 2024. p. 1991–1997.

Published In

IS and T International Symposium on Electronic Imaging Science and Technology

DOI

10.2352/EI.2024.36.12.HPCI-199

EISSN

2470-1173

Publication Date

January 1, 2024

Volume

Issue

Start / End Page

1991 / 1997