A reinforcement learning agent for head and neck intensity-modulated radiation therapy

Publication, Journal Article
Stephens, H; Li, X; Sheng, Y; Wu, Q; Ge, Y; Wu, QJ
Published in: Frontiers in Physics
January 1, 2024

Head and neck (HN) cancers pose a difficult problem in the planning of intensity-modulated radiation therapy (IMRT) treatment. The primary tumor can be large and asymmetrical, and multiple organs at risk (OARs) with varying dose-sparing goals lie close to the target volume. Currently, there is no systematic way of automating the generation of IMRT plans, and the manual options face planning quality and long planning time challenges. In this article, we present a reinforcement learning (RL) model for the purposes of providing automated treatment planning to reduce clinical workflow time as well as providing a better starting point for human planners to modify and build upon. Several models with progressing complexity are presented, including the relevant plan dosimetry analysis and model interpretations of the resulting strategies learned by the auto-planning agent. Models were trained on a set of 40 patients and validated on a set of 20 patients. The presented models are shown to be consistent with the requirements of an RL model to be underpinned by a Markov decision process (MDP). In-depth interpretability of the models is presented by examination of the decision space using action hyperplanes. The auto-planning agent was able to generate plans with superior reduction in the mean dose of the left and right parotid glands by approximately 7 Gy ± 2.5 Gy (p < 0.01) over a starting, static template plan with only pre-defined general prescription information. RL plans were comparable to a human expert’s clinical plans for the primary (44 Gy), boost (26 Gy), and the summed plans (70 Gy) with p-values of 0.43, 0.72, and 0.67, respectively, for the dosimetric endpoints and uniform target coverage normalization. The RL planning agent was able to produce the plans used in validation in an average of 13.58 min, with a minimum and a maximum planning time of 2.27 and 44.82 min, respectively.

Published In

Frontiers in Physics

DOI

10.3389/fphy.2024.1331849

EISSN

2296-424X

Publication Date

January 1, 2024

Volume

12

Related Subject Headings

  • 51 Physical sciences
  • 49 Mathematical sciences
 

Citation

APA
Stephens, H., Li, X., Sheng, Y., Wu, Q., Ge, Y., & Wu, Q. J. (2024). A reinforcement learning agent for head and neck intensity-modulated radiation therapy. Frontiers in Physics, 12. https://doi.org/10.3389/fphy.2024.1331849

Chicago
Stephens, H., X. Li, Y. Sheng, Q. Wu, Y. Ge, and Q. J. Wu. “A reinforcement learning agent for head and neck intensity-modulated radiation therapy.” Frontiers in Physics 12 (January 1, 2024). https://doi.org/10.3389/fphy.2024.1331849.

ICMJE
Stephens H, Li X, Sheng Y, Wu Q, Ge Y, Wu QJ. A reinforcement learning agent for head and neck intensity-modulated radiation therapy. Frontiers in Physics. 2024 Jan 1;12.

MLA
Stephens, H., et al. “A reinforcement learning agent for head and neck intensity-modulated radiation therapy.” Frontiers in Physics, vol. 12, Jan. 2024. Scopus, doi:10.3389/fphy.2024.1331849.

NLM
Stephens H, Li X, Sheng Y, Wu Q, Ge Y, Wu QJ. A reinforcement learning agent for head and neck intensity-modulated radiation therapy. Frontiers in Physics. 2024 Jan 1;12.
