Skip to main content

Off-Policy Selection for Initiating Human-Centric Experimental Design

Publication ,  Conference
Gao, G; Yang, X; Gao, Q; Ju, S; Pajic, M; Chi, M
Published in: Advances in Neural Information Processing Systems
January 1, 2024

In human-centric tasks such as healthcare and education, the heterogeneity among patients and students necessitates personalized treatments and instructional interventions. While reinforcement learning (RL) has been utilized in those tasks, off-policy selection (OPS) is pivotal to close the loop by offline evaluating and selecting policies without online interactions, yet current OPS methods often overlook the heterogeneity among participants. Our work is centered on resolving a pivotal challenge in human-centric systems (HCSs): how to select a policy to deploy when a new participant joining the cohort, without having access to any prior offline data collected over the participant? We introduce First-Glance Off-Policy Selection (FPS), a novel approach that systematically addresses participant heterogeneity through sub-group segmentation and tailored OPS criteria to each sub-group. By grouping individuals with similar traits, FPS facilitates personalized policy selection aligned with unique characteristics of each participant or group of participants. FPS is evaluated via two important but challenging applications, intelligent tutoring systems and a healthcare application for sepsis treatment and intervention. FPS presents significant advancement in enhancing learning outcomes of students and in-hospital care outcomes.

Duke Scholars

Published In

Advances in Neural Information Processing Systems

ISSN

1049-5258

Publication Date

January 1, 2024

Volume

37

Related Subject Headings

  • 4611 Machine learning
  • 1702 Cognitive Sciences
  • 1701 Psychology
 

Citation

APA
Chicago
ICMJE
MLA
NLM
Gao, G., Yang, X., Gao, Q., Ju, S., Pajic, M., & Chi, M. (2024). Off-Policy Selection for Initiating Human-Centric Experimental Design. In Advances in Neural Information Processing Systems (Vol. 37).
Gao, G., X. Yang, Q. Gao, S. Ju, M. Pajic, and M. Chi. “Off-Policy Selection for Initiating Human-Centric Experimental Design.” In Advances in Neural Information Processing Systems, Vol. 37, 2024.
Gao G, Yang X, Gao Q, Ju S, Pajic M, Chi M. Off-Policy Selection for Initiating Human-Centric Experimental Design. In: Advances in Neural Information Processing Systems. 2024.
Gao, G., et al. “Off-Policy Selection for Initiating Human-Centric Experimental Design.” Advances in Neural Information Processing Systems, vol. 37, 2024.
Gao G, Yang X, Gao Q, Ju S, Pajic M, Chi M. Off-Policy Selection for Initiating Human-Centric Experimental Design. Advances in Neural Information Processing Systems. 2024.

Published In

Advances in Neural Information Processing Systems

ISSN

1049-5258

Publication Date

January 1, 2024

Volume

37

Related Subject Headings

  • 4611 Machine learning
  • 1702 Cognitive Sciences
  • 1701 Psychology