Scholars@Duke publication: Linear complementarity for regularized policy evaluation and improvement

Linear complementarity for regularized policy evaluation and improvement

Publication , Conference

Johns, J; Painter-Wakefield, C; Parr, R

Published in: Advances in Neural Information Processing Systems 23 24th Annual Conference on Neural Information Processing Systems 2010 Nips 2010

January 1, 2010

Recent work in reinforcement learning has emphasized the power of L 1 regularization to perform feature selection and prevent overfitting. We propose formulating the L1 regularized linear fixed point problemas a linear complementarity problem (LCP). This formulation offers several advantages over the LARS-inspired formulation, LARS-TD. The LCP formulation allows the use of efficient off-theshelf solvers, leads to a new uniqueness result, and can be initialized with starting points from similar problems (warm starts). We demonstrate that warm starts, as well as the efficiency of LCP solvers, can speed up policy iteration. Moreover, warm starts permit a form of modified policy iteration that can be used to approximate a "greedy" homotopy path, a generalization of the LARS-TD homotopy path that combines policy evaluation and optimization.

Duke Scholars

Author Ronald Parr Computer Science

Published In

Advances in Neural Information Processing Systems 23 24th Annual Conference on Neural Information Processing Systems 2010 Nips 2010

Publication Date

January 1, 2010

Citation

APA

Chicago

ICMJE

MLA

NLM

Johns, J., Painter-Wakefield, C., & Parr, R. (2010). Linear complementarity for regularized policy evaluation and improvement. In Advances in Neural Information Processing Systems 23 24th Annual Conference on Neural Information Processing Systems 2010 Nips 2010.

Johns, J., C. Painter-Wakefield, and R. Parr. “Linear complementarity for regularized policy evaluation and improvement.” In Advances in Neural Information Processing Systems 23 24th Annual Conference on Neural Information Processing Systems 2010 Nips 2010, 2010.

Johns J, Painter-Wakefield C, Parr R. Linear complementarity for regularized policy evaluation and improvement. In: Advances in Neural Information Processing Systems 23 24th Annual Conference on Neural Information Processing Systems 2010 Nips 2010. 2010.

Johns, J., et al. “Linear complementarity for regularized policy evaluation and improvement.” Advances in Neural Information Processing Systems 23 24th Annual Conference on Neural Information Processing Systems 2010 Nips 2010, 2010.

Johns J, Painter-Wakefield C, Parr R. Linear complementarity for regularized policy evaluation and improvement. Advances in Neural Information Processing Systems 23 24th Annual Conference on Neural Information Processing Systems 2010 Nips 2010. 2010.

Published In

Advances in Neural Information Processing Systems 23 24th Annual Conference on Neural Information Processing Systems 2010 Nips 2010

Publication Date

January 1, 2010