Scholars@Duke publication: Non-Parametric Approximate Linear Programming for MDPs

Non-Parametric Approximate Linear Programming for MDPs

Publication , Conference

Pazis, J; Parr, R

Published in: Proceedings of the 25th AAAI Conference on Artificial Intelligence, AAAI 2011

August 11, 2011

The Approximate Linear Programming (ALP) approach to value function approximation for MDPs is a parametric value function approximation method, in that it represents the value function as a linear combination of features which are chosen a priori. Choosing these features can be a difficult challenge in itself. One recent effort, Regularized Approximate Linear Programming (RALP), uses L1 regularization to address this issue by combining a large initial set of features with a regularization penalty that favors a smooth value function with few non-zero weights. Rather than using smoothness as a backhanded way of addressing the feature selection problem, this paper starts with smoothness and develops a non-parametric approach to ALP that is consistent with the smoothness assumption. We show that this new approach has some favorable practical and analytical properties in comparison to (R)ALP.

Duke Scholars

Author Ronald Parr Computer Science

Published In

Proceedings of the 25th AAAI Conference on Artificial Intelligence, AAAI 2011

DOI

10.1609/aaai.v25i1.7930

Publication Date

August 11, 2011

Start / End Page

459 / 464

Citation

APA

Chicago

ICMJE

MLA

NLM

Pazis, J., & Parr, R. (2011). Non-Parametric Approximate Linear Programming for MDPs. In Proceedings of the 25th AAAI Conference on Artificial Intelligence, AAAI 2011 (pp. 459–464). https://doi.org/10.1609/aaai.v25i1.7930

Pazis, J., and R. Parr. “Non-Parametric Approximate Linear Programming for MDPs.” In Proceedings of the 25th AAAI Conference on Artificial Intelligence, AAAI 2011, 459–64, 2011. https://doi.org/10.1609/aaai.v25i1.7930.

Pazis J, Parr R. Non-Parametric Approximate Linear Programming for MDPs. In: Proceedings of the 25th AAAI Conference on Artificial Intelligence, AAAI 2011. 2011. p. 459–64.

Pazis, J., and R. Parr. “Non-Parametric Approximate Linear Programming for MDPs.” Proceedings of the 25th AAAI Conference on Artificial Intelligence, AAAI 2011, 2011, pp. 459–64. Scopus, doi:10.1609/aaai.v25i1.7930.

Pazis J, Parr R. Non-Parametric Approximate Linear Programming for MDPs. Proceedings of the 25th AAAI Conference on Artificial Intelligence, AAAI 2011. 2011. p. 459–464.

Published In

Proceedings of the 25th AAAI Conference on Artificial Intelligence, AAAI 2011

DOI

10.1609/aaai.v25i1.7930

Publication Date

August 11, 2011

Start / End Page

459 / 464