Skip to main content

Non-Parametric Approximate Linear Programming for MDPs

Publication ,  Conference
Pazis, J; Parr, R
Published in: Proceedings of the 25th AAAI Conference on Artificial Intelligence, AAAI 2011
August 11, 2011

The Approximate Linear Programming (ALP) approach to value function approximation for MDPs is a parametric value function approximation method, in that it represents the value function as a linear combination of features which are chosen a priori. Choosing these features can be a difficult challenge in itself. One recent effort, Regularized Approximate Linear Programming (RALP), uses L1 regularization to address this issue by combining a large initial set of features with a regularization penalty that favors a smooth value function with few non-zero weights. Rather than using smoothness as a backhanded way of addressing the feature selection problem, this paper starts with smoothness and develops a non-parametric approach to ALP that is consistent with the smoothness assumption. We show that this new approach has some favorable practical and analytical properties in comparison to (R)ALP.

Duke Scholars

Published In

Proceedings of the 25th AAAI Conference on Artificial Intelligence, AAAI 2011

DOI

Publication Date

August 11, 2011

Start / End Page

459 / 464
 

Citation

APA
Chicago
ICMJE
MLA
NLM
Pazis, J., & Parr, R. (2011). Non-Parametric Approximate Linear Programming for MDPs. In Proceedings of the 25th AAAI Conference on Artificial Intelligence, AAAI 2011 (pp. 459–464). https://doi.org/10.1609/aaai.v25i1.7930
Pazis, J., and R. Parr. “Non-Parametric Approximate Linear Programming for MDPs.” In Proceedings of the 25th AAAI Conference on Artificial Intelligence, AAAI 2011, 459–64, 2011. https://doi.org/10.1609/aaai.v25i1.7930.
Pazis J, Parr R. Non-Parametric Approximate Linear Programming for MDPs. In: Proceedings of the 25th AAAI Conference on Artificial Intelligence, AAAI 2011. 2011. p. 459–64.
Pazis, J., and R. Parr. “Non-Parametric Approximate Linear Programming for MDPs.” Proceedings of the 25th AAAI Conference on Artificial Intelligence, AAAI 2011, 2011, pp. 459–64. Scopus, doi:10.1609/aaai.v25i1.7930.
Pazis J, Parr R. Non-Parametric Approximate Linear Programming for MDPs. Proceedings of the 25th AAAI Conference on Artificial Intelligence, AAAI 2011. 2011. p. 459–464.

Published In

Proceedings of the 25th AAAI Conference on Artificial Intelligence, AAAI 2011

DOI

Publication Date

August 11, 2011

Start / End Page

459 / 464