Approximating Optimal Policies for Partially Observable Stochastic Domains

Publication, Conference

Parr, R; Russell, S

Published in: IJCAI International Joint Conference on Artificial Intelligence

January 1, 1995

The problem of making optimal decisions in uncertain conditions is central to Artificial Intelligence. If the state of the world is known at all times, the world can be modeled as a Markov Decision Process (MDP). MDPs have been studied extensively and many methods are known for determining optimal courses of action, or policies. The more realistic case where state information is only partially observable, Partially Observable Markov Decision Processes (POMDPs), has received much less attention. The best exact algorithms for these problems can be very inefficient in both space and time. We introduce Smooth Partially Observable Value Approximation (SPOVA), a new approximation method that can quickly yield good approximations which can improve over time. This method can be combined with reinforcement learning methods, a combination that was very effective in our test cases.
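
To make the partially observable setting in the abstract concrete, here is a minimal sketch of the standard Bayesian belief-state update that POMDP methods in general build on. It is not taken from the paper and is not SPOVA itself. The function name `belief_update`, the table layouts `T` and `O`, and the two-state numbers are all hypothetical, chosen only for illustration.

```python
def belief_update(belief, action, observation, T, O):
    """Generic Bayes filter for a discrete POMDP (illustrative, not SPOVA).

    belief[s]          : current probability of hidden state s
    T[action][s][s2]   : assumed transition probability P(s2 | s, action)
    O[action][s2][obs] : assumed observation probability P(obs | s2, action)
    """
    n = len(belief)
    unnormalized = []
    for s_next in range(n):
        # Prediction step: probability of reaching s_next before observing.
        predicted = sum(T[action][s][s_next] * belief[s] for s in range(n))
        # Correction step: weight by the likelihood of the observation.
        unnormalized.append(O[action][s_next][observation] * predicted)
    total = sum(unnormalized)
    if total == 0.0:
        raise ValueError("observation has zero probability under this belief")
    return [p / total for p in unnormalized]


# Hypothetical two-state problem: the action leaves the state unchanged
# and the sensor reports the true state 85% of the time.
T = {"listen": [[1.0, 0.0], [0.0, 1.0]]}
O = {"listen": [[0.85, 0.15], [0.15, 0.85]]}

b0 = [0.5, 0.5]
b1 = belief_update(b0, "listen", 0, T, O)
```

Starting from a uniform belief, a single observation of `0` shifts the belief toward state 0. Exact POMDP algorithms compute value functions over this continuous belief space, which is why the abstract describes them as expensive in both space and time.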

Duke Scholars

Author: Ronald Parr, Computer Science

Published In

IJCAI International Joint Conference on Artificial Intelligence

ISSN

1045-0823

Publication Date

January 1, 1995

Volume

2

Start / End Page

1088 / 1094

Citation

APA

Parr, R., & Russell, S. (1995). Approximating Optimal Policies for Partially Observable Stochastic Domains. In IJCAI International Joint Conference on Artificial Intelligence (Vol. 2, pp. 1088–1094).

Chicago

Parr, R., and S. Russell. “Approximating Optimal Policies for Partially Observable Stochastic Domains.” In IJCAI International Joint Conference on Artificial Intelligence, 2:1088–94, 1995.

ICMJE

Parr R, Russell S. Approximating Optimal Policies for Partially Observable Stochastic Domains. In: IJCAI International Joint Conference on Artificial Intelligence. 1995. p. 1088–94.

MLA

Parr, R., and S. Russell. “Approximating Optimal Policies for Partially Observable Stochastic Domains.” IJCAI International Joint Conference on Artificial Intelligence, vol. 2, 1995, pp. 1088–94.

NLM

Parr R, Russell S. Approximating Optimal Policies for Partially Observable Stochastic Domains. IJCAI International Joint Conference on Artificial Intelligence. 1995. p. 1088–1094.
