Scholars@Duke publication: ESense: BioMimetic modeling of echolocation and electrolocation using homeostatic dual-layered reinforcement learning

ESense: BioMimetic modeling of echolocation and electrolocation using homeostatic dual-layered reinforcement learning

Publication , Conference

Franklin, DM; Martin, D

Published in: Proceedings of the Southeast Conference Acmse 2017

April 13, 2017

This research explores diferent approaches to finding a moving target in a gridworld through reinforcement learning. Onewell known method for implementing reinforcement learning is the SARSA-γ algorithm. The traditional SARSA-γ algorithm is inefficient at finding a moving target because it relies only on the learned values of a stationary Q-table. While this works in static environments (e.g., finding optimal routes through a challenging environment to a stationary goal). The proposed solution to this problem is eSense. eSensex is a dual-layered, dynamic, homeostatic SARSA-γ algorithm with eligibility traces. It gives the AI a temporal sense (so it knows what is around it) to aid in the learning process. The dual-layered descriptor signifies that there are actually two grids in place, one for the navigation within the environment and another one that tracks the area surrounding the agent. Because this second grid moves around on the environment grid, it is dynamic. Additionally, the target the agent is pursuing is also moving, so it is also dynamic. Additionally, since this second grid is centered around the agent it is homeostatic (centered around the agent). Finally, the eligibility traces provide enhanced learning within this environment by providing more feedback per iteration (i.e., more states are updated each iteration). This enhanced configuration has helped eSense learn the target's tendencies while still relying on the Q-table to guide it away from walls and other obstacles. This layered approach provides an improvement to the standard SARSA-γ approach.

Duke Scholars

Author Derek Martin Engineering Graduate and Professional Programs

Published In

Proceedings of the Southeast Conference Acmse 2017

DOI

10.1145/3077286.3077309

Publication Date

April 13, 2017

Start / End Page

128 / 133

Citation

APA

Chicago

ICMJE

MLA

NLM

Franklin, D. M., & Martin, D. (2017). ESense: BioMimetic modeling of echolocation and electrolocation using homeostatic dual-layered reinforcement learning. In Proceedings of the Southeast Conference Acmse 2017 (pp. 128–133). https://doi.org/10.1145/3077286.3077309

Franklin, D. M., and D. Martin. “ESense: BioMimetic modeling of echolocation and electrolocation using homeostatic dual-layered reinforcement learning.” In Proceedings of the Southeast Conference Acmse 2017, 128–33, 2017. https://doi.org/10.1145/3077286.3077309.

Franklin DM, Martin D. ESense: BioMimetic modeling of echolocation and electrolocation using homeostatic dual-layered reinforcement learning. In: Proceedings of the Southeast Conference Acmse 2017. 2017. p. 128–33.

Franklin, D. M., and D. Martin. “ESense: BioMimetic modeling of echolocation and electrolocation using homeostatic dual-layered reinforcement learning.” Proceedings of the Southeast Conference Acmse 2017, 2017, pp. 128–33. Scopus, doi:10.1145/3077286.3077309.

Franklin DM, Martin D. ESense: BioMimetic modeling of echolocation and electrolocation using homeostatic dual-layered reinforcement learning. Proceedings of the Southeast Conference Acmse 2017. 2017. p. 128–133.

Published In

Proceedings of the Southeast Conference Acmse 2017

DOI

10.1145/3077286.3077309

Publication Date

April 13, 2017

Start / End Page

128 / 133