Skip to main content

Q-functionals for Value-Based Continuous Control

Publication ,  Conference
Lobel, S; Rammohan, S; He, B; Yu, S; Konidaris, G
Published in: Proceedings of the 37th AAAI Conference on Artificial Intelligence, AAAI 2023
June 27, 2023

We present Q-functionals, an alternative architecture for continuous control deep reinforcement learning. Instead of returning a single value for a state-action pair, our network transforms a state into a function that can be rapidly evaluated in parallel for many actions, allowing us to efficiently choose high-value actions through sampling. This contrasts with the typical architecture of off-policy continuous control, where a policy network is trained for the sole purpose of selecting actions from the Q-function. We represent our action-dependent Q-function as a weighted sum of basis functions (Fourier, Polynomial, etc) over the action space, where the weights are state-dependent and output by the Q-functional network. Fast sampling makes practical a variety of techniques that require Monte-Carlo integration over Q-functions, and enables action-selection strategies besides simple value-maximization. We characterize our framework, describe various implementations of Q-functionals, and demonstrate strong performance on a suite of continuous control tasks.

Duke Scholars

Published In

Proceedings of the 37th AAAI Conference on Artificial Intelligence, AAAI 2023

ISBN

9781577358800

Publication Date

June 27, 2023

Volume

37

Start / End Page

8932 / 8939
 

Citation

APA
Chicago
ICMJE
MLA
NLM
Lobel, S., Rammohan, S., He, B., Yu, S., & Konidaris, G. (2023). Q-functionals for Value-Based Continuous Control. In Proceedings of the 37th AAAI Conference on Artificial Intelligence, AAAI 2023 (Vol. 37, pp. 8932–8939).
Lobel, S., S. Rammohan, B. He, S. Yu, and G. Konidaris. “Q-functionals for Value-Based Continuous Control.” In Proceedings of the 37th AAAI Conference on Artificial Intelligence, AAAI 2023, 37:8932–39, 2023.
Lobel S, Rammohan S, He B, Yu S, Konidaris G. Q-functionals for Value-Based Continuous Control. In: Proceedings of the 37th AAAI Conference on Artificial Intelligence, AAAI 2023. 2023. p. 8932–9.
Lobel, S., et al. “Q-functionals for Value-Based Continuous Control.” Proceedings of the 37th AAAI Conference on Artificial Intelligence, AAAI 2023, vol. 37, 2023, pp. 8932–39.
Lobel S, Rammohan S, He B, Yu S, Konidaris G. Q-functionals for Value-Based Continuous Control. Proceedings of the 37th AAAI Conference on Artificial Intelligence, AAAI 2023. 2023. p. 8932–8939.

Published In

Proceedings of the 37th AAAI Conference on Artificial Intelligence, AAAI 2023

ISBN

9781577358800

Publication Date

June 27, 2023

Volume

37

Start / End Page

8932 / 8939