Fitted Q-Learning for Relational Domains

Document Type

Poster

Subject Area(s)

Reinforcement Learning, First Order Logic

Abstract

We consider the problem of Approximate Dynamic Program- ming in relational domains. Inspired by the success of fit- ted Q-learning methods in propositional settings, we develop the first relational fitted Q-learning algorithms by represent- ing the value function and Bellman residuals. When we fit the Q-functions, we show how the two steps of Bellman op- erator; application and projection steps can be performed us- ing a gradient-boosting technique. Our proposed framework performs reasonably well on standard domains without using domain models and using fewer training trajectories.

Digital Object Identifier (DOI)

arXiv:2006.05595

APA Citation

Das, S., Natarajan, S., Roy, K., Parr, R., & Kersting, K. (2020). Fitted Q-Learning for Relational Domains. arXiv preprint arXiv:2006.05595.

Share

COinS