Results 1 -
5 of
5
Online learning and exploiting relational models in reinforcement learning
- In M. Veloso (Ed.), Proceedings of the 20th International Joint Conference on Artificial Intelligence (p
, 2007
"... In recent years, there has been a growing interest in using rich representations such as relational languages for reinforcement learning. However, while expressive languages have many advantages in terms of generalization and reasoning, extending existing approaches to such a relational setting is a ..."
Abstract
-
Cited by 13 (2 self)
- Add to MetaCart
In recent years, there has been a growing interest in using rich representations such as relational languages for reinforcement learning. However, while expressive languages have many advantages in terms of generalization and reasoning, extending existing approaches to such a relational setting is a non-trivial problem. In this paper, we present a first step towards the online learning and exploitation of relational models. We propose a representation for the transition and reward function that can be learned online and present a method that exploits these models by augmenting Relational Reinforcement Learning algorithms with planning techniques. The benefits and robustness of our approach are evaluated experimentally. 1
Multi-agent relational reinforcement learning. Explorations in multi-state coordination tasks
- In ToappearinLNCSbookon Learning and Adaptation in MAS
, 2006
"... ..."
Efficient learning of relational models for sequential decision making
, 2010
"... The exploration-exploitation tradeoff is crucial to reinforcement-learning (RL) agents, and a significant number of sample complexity results have been derived for agents in propositional domains. These results guarantee, with high probability, near-optimal behavior in all but a polynomial number of ..."
Abstract
-
Cited by 2 (0 self)
- Add to MetaCart
The exploration-exploitation tradeoff is crucial to reinforcement-learning (RL) agents, and a significant number of sample complexity results have been derived for agents in propositional domains. These results guarantee, with high probability, near-optimal behavior in all but a polynomial number of timesteps in the agent’s lifetime. In this work, we prove similar results for certain relational representations, primarily a class we call “relational action schemas”. These generalized models allow us to specify state transitions in a compact form, for instance describing the effect of picking up a generic block instead of picking up 10 different specific blocks. We present theoretical results on crucial subproblems in action-schema learning using the KWIK framework, which allows us to characterize the sample efficiency of an agent learning these models in a reinforcement-learning setting. These results are extended in an apprenticeship learning paradigm where and agent has access not only to its environment, but also to a teacher that can demonstrate traces of state/action/state sequences. We show that the class of action schemas that are efficiently learnable in this paradigm is strictly larger than those learnable in the online setting. We link
Convergence of Reinforcement Learning Using a Decision Tree Learner
- In Proceedings of ICML-2005 Workshop on Rich Representation for Reinforcement Learning
, 2005
"... In this paper, we propose conditions under which Q iteration using decision trees for function approximation is guaranteed to converge to the optimal policy in the limit, using only a storage space linear in the size of the decision tree. We analyze di#erent factors that influence the e#ciency ..."
Abstract
- Add to MetaCart
In this paper, we propose conditions under which Q iteration using decision trees for function approximation is guaranteed to converge to the optimal policy in the limit, using only a storage space linear in the size of the decision tree. We analyze di#erent factors that influence the e#ciency of the proposed algorithm, and in particular study the e#- ciency of di#erent concept languages. We illustrate the approach with some preliminary experiments.

