Home / Papers / EDA-RL: estimation of distribution algorithms for reinforcement learning problems

EDA-RL: estimation of distribution algorithms for reinforcement learning problems

DOI: 10.1145/1569901.1569958Semantic Scholar

25 Citations•2009•

H. Handa

Proceedings of the 11th Annual conference on Genetic and evolutionary computation

Conditional Random Fields by Lafferty et al. is newly introduced into EDAs in this paper, which are extended to solve reinforcement learning problems which arise naturally in a framework for autonomous agents.

Abstract

By making use of probabilistic models, (EDAs) can outperform conventional evolutionary computations. In this paper, EDAs are extended to solve reinforcement learning problems which arise naturally in a framework for autonomous agents. In reinforcement learning problems, we have to find out better policies of agents such that the rewards for agents in the future are increased. In general, such a policy can be represented by conditional probabilities of the agents' actions, given the perceptual inputs. In order to estimate such a conditional probability distribution, Conditional Random Fields (CRFs) by Lafferty et al. is newly introduced into EDAs in this paper. The reason for adopting CRFs is that CRFs are able to learn conditional probabilistic distributions from a large amount of input-output data, i.e., episodes in the case of reinforcement learning problems. On the other hand, conventional reinforcement learning algorithms can only learn incrementally. Computer simulations of Probabilistic Transition Problems and Perceptual Aliasing Maze Problems show the effectiveness of EDA-RL.