Home / Papers / RL-CD: Dealing with Non-Stationarity in Reinforcement Learning

RL-CD: Dealing with Non-Stationarity in Reinforcement Learning

1 Citations2006
Bruno C. da Silva, Eduardo W. Basso, A. Bazzan
journal unavailable

A method for managing multiple partial models of the environment is proposed and described and previous results show that the proposed mechanism has better convergence times comparing to standard RL algorithms.

Abstract

This student abstract describes ongoing investigations regarding an approach for dealing with non-stationarity in reinforcement learning (RL) problems. We briefly propose and describe a method for managing multiple partial models of the environment and comment previous results which show that the proposed mechanism has better convergence times comparing to standard RL algorithms. Current efforts include the development of a more robust approach, capable of dealing with noisy environments, and also investigations regarding the possibility of using partial models in order to aliviate learning problems in systems with an explosive number of states.