We introduce a reinforcement learning architecture designed for problems with an infinite number of states, where each state can be represented as a vector of real numbers, and a finite number of actions, where each action requires a vector of real numbers as parameters. The main objective of this architecture is to distribute the work required to learn the final policy across two actors: one actor decides which action must be performed, while a second actor determines the right parameters for the selected action. We tested our architecture, and one algorithm based on it, on the robot dribbling problem, a challenging robot control problem taken from the RoboCup competitions. Our experimental work with three different function approximators provides strong evidence that the proposed architecture can be used to implement fast, robust, and reliable reinforcement learning algorithms.
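
To make the division of labor concrete, the following is a minimal sketch of the two-actor split in Python with NumPy. The class names (DiscreteActor, ParameterActor), the linear function approximators, and the softmax/tanh choices are illustrative assumptions, not the paper's implementation.

```python
# Illustrative sketch of the two-actor architecture described above.
# All names and the linear approximators are assumptions for illustration;
# the paper's actual approximators and update rules are not reproduced here.
import numpy as np

class DiscreteActor:
    """First actor: selects one of a finite set of actions from a real-valued state."""
    def __init__(self, state_dim, num_actions, rng):
        self.rng = rng
        self.W = rng.normal(0, 0.1, (num_actions, state_dim))  # linear action preferences

    def act(self, state):
        prefs = self.W @ state
        probs = np.exp(prefs - prefs.max())
        probs /= probs.sum()                                   # softmax policy over actions
        return self.rng.choice(len(probs), p=probs)

class ParameterActor:
    """Second actor: produces the real-valued parameter vector for the chosen action."""
    def __init__(self, state_dim, param_dims, rng):
        # one linear map per discrete action (a hypothetical design choice)
        self.maps = [rng.normal(0, 0.1, (d, state_dim)) for d in param_dims]

    def act(self, state, action):
        return np.tanh(self.maps[action] @ state)              # bounded continuous parameters

rng = np.random.default_rng(0)
state = rng.normal(size=4)                 # e.g., robot and ball pose features
actor1 = DiscreteActor(state_dim=4, num_actions=3, rng=rng)
actor2 = ParameterActor(state_dim=4, param_dims=[2, 1, 3], rng=rng)

a = actor1.act(state)                      # first actor: which action to perform
theta = actor2.act(state, a)               # second actor: parameters for that action
print(a, theta)
```

In this reading, each actor learns over a smaller problem than a single monolithic policy would face: one over a finite action set, the other over a continuous parameter space conditioned on the chosen action.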