Tomás Martínez-Marín and Tom Duckett
Fast Reinforcement Learning for Vision-guided Mobile Robots
Proc. ICRA-2005, IEEE International Conference on
Robotics and Automation
Abstract
This paper presents a new reinforcement learning algorithm for accelerating
acquisition of new skills by real mobile robots, without requiring simulation.
It speeds up Q-learning by applying memory-based sweeping
and enforcing the ``adjoining property'', a technique that exploits
the natural ordering of sensory state spaces in many robotic applications
by only allowing transitions between neighbouring states.
The algorithm is tested within an image-based visual servoing framework on
a docking task, in which the robot has to position its gripper at a desired
configuration relative to an object on a table.
In experiments, we compare the performance of the new algorithm with
a hand-designed linear controller and a scheme using the linear controller
as a bias to further accelerate the learning.
By analysis of the controllability and docking time, we show that the
biased learner could improve on the performance of the linear controller,
while requiring substantially lower training time than unbiased learning
(less than 1 hour on the real robot).
Download
pdf (http)
pdf (ftp)
Bibtex
@INPROCEEDINGS{martinez05,
TITLE = {Fast Reinforcement Learning for Vision-guided Mobile Robots},
AUTHOR = {Mart\'{i}nez-Mar\'{i}n, Tom\'{a}s and Duckett, Tom},
BOOKTITLE = {IEEE International Conference on Robotics and Automation (ICRA 2005)},
ADDRESS = {Barcelona, Spain},
YEAR = {2005},
MONTH = {April 18-22}
}