Upload

Loading icon Loading...

This video is unavailable.

Generalized Model Learning for Reinforcement Learning on a Humanoid Robot

Sign in to YouTube

Sign in with your Google Account (YouTube, Google+, Gmail, Orkut, Picasa, or Chrome) to like AustinVilla's video.

Sign in to YouTube

Sign in with your Google Account (YouTube, Google+, Gmail, Orkut, Picasa, or Chrome) to dislike AustinVilla's video.

Sign in to YouTube

Sign in with your Google Account (YouTube, Google+, Gmail, Orkut, Picasa, or Chrome) to add AustinVilla's video to your playlist.

Uploaded on Jan 13, 2010

This is the accompanying video for the paper:

Generalized Model Learning for Reinforcement Learning on a Humanoid Robot.
Todd Hester, Michael Quinlan, and Peter Stone.
International Conference on Robotics and Automation (ICRA), May 2010.

Abstract:
Reinforcement learning (RL) algorithms have long been promising methods for enabling an autonomous robot to improve its behavior on sequential decision-making tasks. The obvious enticement is that the robot should be able to improve its own behavior without the need for detailed step-by-step programming. However, for RL to reach its full potential, the algorithms must be sample efficient: they must learn competent behavior from very few real-world trials. From this perspective, model-based methods, which use experiential data more efficiently than model-free approaches, are appealing. But they often require exhaustive exploration to learn an accurate model of the domain. In this paper, we present an algorithm, Reinforcement Learning with Decision Trees (RL-DT), that uses decision trees to learn the model by generalizing the relative effect of actions across states. The agent explores the environment until it believes it has a reasonable policy. The combination of the learning approach with the targeted exploration policy enables fast learning of the model. We compare RL-DT against standard model-free and model-based learning methods, and demonstrate its effectiveness on an Aldebaran Nao humanoid robot scoring goals in a penalty kick scenario.

More details on our RL research are available at the following links:
http://www.cs.utexas.edu/~todd/resear...
http://www.cs.utexas.edu/~AustinVilla...

There are also related videos on the 'ToddHesterResearch' youtube channel:
http://www.youtube.com/user/toddheste...

Loading icon Loading...

Loading icon Loading...

Loading icon Loading...

Loading icon Loading...

Ratings have been disabled for this video.
Rating is available when the video has been rented.
This feature is not available right now. Please try again later.

Loading icon Loading...

Advertisement
Loading...
Working...
to add this to Watch Later

Add to