Uploaded by yavnadd on Mar 5, 2009
A reinforcement learning (RL) agent learns to play the Atari 2600 game Freeway. The algorithm used is gradient-descent Sarsa(λ) (http://web.cs.ualberta.ca/~sutton/book/ebook/node89.html).
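For readers unfamiliar with the algorithm, here is a minimal sketch of gradient-descent Sarsa(λ) with a linear value function and accumulating eligibility traces. It is not the uploader's code and does not touch Freeway: the corridor environment, the one-hot features, the function name `sarsa_lambda`, and all parameter values are assumptions made up for illustration.

```python
import random

def sarsa_lambda(n_states=6, n_actions=2, episodes=200,
                 alpha=0.1, gamma=0.95, lam=0.8, epsilon=0.1, seed=0):
    """Linear gradient-descent Sarsa(lambda) on a toy corridor (hypothetical task)."""
    rng = random.Random(seed)
    # One weight per (state, action) pair: a one-hot linear approximator.
    theta = [0.0] * (n_states * n_actions)

    def q(s, a):
        return theta[s * n_actions + a]

    def choose(s):
        # Epsilon-greedy with random tie-breaking.
        if rng.random() < epsilon:
            return rng.randrange(n_actions)
        values = [q(s, a) for a in range(n_actions)]
        best = max(values)
        return rng.choice([a for a, v in enumerate(values) if v == best])

    def step(s, a):
        # Assumed dynamics: action 1 moves right, action 0 moves left.
        # Reaching the last state ends the episode with reward 1.
        s2 = min(s + 1, n_states - 1) if a == 1 else max(s - 1, 0)
        done = s2 == n_states - 1
        return s2, (1.0 if done else 0.0), done

    for _ in range(episodes):
        e = [0.0] * len(theta)  # eligibility traces, reset each episode
        s = 0
        a = choose(s)
        done = False
        while not done:
            s2, r, done = step(s, a)
            delta = r - q(s, a)  # TD error, before bootstrapping
            # For a linear Q, the gradient w.r.t. theta is the feature
            # vector, which here is one-hot at (s, a).
            e[s * n_actions + a] += 1.0
            if not done:
                a2 = choose(s2)
                delta += gamma * q(s2, a2)
            for i in range(len(theta)):
                theta[i] += alpha * delta * e[i]
                e[i] *= gamma * lam  # decay every trace
            if not done:
                s, a = s2, a2
    return theta

theta = sarsa_lambda()
```

Calling `sarsa_lambda()` returns the learned weight list, indexed as `state * n_actions + action`. A real Atari agent would replace the one-hot features with features extracted from the game screen or memory, which this sketch does not attempt.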
Education
Standard YouTube License