Loading...
Uploaded by mmutsuda on Jan 20, 2011
Lego NXT with an implemented Q-Learning Reinforcement learning algorithm learns how to get the "candy".When the touch sensor is activated, the Robot gets the reinforcement. During the training it learns the best way to obtain it.
Science & Technology
Standard YouTube License
Load more suggestions
Link to this comment:
All Comments (0)