This video demonstrates an Aibo learning the XOR task: it receives a reward after either a red or a blue stimulus presented alone, but not when both are presented together. The temporal-difference (TD) learning component lets the robot learn when, after a stimulus, the reward should arrive, so that it can handle real-time interactions.
Published in the proceedings of the Second International Conference on Development and Learning (ICDL 2002).
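The XOR-with-timing idea described above can be illustrated with a small simulation. What follows is only a minimal sketch under stated assumptions, not the model used on the robot: it uses plain linear TD(0), a one-hot "time since stimulus onset" code for each stimulus, and an extra conjunction feature for "red and blue together" (XOR is not linearly separable without something like it). The names `features`, `run_trial`, and `train`, and all constants, are hypothetical choices made for illustration.

```python
import numpy as np

STEPS = 5        # time steps per trial (hypothetical)
REWARD_STEP = 3  # reward arrives on entering this step (hypothetical)
GAMMA = 0.9      # discount factor (hypothetical)
ALPHA = 0.1      # learning rate (hypothetical)


def features(red, blue, t):
    """One-hot time-since-onset code for red, blue, and their conjunction."""
    x = np.zeros(3 * STEPS)
    if red:
        x[t] = 1.0
    if blue:
        x[STEPS + t] = 1.0
    if red and blue:
        x[2 * STEPS + t] = 1.0  # assumed conjunction unit, needed for XOR
    return x


def run_trial(w, red, blue, learn=True, omit_reward=False):
    """Run one trial, updating w in place; return the TD error at each step."""
    rewarded = (red != blue) and not omit_reward  # XOR reward rule
    deltas = []
    for t in range(STEPS):
        x = features(red, blue, t)
        last = t + 1 == STEPS
        v_next = 0.0 if last else w @ features(red, blue, t + 1)
        r = 1.0 if rewarded and t + 1 == REWARD_STEP else 0.0
        delta = r + GAMMA * v_next - w @ x  # TD(0) prediction error
        if learn:
            w += ALPHA * delta * x
        deltas.append(float(delta))
    return deltas


def train(epochs=2000):
    """Cycle through all four stimulus combinations for a fixed number of epochs."""
    w = np.zeros(3 * STEPS)
    trials = [(True, False), (False, True), (True, True), (False, False)]
    for _ in range(epochs):
        for red, blue in trials:
            run_trial(w, red, blue)
    return w


w = train()
# Predicted value at stimulus onset for each stimulus combination
for red, blue in [(True, False), (False, True), (True, True)]:
    print(red, blue, round(float(w @ features(red, blue, 0)), 2))
# TD error per step when an expected reward is withheld on a red-only trial
print(np.round(run_trial(w, True, False, learn=False, omit_reward=True), 2))
```

In this sketch, the learned value at stimulus onset should be high for red alone or blue alone and near zero for both together. Withholding the reward on a red-only trial should produce a strongly negative TD error at the step where the reward was expected, which is the sense in which the learner knows when the reward should have arrived.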