Sort by time | Sort by thread (beta)

Link to this comment:

Share to:
see all

All Comments (5)

Sign In or Sign Up now to post a comment!
  • B.F. Skinner would be proud! Finally, an extension of his science to other aspects of humanity, specifically the development of artificial intelligence. What an elegant curve in the behavior of the robot, too! Great work. Perhaps other studies on variable, fixed, and interval schedules of reinforcement will yield data trends consistent with living organisms?

  • I might do a research about this for my last subject before I get my degree on computer science. It will be hard but at least i'm near the end :p

  • Nice, he learns through rewards.

  • so the RL is to do random actions when the reward is minus and when it maximizes you improve by small steps [IE Learning Rate] ?

  • RL is Nao!!!!

Loading...
0 / 00Unsaved Playlist Return to active list
    1. Your queue is empty. Add videos to your queue using this button:
      or sign in to load a different list.
    Loading...Loading...Saving...
    • Clear all videos from this list
    • Learn more