Alert icon
We're changing our privacy policy. This stuff matters.  Learn more  Dismiss

Lecture 16 | Machine Learning (Stanford)

Loading...

Sign in or sign up now!
14,033
Loading...
Alert icon
Sign in or sign up now!
Alert icon

Uploaded by on Jul 22, 2008

Lecture by Professor Andrew Ng for Machine Learning (CS 229) in the Stanford Computer Science department. Professor Ng discusses the topic of reinforcement learning, focusing particularly on MDPs, value functions, and policy and value iteration.

This course provides a broad introduction to machine learning and statistical pattern recognition. Topics include supervised learning, unsupervised learning, learning theory, reinforcement learning and adaptive control. Recent applications of machine learning, such as to robotic control, data mining, autonomous navigation, bioinformatics, speech recognition, and text and web data processing are also discussed.

Complete Playlist for the Course:
http://www.youtube.com/view_play_list?p=A89DCFA6ADACE599

CS 229 Course Website:
http://www.stanford.edu/class/cs229/

Stanford University:
http://www.stanford.edu/

Stanford University Channel on YouTube:
http://www.youtube.com/stanford

Category:

Education

Tags:

Download this video

LICENSE: Creative Commons (Attribution-Noncommercial-No Derivative Works).

For more information about this license, please read: http://creativecommons.org/licenses/by-nc-nd/3.0/.

High-quality MP4 Learn more

  • likes, 0 dislikes

Link to this comment:

Share to:

Top Comments

  • This has been very useful to me! thank you!

  • This is a very good introduction lecture on Reinforcement Learning!

see all

All Comments (6)

Sign In or Sign Up now to post a comment!
  • In the computation you start at 57, to determine why moving west is better than moving north from the (3,1) state, it seems that you disregarded or forgot the discount factor, without mentioning it. I do think that in this case it suffices to look at undiscounted values to determine the optimal action, because there are no intermediate rewards. I find find this just a bit misleading, but I also wanted to share my thoughts. Great lecture (so far)!

Loading...

0 / 00Unsaved Playlist Return to active list
    1. Your queue is empty. Add videos to your queue using this button:
      or sign in to load a different list.
    Loading...Loading...Saving...
    • Clear all videos from this list
    • Learn more