Lecture-2: REINFORCEMENT LEARNING: Value Functions and Markov Property: Part-3.mp4

455 views

Uploaded on Oct 2, 2010

REINFORCEMENT LEARNING: Lecture-2: Value Functions and Markov Property. Sanjeev Sharma, Founder & Co-Owner - searching-eye.com, undergraduate, IIT R.

In this lecture I discussed episodic and continuing tasks, along with states, rewards, returns, the discounted return, and the agent-environment interaction process. I explained the discounting parameter and proved that discounting keeps the expected return finite. I then introduced the two kinds of value function for a policy: the state-value function and the action-value function. I derived the expression for the state-value function of a policy and interpreted each term in the Bellman equation. I also gave a very brief introduction to the Markov property, Markov states, and MDPs. More details on the Bellman equation and MDPs will be discussed in Lecture 3. Topics such as the Bellman optimality equation and the relation between the state-value and action-value functions are skipped in this lecture because they form the topic of Lecture 3.
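
For readers who want something concrete before watching, here is a minimal Python sketch of two ideas named in the description: the discounted return and the Bellman equation for the state-value function of a fixed policy. It is not taken from the lecture. The function names, the matrix `P` (state-to-state transition probabilities under the policy) and the vector `R` (expected immediate reward per state under the policy) are assumptions made for illustration only.

```python
def discounted_return(rewards, gamma):
    """Sum of gamma**k * rewards[k], accumulated from the last reward backwards."""
    g = 0.0
    for r in reversed(rewards):
        g = r + gamma * g
    return g


def evaluate_policy(P, R, gamma, sweeps=1000):
    """Repeatedly apply v(s) = R[s] + gamma * sum over s2 of P[s][s2] * v(s2).

    P and R are hypothetical inputs: they describe the chain induced by
    one fixed policy, not a full MDP with explicit actions.
    """
    n = len(R)
    v = [0.0] * n
    for _ in range(sweeps):
        v = [R[s] + gamma * sum(P[s][s2] * v[s2] for s2 in range(n))
             for s in range(n)]
    return v


# With every reward equal to 1 and gamma = 0.9, the return stays below
# 1 / (1 - gamma) = 10 however long the reward sequence is.
long_run = discounted_return([1.0] * 1000, 0.9)

# Hypothetical two-state chain: state 0 pays 1 and moves to state 1,
# which is absorbing and pays nothing.
values = evaluate_policy([[0.0, 1.0], [0.0, 1.0]], [1.0, 0.0], 0.9)
```

The bound in the comment is the usual geometric-series argument: if every reward is at most some constant in magnitude and gamma is strictly less than 1, the discounted sum cannot exceed that constant divided by (1 - gamma).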

Uploader Comments (sanjeev3007)

  • Every video is available on searching-eye[dot]com as a single, full-length file. The YouTube channel shows only some sample videos.

All Comments (3)

  • We can't watch your videos; they are too slow to display.
