Alert icon
We're changing our privacy policy. This stuff matters.  Learn more  Dismiss

Lecture-3: REINFORCEMENT LEARNING: Iterative Algorithms: Part-1.mp4

Loading...

Sign in or sign up now!
Alert icon
Upgrade to the latest Flash Player for improved playback performance. Upgrade now or more info.
597 views
Loading...
Alert icon
Sign in or sign up now!
Alert icon

Uploaded by on Oct 3, 2010

REINFORCEMENT LEARNING: Lecture - 3: ITERATIVE ALGORITHMS & SINGLE AGENT PATH PLANNING IN FOMDPs. (Fully observable MDPs). Sanjeev Sharma - Founder & Co-Owner - searching-eye.com , Undergraduate Indian Institute of Technology Roorkee.

CONTENTS:
Optimal Value Functions, Bellman Optimality Equation, Relation b/w Optimal Action value function and Optimal State-Value Function, Policy Evaluation, Policy Iteration, Value Iteration, Policy Improvement, Agent Path Planning in Static Environment in FOMDPs.

DESCRIPTION:
In this lecture first of all I mentioned few things from the previous lecture. Then I provided an introduction to optimal policies, details about the relationship b/w the Optimal State Value Function and OPtimal Action Value Function. Then I mentioned the Bellman Optimality Equation for both the State-Value function and Action-Value Function. Then I provided a brief overview of my Example in Agent\'s Path Planning in Static Environment in Fully Observable MDPs. Then I provided the details of 4 most important algorithms i.e. Policy Evaluation, Policy Improvement, Policy Iteration and Value Iteration.

AGENT PATH PLANNING: For Path Planning of agent, in this lecture I used the Policy Evaluation Algorithm for FOMDPs. See the path planning channel

  • likes, 0 dislikes

Link to this comment:

Share to:
see all

All Comments (0)

Sign In or Sign Up now to post a comment!
Loading...

Alert icon
0 / 00Unsaved Playlist Return to active list
    1. Your queue is empty. Add videos to your queue using this button:
      or sign in to load a different list.
    Loading...Loading...Saving...
    • Clear all videos from this list
    • Learn more