Loading...

Generalized advantage estimation demo: learning to run and stand up

4,350 views

Loading...

Loading...

Rating is available when the video has been rented.
This feature is not available right now. Please try again later.
Published on Dec 28, 2016

As described in the paper "High-Dimensional Continuous Control Using Generalized Advantage Estimation" https://arxiv.org/abs/1506.02438
Observations = joint angles, joint velocities, and some cartesian positions
Actions = joint torques
Similar to the MuJoCo Humanoid in OpenAI Gym

Loading...

When autoplay is enabled, a suggested video will automatically play next.

Up next


to add this to Watch Later

Add to

Loading playlists...