Rating is available when the video has been rented.
This feature is not available right now. Please try again later.
Published on Dec 28, 2016
As described in the paper "High-Dimensional Continuous Control Using Generalized Advantage Estimation" https://arxiv.org/abs/1506.02438 Observations = joint angles, joint velocities, and some cartesian positions Actions = joint torques Similar to the MuJoCo Humanoid in OpenAI Gym