The dynamics of the environment is changed yet again during this stage - a human enters the scene and causes the out-of-reach object sometimes appear within reach of the robot when an indication of assistance request is observed through the robot's actions. Again, the robot notice the change in the environmental dynamics and discovers the appropriate cues and corresponding optimal action sequence such that rate of reward can increase once again.
Link to this comment:
All Comments (0)