the closed-loop system. In future work, more effort will be devoted to this direction. In addition, constraints on the agent’s dynamics that are inevitable in practice, such as bounded inputs and nonholonomic constraints, are interesting topics for future research.