We added extra experiments in simulation to evaluate the best-performing policy in environments with unseen obstacles. Here the pdf file describes the experiment design and shows the experimental settings and results in a figure and a table. A brief analysis of the results has been provided. We have also attached a video capturing part of the testing process in Gazebo