Learning Vision-based Reactive Policies for Obstacle Avoidance