Pushing the Limits of Reactive Planning: Learning to Escape Local Minima