A reinforcement learning path planning approach for range-only underwater target localization with autonomous vehicles