Spatial-temporal recurrent reinforcement learning for autonomous ships