Exploring multimodal implicit behavior learning for vehicle navigation in simulated cities