XVO: Generalized Visual Odometry via Cross-Modal Self-Training