Nonlinear Multi-objective Reinforcement Learning with Provable Guarantees