Reinforcement-Learning based routing for packet-optical networks with hybrid telemetry