Packet Routing in Dynamically Changing Networks: A Reinforcement Learning Approach