Optimising Call Centre Operations using Reinforcement Learning: Value Iteration versus Proximal Policy Optimisation