On Policy Evaluation Algorithms in Distributional Reinforcement Learning