PG-Rainbow: Using Distributional Reinforcement Learning in Policy Gradient Methods