Explaining Reinforcement Learning Policies through Counterfactual Trajectories