Learning Nonlinear Causal Reductions to Explain Reinforcement Learning Policies