What Matters in Reinforcement Learning for Tractography