Supplementary Material for: Parametrized Quantum Policies for Reinforcement Learning