Learning Setup Policies: Reliable Transition Between Locomotion Behaviours