Reinforcement Learning with Formal Performance Metrics for Quadcopter Attitude Control under Non-nominal Contexts