Fast and Continual Learning for Hybrid Control Policies using Generalized Benders Decomposition