Learning Generalized Policies for Fully Observable Non-Deterministic Planning Domains