Learning Novel Policies For Tasks