Sample-Efficient Multi-Objective Learning via Generalized Policy Improvement Prioritization