One Solution is Not All You Need: Few-Shot Extrapolation via Structured MaxEnt RL