Learning To Guide Human Decision Makers With Vision-Language Models