Pretrained Language Models as Visual Planners for Human Assistance