Synthesizing Programmatic Reinforcement Learning Policies with Large Language Model Guided Search