Generating Code World Models with Large Language Models Guided by Monte Carlo Tree Search

Neural Information Processing Systems 

Given the description of an environment and a task, we use an LLM guided by the GIF-MCTS method to iteratively generate and refine a candidate Code World Model (CWM). The candidate's correctness is evaluated by checking whether it correctly
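The generate-and-refine loop described above can be illustrated with a minimal sketch. It is not the GIF-MCTS method itself: the tree search is swapped for a plain greedy loop, and the LLM is swapped for a hypothetical `propose` stub that returns hand-written candidates. The scoring rule (fraction of recorded transitions that a candidate predicts exactly) is also an assumption, since the description of the correctness check is cut off in the text. All names here (`accuracy`, `refine_loop`, `make_stub_proposer`) are illustrative only.

```python
from typing import Callable, List, Optional, Tuple

# A transition is (state, action, next_state); integers keep the toy example small.
Transition = Tuple[int, int, int]
# A candidate world model maps (state, action) to a predicted next state.
Model = Callable[[int, int], int]
Proposer = Callable[[Optional[Model], float], Model]


def accuracy(model: Model, transitions: List[Transition]) -> float:
    """Assumed scoring rule: fraction of transitions predicted exactly."""
    correct = sum(1 for s, a, s_next in transitions if model(s, a) == s_next)
    return correct / len(transitions)


def refine_loop(
    propose: Proposer, transitions: List[Transition], budget: int
) -> Tuple[Optional[Model], float]:
    """Greedy stand-in for the search: keep the best-scoring candidate so far."""
    best: Optional[Model] = None
    best_score = -1.0
    for _ in range(budget):
        # A real system would prompt an LLM with the current best candidate
        # and its score; here `propose` is a hypothetical stub.
        candidate = propose(best, best_score)
        score = accuracy(candidate, transitions)
        if score > best_score:
            best, best_score = candidate, score
        if best_score == 1.0:
            break  # every recorded transition is reproduced
    return best, best_score


def make_stub_proposer() -> Proposer:
    """Hypothetical replacement for the LLM: yields fixed drafts in order."""
    drafts = iter([
        lambda s, a: s,          # ignores the action
        lambda s, a: s + a,      # wrong scale on the action
        lambda s, a: s + 2 * a,  # matches the toy dynamics below
    ])

    def propose(previous: Optional[Model], previous_score: float) -> Model:
        return next(drafts)

    return propose


# Toy environment in which the true dynamics are next_state = state + 2 * action.
transitions = [(0, 1, 2), (3, 0, 3), (1, 2, 5)]
best_model, best_score = refine_loop(make_stub_proposer(), transitions, budget=3)
```

In this toy run the first two drafts each reproduce only one of the three transitions, and the third reproduces all of them, so the loop stops with a score of 1.0. A tree search would differ in how it chooses which earlier candidate to refine next, rather than always continuing from the single best one.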
