Approximate Policy Iteration with a Policy Language Bias: Solving Relational Markov Decision Processes

Open in new window