Reviews: Maximum Entropy Monte-Carlo Planning

Jan-25-2025, 02:42:22 GMT–Neural Information Processing Systems

This paper presents an appealing idea to combine current max-entropy methods in RL with Monte-Carlo Tree Search. A theoretical result shows improved rate of convergence, while empirical results show improved sample efficiency. The initial reviews were quite positive; I only noted a small number of issues mentioned in the reviews of R1 and R3. In our discussions after reading the author feedback, R3 noted that some of his concerns have not been addressed. R2 replied, saying that these concerns are relatively minor and can be addressed in the final version.

maximum entropy monte-carlo planning, result show

Neural Information Processing Systems

Jan-25-2025, 02:42:22 GMT

Conferences Web Page

Add feedback

Technology:
- Information Technology > Artificial Intelligence
  - Representation & Reasoning > Planning & Scheduling (0.79)
  - Machine Learning > Statistical Learning
    - Maximum Entropy (0.40)