Maximum Entropy Monte-Carlo Planning
Chenjun Xiao, Ruitong Huang, Jincheng Mei, Dale Schuurmans, Martin Müller
–Neural Information Processing Systems
The idea is to augment Monte-Carlo TreeSearch (MCTS) withmaximum entropypolicyoptimization, evaluatingeach search node bysoftmax values back-propagated from simulation.
Neural Information Processing Systems
Feb-12-2026, 17:47:59 GMT
- Country:
- Technology: