[D] How to deal with non-Markovian decision processes with large/infinite horizon using MCTS? • r/MachineLearning

Jan-2-2018, 14:30:52 GMT–@machinelearnbot

A quick Google search suggests that MCTS is applicable to RL tasks with large or infinite horizons. However, there seems to be no empirical confirmation that it works as well on such tasks as it does on Go. Assume that no rollouts are used, as in AlphaZero. Go's state space is larger than that of other games, but its horizon is short (not much longer than 100 timesteps). The state space of many real-world problems grows exponentially with respect to the timestep in the following sense.
