[P] Deep reinforcement learning tutorial, battleship • /r/MachineLearning
I think this application is kind of fascinating. It maintains a probability distribution over possible ship locations on the board and picks shots by sampling from that estimate. This "solves" Battleship in a way. But I believe it uses a Monte Carlo search to get the probability densities. A properly trained CNN might be able to produce them in a single forward pass.
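To make the Monte Carlo idea concrete, here is a rough sketch of how such a density could be estimated. It is not the code from the linked application: the board size, fleet lengths, and the function names `sample_placement` and `estimate_density` are all assumptions for illustration. It also only conditions on known misses (not hits), and placing ships one at a time is not exactly uniform over all valid fleets, so treat it as an approximation.

```python
import random

def sample_placement(size, ships, misses, rng, max_tries=1000):
    """Randomly place every ship without overlap, avoiding known misses.

    Returns the set of occupied (row, col) cells, or None if some ship
    could not be placed within max_tries attempts.
    """
    occupied = set()
    for length in ships:
        for _ in range(max_tries):
            if rng.random() < 0.5:  # horizontal
                r = rng.randrange(size)
                c = rng.randrange(size - length + 1)
                cells = [(r, c + i) for i in range(length)]
            else:  # vertical
                r = rng.randrange(size - length + 1)
                c = rng.randrange(size)
                cells = [(r + i, c) for i in range(length)]
            if not any(cell in occupied or cell in misses for cell in cells):
                occupied.update(cells)
                break
        else:
            return None  # gave up on this ship
    return occupied

def estimate_density(size=10, ships=(5, 4, 3, 3, 2), misses=frozenset(),
                     n_samples=2000, seed=0):
    """Estimate P(cell holds a ship) by averaging over random placements."""
    rng = random.Random(seed)
    counts = [[0] * size for _ in range(size)]
    accepted = 0
    for _ in range(n_samples):
        placement = sample_placement(size, ships, misses, rng)
        if placement is None:
            continue
        accepted += 1
        for r, c in placement:
            counts[r][c] += 1
    if accepted == 0:
        raise ValueError("no valid placement found for these constraints")
    return [[n / accepted for n in row] for row in counts]

if __name__ == "__main__":
    density = estimate_density(misses={(0, 0), (4, 4)})
    # Pick the most likely cell as the next shot.
    best = max(((r, c) for r in range(10) for c in range(10)),
               key=lambda rc: density[rc[0]][rc[1]])
    print("next shot:", best)
```

Every shot needs a fresh batch of samples, which is the cost a CNN trained to map the board state directly to this density would avoid by doing it in one forward pass.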
Oct-15-2016, 23:01:13 GMT