[P] Trying out DQN with gym and keras. Bug? • r/MachineLearning

Aug-18-2017, 17:50:33 GMT–@machinelearnbot

Do I have a bug, is it a matter of tweaking hyperparameters, or is the algorithm simply not that suited for these problems? The best performing submissions for cartpole you see on openai gym are discrete q learning with a q table represented as a lookup table. Because cartpole is simple you can discretize the 1x4 real valued state vector into a 1x4 integer vector with 8 or so bins per channel and then store a 84 element q table explicitly in memory. A neural net, even a small one is overkill as an approximation of this fixed sized q table so naturally the former performs better. Hyper parameters are a bitch to tune, and q learning isn't remotely stable 99% of the time until you balance them well enough so it can easily seem like nothing is working.

artificial intelligence, machine learning, social media, (5 more...)

@machinelearnbot

Aug-18-2017, 17:50:33 GMT

News Web Page

Add feedback

Industry:
- Media > News (0.40)

Technology:
- Information Technology
  - Communications > Social Media (1.00)
  - Artificial Intelligence > Machine Learning
    - Neural Networks (0.63)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found