A Comparison of Reward Functions in Q-Learning Applied to a Cart Position Problem

Open in new window