Combining Multiple Correlated Reward and Shaping Signals by Measuring Confidence

Open in new window