Hyperparametersforthealgorithm Wedefine Ed=3d e e 1 ln{ 3+3(2B/ε)

Neural Information Processing Systems 

We need some more notation in order to linearize the value function.