Provably Efficient Exploration in Quantum Reinforcement Learning with Logarithmic Worst-Case Regret

Open in new window