Non-Stochastic Control with Bandit Feedback Paula Gradu 1,3 John Hallman 1, 3 Elad Hazan

Neural Information Processing Systems 

For this problem, with either a known or unknown system, we give an efficient sublinear regret algorithm.

Similar Docs  Excel Report  more

TitleSimilaritySource
None found