Online Control of Unknown Time-Varying Dynamical Systems

Neural Information Processing Systems 

In fact, our algorithm enjoys sublinear adaptive regret bounds. Adaptive regret is a strictly stronger metric than standard regret and is more appropriate for time-varying systems.
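
To illustrate why adaptive regret is the stronger notion, the following is a minimal sketch under simplifying assumptions, not the paper's algorithm or setting. It assumes a finite action set and a fixed-action comparator in place of a control policy class, and uses the usual definition of adaptive regret as the maximum, over all contiguous intervals, of the regret incurred on that interval. The function names and the toy loss table are hypothetical.

```python
from typing import List


def interval_regret(losses: List[List[float]], played: List[int],
                    start: int, end: int) -> float:
    """Regret on rounds start..end (inclusive) against the best fixed
    action for that interval. losses[t][a] is the loss of action a at round t."""
    n_actions = len(losses[0])
    rounds = range(start, end + 1)
    incurred = sum(losses[t][played[t]] for t in rounds)
    best_fixed = min(sum(losses[t][a] for t in rounds) for a in range(n_actions))
    return incurred - best_fixed


def standard_regret(losses: List[List[float]], played: List[int]) -> float:
    """Regret over the whole horizon against a single fixed action."""
    return interval_regret(losses, played, 0, len(played) - 1)


def adaptive_regret(losses: List[List[float]], played: List[int]) -> float:
    """Worst-case interval regret over every contiguous interval."""
    T = len(played)
    return max(interval_regret(losses, played, r, s)
               for r in range(T) for s in range(r, T))


# Toy time-varying environment (assumed for illustration): action 0 is
# optimal for the first three rounds, action 1 for the last three.
losses = [[0.0, 1.0]] * 3 + [[1.0, 0.0]] * 3
played = [0] * 6  # a learner that never adapts to the change

print(standard_regret(losses, played))  # 0.0
print(adaptive_regret(losses, played))  # 3.0
```

In this toy example the non-adaptive learner has zero standard regret, because no single fixed action does better over the full horizon, yet its adaptive regret is linear in the length of the second phase. Since the full horizon is itself one of the intervals, adaptive regret is never smaller than standard regret.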