Goto

Collaborating Authors

 definedin



DistributionallyRobustOptimizationviaBallOracle Acceleration

Neural Information Processing Systems

In particular, we consider group-structured and boundedfdivergence uncertainty sets. Our approach relies on an accelerated method that queries a ball optimization oracle, i.e., a subroutine that minimizes the objective within a small ball around the query point. Our main contribution is efficient implementations of this oracle for DRO objectives.





Stage-wiseConservativeLinearBandits

Neural Information Processing Systems

Forinstance,comparedto existing solutions, we showthat SCLTS plays the (non-optimal) baseline action at most O(logT) times (compared toO( T)). Finally, we make connections to another studied form of "safety constraints" that takes the form of anupper bound on the instantaneous reward.



244edd7e85dc81602b7615cd705545f5-Supplemental.pdf

Neural Information Processing Systems

We begin by proving the lower bound on coverage. The formal proof of this statement is standard at this point, so we simply refer to [3] for the remaining technical details. The proof for the upper bound also immediatelyfollowsfrom(S6)byapplyingLemma2in[3]. The proof is essentially an application of the main result in [2]. This will become apparent after we reduce our claim to the setting in the aforementioned paper.