Goto

Collaborating Authors

 Energy


Data-Driven Conditional Robust Optimization

Neural Information Processing Systems

In most real world decision problems, the decision maker (DM) faces uncertainty either in the objective function that he aims to optimize, or some of the constraints that he needs to satisfy.








A Provably Efficient Sample Collection Strategy for Reinforcement Learning

Neural Information Processing Systems

One of the challenges in online reinforcement learning (RL) is that the agent needs to trade off the exploration of the environment and the exploitation of the samples to optimize its behavior. Whether we optimize for regret, sample complexity, state-space coverage or model estimation, we need to strike a different exploration-exploitation trade-off.


A Provably Efficient Sample Collection Strategy for Reinforcement Learning

Neural Information Processing Systems

One of the challenges in online reinforcement learning (RL) is that the agent needs to trade off the exploration of the environment and the exploitation of the samples to optimize its behavior. Whether we optimize for regret, sample complexity, state-space coverage or model estimation, we need to strike a different exploration-exploitation trade-off.