Goto

Collaborating Authors

 South America


Prior-Free Dynamic Auctions with Low Regret Buyers

Neural Information Processing Systems

We study the problem of how to repeatedly sell to a buyer running a no-regret,mean-based algorithm. Previous work [Braverman et al., 2018] shows that it ispossible to design effective mechanisms in such a setting that extract almost allof the economic surplus, but these mechanisms require the buyer's values each





Multi-Step Generalized Policy Improvement by Leveraging Approximate Models Lucas N. Alegre 1, 2 Ana L. C. Bazzan 1 Ann Now é 2 Bruno C. da Silva 3 1

Neural Information Processing Systems

We introduce a principled method for performing zero-shot transfer in reinforcement learning (RL) by exploiting approximate models of the environment. Zero-shot transfer in RL has been investigated by leveraging methods rooted in generalized policy improvement (GPI) and successor features (SFs).