Online Learning of Optimal Bidding Strategy in Repeated Multi-Commodity Auctions

Baltaoglu, M. Sevi, Tong, Lang, Zhao, Qing

Feb-14-2020, 15:41:15 GMT–Neural Information Processing Systems

We study the online learning problem of a bidder who participates in repeated auctions. With the goal of maximizing his T-period payoff, the bidder determines the optimal allocation of his budget among his bids for $K$ goods at each period. As a bidding strategy, we propose a polynomial-time algorithm, inspired by the dynamic programming approach to the knapsack problem. The proposed algorithm, referred to as dynamic programming on discrete set (DPDS), achieves a regret order of $O(\sqrt{T\log{T}})$. By showing that the regret is lower bounded by $\Omega(\sqrt{T})$ for any strategy, we conclude that DPDS is order optimal up to a $\sqrt{\log{T}}$ term.

online learning, optimal bidding strategy, repeated multi-commodity auction, (3 more...)

Neural Information Processing Systems

Feb-14-2020, 15:41:15 GMT

Conferences Web Page

Add feedback

Country:
- North America > United States > New York (0.09)

Industry:
- Education > Educational Setting > Online (0.71)

Technology:
- Information Technology
  - Artificial Intelligence
    - Machine Learning (1.00)
    - Representation & Reasoning (0.90)
  - Enterprise Applications > Human Resources
    - Learning Management (0.71)