Goto

Collaborating Authors

 operationsresearch




No-regretLearninginPriceCompetitionsunder ConsumerReferenceEffects

Neural Information Processing Systems

We focus on the setting where firms are not aware of demand functions and how reference prices areformed but haveaccess to an oracle that provides a measure of consumers' responsiveness to the current posted prices.


ProductRankingforRevenueMaximizationwith MultiplePurchases

Neural Information Processing Systems

Online retailing has become increasingly popular over the last decades [17, 28, 52]. The way of product ranking is the crux for online retailers because it determines the consumers' shopping behaviors [17] and thus influences the retailers' revenue [20, 49]. For instance, the probability of consumers' purchasing from a firm or clicking an advertisement is strongly related to the display order[8,3,33].



Multi-armedBanditRequiringMonotoneArm Sequences

Neural Information Processing Systems

Popular algorithms suchasUCB[4,5]andThompson sampling [3,34]typically explorethearms sufficiently and as more evidence is gathered, converge to the optimal arm.




1673a54332b2afc905722048c26f5a4c-Paper-Conference.pdf

Neural Information Processing Systems

We propose a randomized dynamic pricing policy based on a variant of the Online Newton Stepalgorithm (ONS)thatachievesaO(d T log(T))regretguarantee underan adversarial arrival model.