Characterizing truthful multi-armed bandit mechanisms

Moshe Babaioff*, Yogeshwer Sharma, Aleksandrs Slivkins

*Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

83 Scopus citations

Abstract

We consider a multi-round auction setting motivated by pay-per-click auctions for Internet advertising. In each round the auctioneer selects an advertiser and shows her ad, which is then either clicked or not. An advertiser derives value from clicks; the value of a click is her private information. Initially, neither the auctioneer nor the advertisers have any information about the likelihood of clicks on the advertisements. The auctioneer's goal is to design a (dominant strategies) truthful mechanism that (approximately) maximizes the social welfare. If the advertisers bid their true private values, our problem is equivalent to the multi-armed bandit problem, and thus can be viewed as a strategic version of the latter. In particular, for both problems the quality of an algorithm can be characterized by regret, the difference in social welfare between the algorithm and the benchmark which always selects the same "best" advertisement. We investigate how the design of multi-armed bandit algorithms is affected by the restriction that the resulting mechanism must be truthful. We find that truthful mechanisms have certain strong structural properties - essentially, they must separate exploration from exploitation - and they incur much higher regret than the optimal multi-armed bandit algorithms. Moreover, we provide a truthful mechanism which (essentially) matches our lower bound on regret.

Original languageEnglish
Title of host publicationEC'09 - Proceedings of the 2009 ACM Conference on Electronic Commerce
Pages79-88
Number of pages10
DOIs
StatePublished - 2009
Externally publishedYes
Event2009 ACM Conference on Electronic Commerce, EC'09 - Stanford, CA, United States
Duration: 6 Jul 200910 Jul 2009

Publication series

NameProceedings of the ACM Conference on Electronic Commerce

Conference

Conference2009 ACM Conference on Electronic Commerce, EC'09
Country/TerritoryUnited States
CityStanford, CA
Period6/07/0910/07/09

Keywords

  • Mechanism design
  • Multi-armed bandits
  • Online learning
  • Single-parameter auctions
  • Truthful mechanisms

Fingerprint

Dive into the research topics of 'Characterizing truthful multi-armed bandit mechanisms'. Together they form a unique fingerprint.

Cite this