Abstract
We study the Online Bookmaking problem, where a bookmaker dynamically updates betting odds on the possible outcomes of an event. In each betting round, the bookmaker can adjust the odds based on the cumulative betting behavior of gamblers, aiming to maximize profit while mitigating potential loss. We show that for any event and any number of betting rounds, in a worst-case setting over all possible gamblers and outcome realizations, the bookmaker’s optimal loss is the largest root of a simple polynomial. Our solution shows that bookmakers can be as fair as desired while avoiding financial risk, and the explicit characterization reveals an intriguing relation between the bookmaker’s regret and Hermite polynomials. We develop an efficient algorithm that computes the optimal bookmaking strategy: when facing an optimal gambler, the algorithm achieves the optimal loss, and in rounds where the gambler is suboptimal, it reduces the achieved loss to the optimal opportunistic loss, a notion that is related to subgame perfect Nash equilibrium. The key technical contribution to achieve these results is an explicit characterization of the Bellman-Pareto frontier, which unifies the dynamic programming updates for Bellman’s value function with the multi-criteria optimization framework of the Pareto frontier in the context of vector repeated games.
| Original language | English |
|---|---|
| Journal | Proceedings of Machine Learning Research |
| Volume | 291 |
| State | Published - 2025 |
| Event | 38th Annual Conference on Learning Theory, COLT 2025 - Lyon, France Duration: 30 Jun 2025 → 4 Jul 2025 |
Bibliographical note
Publisher Copyright:© 2025 H. Tal & O. Sabag.
Keywords
- Online decision making
- game theory
- regret analysis
Fingerprint
Dive into the research topics of 'Optimal Online Bookmaking for Any Number of Outcomes'. Together they form a unique fingerprint.Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver