Skip to main navigation Skip to search Skip to main content

Modeling the Centaur: Human-Machine Synergy in Sequential Decision Making

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

The field of collective intelligence studies how teams can achieve better results than any of the team members alone. The special case of human-machine teams carries unique challenges in this regard. For example, human teams often achieve synergy by communicating to discover their relative advantages, which is not an option if the team partner is an unexplainable deep neural network. Between 2005-2008 a set of "freestyle" chess tournaments were held, in which human-machine teams known as "centaurs", outperformed the best humans and best machines alone. Centaur players reported that they identified relative advantages between themselves and their chess program, even though the program was superhuman. Inspired by this and leveraging recent open-source models, we study simulated human-machine teams in chess. A human behavioral clone ("Maia") and a pure self-play RL-trained chess engine ("Leela") were composed into a team using a Mixture of Experts (MoE) architecture. By directing our research question at the selection mechanism of the MoE, we could isolate the issue of extracting relative advantages without knowledge sharing. We show that in principle, there is high potential for synergy between human and machine in a complex sequential decision environment such as chess. Furthermore, we show that an expert can identify only a small part of these relative advantages, and that the contribution of its subject matter expertise in doing so saturates quickly. This is probably due to the "curse of knowledge" phenomenon. We also train a network to recognize relative advantages using reinforcement learning, without chess expertise, and it outdoes the expert. Our experiments are repeated in asymmetric teams, in which identifying relative advantages is more challenging. Our findings contribute to the study of collective intelligence and human-centric AI.

Original languageEnglish
Title of host publicationProceedings of the 24th International Conference on Autonomous Agents and Multiagent Systems, AAMAS 2025
EditorsYevgeniy Vorobeychik, Sanmay Das, Ann Nowe
PublisherInternational Foundation for Autonomous Agents and Multiagent Systems (IFAAMAS)
Pages1941-1949
Number of pages9
ISBN (Electronic)9798400714269
StatePublished - 2025
Event24th International Conference on Autonomous Agents and Multiagent Systems, AAMAS 2025 - Detroit, United States
Duration: 19 May 202523 May 2025

Publication series

NameProceedings of the International Joint Conference on Autonomous Agents and Multiagent Systems, AAMAS
ISSN (Print)1548-8403
ISSN (Electronic)1558-2914

Conference

Conference24th International Conference on Autonomous Agents and Multiagent Systems, AAMAS 2025
Country/TerritoryUnited States
CityDetroit
Period19/05/2523/05/25

Bibliographical note

Publisher Copyright:
© 2025 International Foundation for Autonomous Agents and Multiagent Systems (www.ifaamas.org).

Keywords

  • Behavioral Clones
  • Collective Intelligence
  • Human-Machine Teaming
  • Mixture of Experts
  • Reinforcement Learning

Fingerprint

Dive into the research topics of 'Modeling the Centaur: Human-Machine Synergy in Sequential Decision Making'. Together they form a unique fingerprint.

Cite this