Skip to main content

Mitigating Cognitive Biases in Clinical Decision-Making Through Multi-Agent Conversations Using Large Language Models: Simulation Study.

Publication ,  Journal Article
Ke, Y; Yang, R; Lie, SA; Lim, TXY; Ning, Y; Li, I; Abdullah, HR; Ting, DSW; Liu, N
Published in: J Med Internet Res
November 19, 2024

BACKGROUND: Cognitive biases in clinical decision-making significantly contribute to errors in diagnosis and suboptimal patient outcomes. Addressing these biases presents a formidable challenge in the medical field. OBJECTIVE: This study aimed to explore the role of large language models (LLMs) in mitigating these biases through the use of the multi-agent framework. We simulate the clinical decision-making processes through multi-agent conversation and evaluate its efficacy in improving diagnostic accuracy compared with humans. METHODS: A total of 16 published and unpublished case reports where cognitive biases have resulted in misdiagnoses were identified from the literature. In the multi-agent framework, we leveraged GPT-4 (OpenAI) to facilitate interactions among different simulated agents to replicate clinical team dynamics. Each agent was assigned a distinct role: (1) making the final diagnosis after considering the discussions, (2) acting as a devil's advocate to correct confirmation and anchoring biases, (3) serving as a field expert in the required medical subspecialty, (4) facilitating discussions to mitigate premature closure bias, and (5) recording and summarizing findings. We tested varying combinations of these agents within the framework to determine which configuration yielded the highest rate of correct final diagnoses. Each scenario was repeated 5 times for consistency. The accuracy of the initial diagnoses and the final differential diagnoses were evaluated, and comparisons with human-generated answers were made using the Fisher exact test. RESULTS: A total of 240 responses were evaluated (3 different multi-agent frameworks). The initial diagnosis had an accuracy of 0% (0/80). However, following multi-agent discussions, the accuracy for the top 2 differential diagnoses increased to 76% (61/80) for the best-performing multi-agent framework (Framework 4-C). This was significantly higher compared with the accuracy achieved by human evaluators (odds ratio 3.49; P=.002). CONCLUSIONS: The multi-agent framework demonstrated an ability to re-evaluate and correct misconceptions, even in scenarios with misleading initial investigations. In addition, the LLM-driven, multi-agent conversation framework shows promise in enhancing diagnostic accuracy in diagnostically challenging medical scenarios.

Duke Scholars

Published In

J Med Internet Res

DOI

EISSN

1438-8871

Publication Date

November 19, 2024

Volume

26

Start / End Page

e59439

Location

Canada

Related Subject Headings

  • Medical Informatics
  • Language
  • Humans
  • Diagnostic Errors
  • Computer Simulation
  • Communication
  • Cognition
  • Clinical Decision-Making
  • Bias
  • 4203 Health services and systems
 

Citation

APA
Chicago
ICMJE
MLA
NLM
Ke, Y., Yang, R., Lie, S. A., Lim, T. X. Y., Ning, Y., Li, I., … Liu, N. (2024). Mitigating Cognitive Biases in Clinical Decision-Making Through Multi-Agent Conversations Using Large Language Models: Simulation Study. J Med Internet Res, 26, e59439. https://doi.org/10.2196/59439
Ke, Yuhe, Rui Yang, Sui An Lie, Taylor Xin Yi Lim, Yilin Ning, Irene Li, Hairil Rizal Abdullah, Daniel Shu Wei Ting, and Nan Liu. “Mitigating Cognitive Biases in Clinical Decision-Making Through Multi-Agent Conversations Using Large Language Models: Simulation Study.J Med Internet Res 26 (November 19, 2024): e59439. https://doi.org/10.2196/59439.
Ke Y, Yang R, Lie SA, Lim TXY, Ning Y, Li I, et al. Mitigating Cognitive Biases in Clinical Decision-Making Through Multi-Agent Conversations Using Large Language Models: Simulation Study. J Med Internet Res. 2024 Nov 19;26:e59439.
Ke, Yuhe, et al. “Mitigating Cognitive Biases in Clinical Decision-Making Through Multi-Agent Conversations Using Large Language Models: Simulation Study.J Med Internet Res, vol. 26, Nov. 2024, p. e59439. Pubmed, doi:10.2196/59439.
Ke Y, Yang R, Lie SA, Lim TXY, Ning Y, Li I, Abdullah HR, Ting DSW, Liu N. Mitigating Cognitive Biases in Clinical Decision-Making Through Multi-Agent Conversations Using Large Language Models: Simulation Study. J Med Internet Res. 2024 Nov 19;26:e59439.

Published In

J Med Internet Res

DOI

EISSN

1438-8871

Publication Date

November 19, 2024

Volume

26

Start / End Page

e59439

Location

Canada

Related Subject Headings

  • Medical Informatics
  • Language
  • Humans
  • Diagnostic Errors
  • Computer Simulation
  • Communication
  • Cognition
  • Clinical Decision-Making
  • Bias
  • 4203 Health services and systems