Skip to main content

Advancing real-time infectious disease forecasting using large language models.

Publication ,  Journal Article
Du, H; Zhao, Y; Zhao, J; Xu, S; Lin, X; Chen, Y; Gardner, LM; Yang, HF
Published in: Nature computational science
June 2025

Forecasting the short-term spread of an ongoing disease outbreak poses a challenge owing to the complexity of contributing factors, some of which can be characterized through interlinked, multi-modality variables, and the intersection of public policy and human behavior. Here we introduce PandemicLLM, a framework with multi-modal large language models (LLMs) that reformulates real-time forecasting of disease spread as a text-reasoning problem, with the ability to incorporate real-time, complex, non-numerical information. This approach, through an artificial intelligence-human cooperative prompt design and time-series representation learning, encodes multi-modal data for LLMs. The model is applied to the COVID-19 pandemic, and trained to utilize textual public health policies, genomic surveillance, spatial and epidemiological time-series data, and is tested across all 50 states of the United States for a duration of 19 months. PandemicLLM opens avenues for incorporating various pandemic-related data in heterogeneous formats and shows performance benefits over existing models.

Duke Scholars

Altmetric Attention Stats
Dimensions Citation Stats

Published In

Nature computational science

DOI

EISSN

2662-8457

ISSN

2662-8457

Publication Date

June 2025

Volume

5

Issue

6

Start / End Page

467 / 480

Related Subject Headings

  • United States
  • SARS-CoV-2
  • Pandemics
  • Large Language Models
  • Language
  • Humans
  • Forecasting
  • Communicable Diseases
  • COVID-19
  • Artificial Intelligence
 

Citation

APA
Chicago
ICMJE
MLA
NLM
Du, H., Zhao, Y., Zhao, J., Xu, S., Lin, X., Chen, Y., … Yang, H. F. (2025). Advancing real-time infectious disease forecasting using large language models. Nature Computational Science, 5(6), 467–480. https://doi.org/10.1038/s43588-025-00798-6
Du, Hongru, Yang Zhao, Jianan Zhao, Shaochong Xu, Xihong Lin, Yiran Chen, Lauren M. Gardner, and Hao “Frank” Yang. “Advancing real-time infectious disease forecasting using large language models.Nature Computational Science 5, no. 6 (June 2025): 467–80. https://doi.org/10.1038/s43588-025-00798-6.
Du H, Zhao Y, Zhao J, Xu S, Lin X, Chen Y, et al. Advancing real-time infectious disease forecasting using large language models. Nature computational science. 2025 Jun;5(6):467–80.
Du, Hongru, et al. “Advancing real-time infectious disease forecasting using large language models.Nature Computational Science, vol. 5, no. 6, June 2025, pp. 467–80. Epmc, doi:10.1038/s43588-025-00798-6.
Du H, Zhao Y, Zhao J, Xu S, Lin X, Chen Y, Gardner LM, Yang HF. Advancing real-time infectious disease forecasting using large language models. Nature computational science. 2025 Jun;5(6):467–480.

Published In

Nature computational science

DOI

EISSN

2662-8457

ISSN

2662-8457

Publication Date

June 2025

Volume

5

Issue

6

Start / End Page

467 / 480

Related Subject Headings

  • United States
  • SARS-CoV-2
  • Pandemics
  • Large Language Models
  • Language
  • Humans
  • Forecasting
  • Communicable Diseases
  • COVID-19
  • Artificial Intelligence