Skip to main content

Data-driven learning of structure augments quantitative prediction of biological responses.

Publication ,  Journal Article
Ha, Y; Ma, HR; Wu, F; Weiss, A; Duncker, K; Xu, HZ; Lu, J; Golovsky, M; Reker, D; You, L
Published in: PLoS computational biology
June 2024

Multi-factor screenings are commonly used in diverse applications in medicine and bioengineering, including optimizing combination drug treatments and microbiome engineering. Despite the advances in high-throughput technologies, large-scale experiments typically remain prohibitively expensive. Here we introduce a machine learning platform, structure-augmented regression (SAR), that exploits the intrinsic structure of each biological system to learn a high-accuracy model with minimal data requirement. Under different environmental perturbations, each biological system exhibits a unique, structured phenotypic response. This structure can be learned based on limited data and once learned, can constrain subsequent quantitative predictions. We demonstrate that SAR requires significantly fewer data comparing to other existing machine-learning methods to achieve a high prediction accuracy, first on simulated data, then on experimental data of various systems and input dimensions. We then show how a learned structure can guide effective design of new experiments. Our approach has implications for predictive control of biological systems and an integration of machine learning prediction and experimental design.

Duke Scholars

Published In

PLoS computational biology

DOI

EISSN

1553-7358

ISSN

1553-734X

Publication Date

June 2024

Volume

20

Issue

6

Start / End Page

e1012185

Related Subject Headings

  • Regression Analysis
  • Plasmids
  • Models, Biological
  • Machine Learning
  • Humans
  • Escherichia coli
  • Drug Resistance, Bacterial
  • Computer Simulation
  • Bioinformatics
  • Bioengineering
 

Citation

APA
Chicago
ICMJE
MLA
NLM
Ha, Y., Ma, H. R., Wu, F., Weiss, A., Duncker, K., Xu, H. Z., … You, L. (2024). Data-driven learning of structure augments quantitative prediction of biological responses. PLoS Computational Biology, 20(6), e1012185. https://doi.org/10.1371/journal.pcbi.1012185
Ha, Yuanchi, Helena R. Ma, Feilun Wu, Andrea Weiss, Katherine Duncker, Helen Z. Xu, Jia Lu, Max Golovsky, Daniel Reker, and Lingchong You. “Data-driven learning of structure augments quantitative prediction of biological responses.PLoS Computational Biology 20, no. 6 (June 2024): e1012185. https://doi.org/10.1371/journal.pcbi.1012185.
Ha Y, Ma HR, Wu F, Weiss A, Duncker K, Xu HZ, et al. Data-driven learning of structure augments quantitative prediction of biological responses. PLoS computational biology. 2024 Jun;20(6):e1012185.
Ha, Yuanchi, et al. “Data-driven learning of structure augments quantitative prediction of biological responses.PLoS Computational Biology, vol. 20, no. 6, June 2024, p. e1012185. Epmc, doi:10.1371/journal.pcbi.1012185.
Ha Y, Ma HR, Wu F, Weiss A, Duncker K, Xu HZ, Lu J, Golovsky M, Reker D, You L. Data-driven learning of structure augments quantitative prediction of biological responses. PLoS computational biology. 2024 Jun;20(6):e1012185.

Published In

PLoS computational biology

DOI

EISSN

1553-7358

ISSN

1553-734X

Publication Date

June 2024

Volume

20

Issue

6

Start / End Page

e1012185

Related Subject Headings

  • Regression Analysis
  • Plasmids
  • Models, Biological
  • Machine Learning
  • Humans
  • Escherichia coli
  • Drug Resistance, Bacterial
  • Computer Simulation
  • Bioinformatics
  • Bioengineering