Skip to main content

Inferring protein fitness landscapes from laboratory evolution experiments.

Publication ,  Journal Article
D'Costa, S; Hinds, EC; Freschlin, CR; Song, H; Romero, PA
Published in: PLoS computational biology
March 2023

Directed laboratory evolution applies iterative rounds of mutation and selection to explore the protein fitness landscape and provides rich information regarding the underlying relationships between protein sequence, structure, and function. Laboratory evolution data consist of protein sequences sampled from evolving populations over multiple generations and this data type does not fit into established supervised and unsupervised machine learning approaches. We develop a statistical learning framework that models the evolutionary process and can infer the protein fitness landscape from multiple snapshots along an evolutionary trajectory. We apply our modeling approach to dihydrofolate reductase (DHFR) laboratory evolution data and the resulting landscape parameters capture important aspects of DHFR structure and function. We use the resulting model to understand the structure of the fitness landscape and find numerous examples of epistasis but an overall global peak that is evolutionarily accessible from most starting sequences. Finally, we use the model to perform an in silico extrapolation of the DHFR laboratory evolution trajectory and computationally design proteins from future evolutionary rounds.

Duke Scholars

Published In

PLoS computational biology

DOI

EISSN

1553-7358

ISSN

1553-734X

Publication Date

March 2023

Volume

19

Issue

3

Start / End Page

e1010956

Related Subject Headings

  • Tetrahydrofolate Dehydrogenase
  • Proteins
  • Mutation
  • Models, Genetic
  • Genetic Fitness
  • Evolution, Molecular
  • Epistasis, Genetic
  • Bioinformatics
  • Amino Acid Sequence
  • 08 Information and Computing Sciences
 

Citation

APA
Chicago
ICMJE
MLA
NLM
D’Costa, S., Hinds, E. C., Freschlin, C. R., Song, H., & Romero, P. A. (2023). Inferring protein fitness landscapes from laboratory evolution experiments. PLoS Computational Biology, 19(3), e1010956. https://doi.org/10.1371/journal.pcbi.1010956
D’Costa, Sameer, Emily C. Hinds, Chase R. Freschlin, Hyebin Song, and Philip A. Romero. “Inferring protein fitness landscapes from laboratory evolution experiments.PLoS Computational Biology 19, no. 3 (March 2023): e1010956. https://doi.org/10.1371/journal.pcbi.1010956.
D’Costa S, Hinds EC, Freschlin CR, Song H, Romero PA. Inferring protein fitness landscapes from laboratory evolution experiments. PLoS computational biology. 2023 Mar;19(3):e1010956.
D’Costa, Sameer, et al. “Inferring protein fitness landscapes from laboratory evolution experiments.PLoS Computational Biology, vol. 19, no. 3, Mar. 2023, p. e1010956. Epmc, doi:10.1371/journal.pcbi.1010956.
D’Costa S, Hinds EC, Freschlin CR, Song H, Romero PA. Inferring protein fitness landscapes from laboratory evolution experiments. PLoS computational biology. 2023 Mar;19(3):e1010956.

Published In

PLoS computational biology

DOI

EISSN

1553-7358

ISSN

1553-734X

Publication Date

March 2023

Volume

19

Issue

3

Start / End Page

e1010956

Related Subject Headings

  • Tetrahydrofolate Dehydrogenase
  • Proteins
  • Mutation
  • Models, Genetic
  • Genetic Fitness
  • Evolution, Molecular
  • Epistasis, Genetic
  • Bioinformatics
  • Amino Acid Sequence
  • 08 Information and Computing Sciences