Skip to main content
Journal cover image

BBK* (Branch and Bound Over K*): A Provable and Efficient Ensemble-Based Protein Design Algorithm to Optimize Stability and Binding Affinity Over Large Sequence Spaces.

Publication ,  Journal Article
Ojewole, AA; Jou, JD; Fowler, VG; Donald, BR
Published in: J Comput Biol
July 2018

Computational protein design (CPD) algorithms that compute binding affinity, Ka, search for sequences with an energetically favorable free energy of binding. Recent work shows that three principles improve the biological accuracy of CPD: ensemble-based design, continuous flexibility of backbone and side-chain conformations, and provable guarantees of accuracy with respect to the input. However, previous methods that use all three design principles are single-sequence (SS) algorithms, which are very costly: linear in the number of sequences and thus exponential in the number of simultaneously mutable residues. To address this computational challenge, we introduce BBK*, a new CPD algorithm whose key innovation is the multisequence (MS) bound: BBK* efficiently computes a single provable upper bound to approximate Ka for a combinatorial number of sequences, and avoids SS computation for all provably suboptimal sequences. Thus, to our knowledge, BBK* is the first provable, ensemble-based CPD algorithm to run in time sublinear in the number of sequences. Computational experiments on 204 protein design problems show that BBK* finds the tightest binding sequences while approximating Ka for up to 105-fold fewer sequences than the previous state-of-the-art algorithms, which require exhaustive enumeration of sequences. Furthermore, for 51 protein-ligand design problems, BBK* provably approximates Ka up to 1982-fold faster than the previous state-of-the-art iMinDEE/[Formula: see text]/[Formula: see text] algorithm. Therefore, BBK* not only accelerates protein designs that are possible with previous provable algorithms, but also efficiently performs designs that are too large for previous methods.

Duke Scholars

Published In

J Comput Biol

DOI

EISSN

1557-8666

Publication Date

July 2018

Volume

25

Issue

7

Start / End Page

726 / 739

Location

United States

Related Subject Headings

  • Software
  • Proteins
  • Protein Conformation
  • Models, Molecular
  • Humans
  • Entropy
  • Computational Biology
  • Bioinformatics
  • Amino Acid Sequence
  • Algorithms
 

Citation

APA
Chicago
ICMJE
MLA
NLM
Ojewole, A. A., Jou, J. D., Fowler, V. G., & Donald, B. R. (2018). BBK* (Branch and Bound Over K*): A Provable and Efficient Ensemble-Based Protein Design Algorithm to Optimize Stability and Binding Affinity Over Large Sequence Spaces. J Comput Biol, 25(7), 726–739. https://doi.org/10.1089/cmb.2017.0267
Ojewole, Adegoke A., Jonathan D. Jou, Vance G. Fowler, and Bruce R. Donald. “BBK* (Branch and Bound Over K*): A Provable and Efficient Ensemble-Based Protein Design Algorithm to Optimize Stability and Binding Affinity Over Large Sequence Spaces.J Comput Biol 25, no. 7 (July 2018): 726–39. https://doi.org/10.1089/cmb.2017.0267.
Ojewole, Adegoke A., et al. “BBK* (Branch and Bound Over K*): A Provable and Efficient Ensemble-Based Protein Design Algorithm to Optimize Stability and Binding Affinity Over Large Sequence Spaces.J Comput Biol, vol. 25, no. 7, July 2018, pp. 726–39. Pubmed, doi:10.1089/cmb.2017.0267.
Journal cover image

Published In

J Comput Biol

DOI

EISSN

1557-8666

Publication Date

July 2018

Volume

25

Issue

7

Start / End Page

726 / 739

Location

United States

Related Subject Headings

  • Software
  • Proteins
  • Protein Conformation
  • Models, Molecular
  • Humans
  • Entropy
  • Computational Biology
  • Bioinformatics
  • Amino Acid Sequence
  • Algorithms