Skip to main content

Topsy-Turvy: integrating a global view into sequence-based PPI prediction.

Publication ,  Journal Article
Singh, R; Devkota, K; Sledzieski, S; Berger, B; Cowen, L
Published in: Bioinformatics
June 24, 2022

SUMMARY: Computational methods to predict protein-protein interaction (PPI) typically segregate into sequence-based 'bottom-up' methods that infer properties from the characteristics of the individual protein sequences, or global 'top-down' methods that infer properties from the pattern of already known PPIs in the species of interest. However, a way to incorporate top-down insights into sequence-based bottom-up PPI prediction methods has been elusive. We thus introduce Topsy-Turvy, a method that newly synthesizes both views in a sequence-based, multi-scale, deep-learning model for PPI prediction. While Topsy-Turvy makes predictions using only sequence data, during the training phase it takes a transfer-learning approach by incorporating patterns from both global and molecular-level views of protein interaction. In a cross-species context, we show it achieves state-of-the-art performance, offering the ability to perform genome-scale, interpretable PPI prediction for non-model organisms with no existing experimental PPI data. In species with available experimental PPI data, we further present a Topsy-Turvy hybrid (TT-Hybrid) model which integrates Topsy-Turvy with a purely network-based model for link prediction that provides information about species-specific network rewiring. TT-Hybrid makes accurate predictions for both well- and sparsely-characterized proteins, outperforming both its constituent components as well as other state-of-the-art PPI prediction methods. Furthermore, running Topsy-Turvy and TT-Hybrid screens is feasible for whole genomes, and thus these methods scale to settings where other methods (e.g. AlphaFold-Multimer) might be infeasible. The generalizability, accuracy and genome-level scalability of Topsy-Turvy and TT-Hybrid unlocks a more comprehensive map of protein interaction and organization in both model and non-model organisms. AVAILABILITY AND IMPLEMENTATION: https://topsyturvy.csail.mit.edu. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.

Duke Scholars

Altmetric Attention Stats
Dimensions Citation Stats

Published In

Bioinformatics

DOI

EISSN

1367-4811

Publication Date

June 24, 2022

Volume

38

Issue

Suppl 1

Start / End Page

i264 / i272

Location

England

Related Subject Headings

  • Proteins
  • Protein Interaction Mapping
  • Bioinformatics
  • Amino Acid Sequence
  • 49 Mathematical sciences
  • 46 Information and computing sciences
  • 31 Biological sciences
  • 08 Information and Computing Sciences
  • 06 Biological Sciences
  • 01 Mathematical Sciences
 

Citation

APA
Chicago
ICMJE
MLA
NLM
Singh, R., Devkota, K., Sledzieski, S., Berger, B., & Cowen, L. (2022). Topsy-Turvy: integrating a global view into sequence-based PPI prediction. Bioinformatics, 38(Suppl 1), i264–i272. https://doi.org/10.1093/bioinformatics/btac258
Singh, Rohit, Kapil Devkota, Samuel Sledzieski, Bonnie Berger, and Lenore Cowen. “Topsy-Turvy: integrating a global view into sequence-based PPI prediction.Bioinformatics 38, no. Suppl 1 (June 24, 2022): i264–72. https://doi.org/10.1093/bioinformatics/btac258.
Singh R, Devkota K, Sledzieski S, Berger B, Cowen L. Topsy-Turvy: integrating a global view into sequence-based PPI prediction. Bioinformatics. 2022 Jun 24;38(Suppl 1):i264–72.
Singh, Rohit, et al. “Topsy-Turvy: integrating a global view into sequence-based PPI prediction.Bioinformatics, vol. 38, no. Suppl 1, June 2022, pp. i264–72. Pubmed, doi:10.1093/bioinformatics/btac258.
Singh R, Devkota K, Sledzieski S, Berger B, Cowen L. Topsy-Turvy: integrating a global view into sequence-based PPI prediction. Bioinformatics. 2022 Jun 24;38(Suppl 1):i264–i272.

Published In

Bioinformatics

DOI

EISSN

1367-4811

Publication Date

June 24, 2022

Volume

38

Issue

Suppl 1

Start / End Page

i264 / i272

Location

England

Related Subject Headings

  • Proteins
  • Protein Interaction Mapping
  • Bioinformatics
  • Amino Acid Sequence
  • 49 Mathematical sciences
  • 46 Information and computing sciences
  • 31 Biological sciences
  • 08 Information and Computing Sciences
  • 06 Biological Sciences
  • 01 Mathematical Sciences