Skip to main content

Lipid discovery enabled by sequence statistics and machine learning.

Publication ,  Journal Article
Christensen, PM; Martin, J; Uppuluri, A; Joyce, LR; Wei, Y; Guan, Z; Morcos, F; Palmer, KL
Published in: bioRxiv
August 19, 2024

Bacterial membranes are complex and dynamic, arising from an array of evolutionary pressures. One enzyme that alters membrane compositions through covalent lipid modification is MprF. We recently identified that Streptococcus agalactiae MprF synthesizes lysyl-phosphatidylglycerol (Lys-PG) from anionic PG, and a novel cationic lipid, lysyl-glucosyl-diacylglycerol (Lys-Glc-DAG), from neutral glycolipid Glc-DAG. This unexpected result prompted us to investigate whether Lys-Glc-DAG occurs in other MprF-containing bacteria, and whether other novel MprF products exist. Here, we studied protein sequence features determining MprF substrate specificity. First, pairwise analyses identified several streptococ-cal MprFs synthesizing Lys-Glc-DAG. Second, a restricted Boltzmann machine-guided approach led us to discover an entirely new substrate for MprF in Enterococcus , diglucosyl-diacylglycerol (Glc 2 -DAG), and an expanded set of organisms that modify glycolipid substrates using MprF. Overall, we combined the wealth of available sequence data with machine learning to model evolutionary constraints on MprF sequences across the bacterial domain, thereby identifying a novel cationic lipid.

Duke Scholars

Altmetric Attention Stats
Dimensions Citation Stats

Published In

bioRxiv

DOI

EISSN

2692-8205

Publication Date

August 19, 2024

Location

United States
 

Citation

APA
Chicago
ICMJE
MLA
NLM
Christensen, P. M., Martin, J., Uppuluri, A., Joyce, L. R., Wei, Y., Guan, Z., … Palmer, K. L. (2024). Lipid discovery enabled by sequence statistics and machine learning. BioRxiv. https://doi.org/10.1101/2023.10.12.562061
Christensen, Priya M., Jonathan Martin, Aparna Uppuluri, Luke R. Joyce, Yahan Wei, Ziqiang Guan, Faruck Morcos, and Kelli L. Palmer. “Lipid discovery enabled by sequence statistics and machine learning.BioRxiv, August 19, 2024. https://doi.org/10.1101/2023.10.12.562061.
Christensen PM, Martin J, Uppuluri A, Joyce LR, Wei Y, Guan Z, et al. Lipid discovery enabled by sequence statistics and machine learning. bioRxiv. 2024 Aug 19;
Christensen, Priya M., et al. “Lipid discovery enabled by sequence statistics and machine learning.BioRxiv, Aug. 2024. Pubmed, doi:10.1101/2023.10.12.562061.
Christensen PM, Martin J, Uppuluri A, Joyce LR, Wei Y, Guan Z, Morcos F, Palmer KL. Lipid discovery enabled by sequence statistics and machine learning. bioRxiv. 2024 Aug 19;

Published In

bioRxiv

DOI

EISSN

2692-8205

Publication Date

August 19, 2024

Location

United States