Skip to content

bzhanglab/CoPheeMap

Repository files navigation

CoPheeMap: a Machine-Learned Co-Regulation Map of 26,280 Phosphosites

Introduction: Mass spectrometry-based phosphoproteomics offers a comprehensive view of protein phosphorylation, but limited knowledge about the regulation and function of most phosphosites restricts our ability to extract meaningful biological insights from phosphoproteomics data. To address this, we combine machine learning and phosphoproteomic data from 1,195 tumor specimens spanning 11 cancer types to construct CoPheeMap, a network mapping the co-regulation of 26,280 phosphosites. Integrating network features from CoPheeMap into a machine learning model, CoPheeKSA, we achieve superior performance in predicting kinase-substrate associations. CoPheeKSA reveals 24,015 associations between 9,399 phosphosites and 104 serine/threonine kinases, including many unannotated phosphosites and under-studied kinases. We validate the accuracy of these predictions using experimentally determined kinase-substrate specificities. By applying CoPheeMap and CoPheeKSA to phosphosites with high computationally predicted functional significance and cancer-associated phosphosites, we demonstrate the effectiveness of these tools in systematically illuminating phosphosites of interest, revealing dysregulated signaling processes in human cancer, and identifying under-studied kinases as putative therapeutic targets.

Prerequisites: Python 3 or higher; Anaconda jupyter notebook 6.5.2

Section 1 CoPheeMap:

CoPheeMap.ipynb

Section 2 CoPheeKSA:

PSSM.ipynb kinase_network.ipynb CoPheeKSA.ipynb CoPheeKSA_XGBoost.ipynb

Jiang W, Jaehnig EJ, Liao Y, Yaron-Barir TM, Johnson JL, Cantley LC, Zhang B. Illuminating the Dark Cancer Phosphoproteome Through a Machine-Learned Co-Regulation Map of 26,280 Phosphosites. bioRxiv [Preprint]. 2024 Mar 21:2024.03.19.585786. doi: 10.1101/2024.03.19.585786. PMID: 38562798; PMCID: PMC10983930.

About

Co-regulation map of phosphosites

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published