Location: Soybean and Nitrogen Fixation ResearchTitle: Functional annotation of proteins for signaling network inference in non-model species
|VAN DEN BROECK, LISA - North Carolina State University|
|BHOSALE, DINESH KIRAN - North Carolina State University|
|SONG, KUNCHENG - North Carolina State University|
|DE LIMA, CASSIO F.F. - Ghent University|
|ASHLEY, MICHAEL - North Carolina State University|
|ZHU, TINGTING - Ghent University|
|ZHU, SHANSHUO - Ghent University|
|VAN DE COTTE, BRIGETTE - Ghent University|
|NEYT, PIA - Ghent University|
|APER, JONAS - Flanders Research Institute For Agriculture|
|LOOTENS, PETER - Flanders Research Institute For Agriculture|
|DE SMET, IVE - Ghent University|
|SOZZANI, ROSANGELA - North Carolina State University|
Submitted to: Nature Communications
Publication Type: Peer Reviewed Journal
Publication Acceptance Date: 5/30/2023
Publication Date: 8/3/2023
Citation: Van Den Broeck, L., Bhosale, D., Song, K., De Lima, C., Ashley, M., Zhu, T., Zhu, S., Van De Cotte, B., Neyt, P., Ortiz, A.C., Sikes, T.R., Aper, J., Lootens, P., Locke, A.M., De Smet, I., Sozzani, R. 2023. Functional annotation of proteins for signaling network inference in non-model species. Nature Communications. 14:4654. https://doi.org/10.1038/s41467-023-40365-z.
Interpretive Summary: A new neural network algorithm called PF-NET was developed to classify proteins, and this new algorithm requires less prior information than other commonly used algorithms. PF-NET identified phosphatase proteins in a model plant species that have been experimentally validated but were not correctly identified by the older, more commonly used protein classification algorithm. PF-NET classified the soybean kinase and phosphatase protein families, which are important for stress signaling. These protein classifications were then used to help determine the protein signaling network that regulates cold stress responses in soybean seedlings. We identified important protein regulators of soybean temperature responses, which are important targets for future experiments and could be candidates for improving soybean temperature stress tolerance.
Technical Abstract: Molecular biology aims to understand the molecular basis of cellular responses, unravel dynamic regulatory networks, and model complex biological systems. However, these studies remain challenging in non-model species as a result of poor functional annotation of regulatory proteins, like kinases or phosphatases. To overcome this limitation, we developed a multi-layer neural network that annotates proteins by determining functionality directly from the protein sequence. We annotated the kinases and phosphatases in the non-model species, Glycine max (soybean), achieving a prediction sensitivity of up to 97%. To demonstrate the applicability, we used our functional annotations in combination with Bayesian network principles to predict signaling cascades using time series phosphoproteomics. We shed light on phosphorylation cascades in soybean seedlings upon cold treatment and identified Glyma.10G173000 (TOI5) and Glyma.19G007300 (TOT3) as predicted key temperature response regulators in soybean. Importantly, the network inference does not rely upon known upstream kinases, kinase motifs, or protein interaction data, enabling de novo identification of kinase-substrate interactions. In addition to high accuracy and strong generalization, we showed that our functional prediction neural network is scalable to other model and non-model species, including Oryza sativa (rice), Zea mays (maize), Sorghum bicolor (sorghum), and Triticum aestivum (wheat). Taking together, we demonstrated a data-driven systems biology approach for non-model species leveraging our predicted upstream kinases and phosphatases.