Title: Functional and informatics analysis enables glycosyltransferase activity prediction

Authors (10): M. Yang, C. Fehl, K. V. Lees, E. -K. Lim, W. A. Offen, G. J. Davies, D. J. Bowles, M. G. Davidson, S. J. Roberts, B. G. Davis

Themes: Transformations (2018)

DOI: 10.1038/s41589-018-0154-9

Citations: 84

Pub type: article-journal

Publisher: Springer Science and Business Media LLC

Issue: 12

License: http://www.springer.com/tdm

Publication date(s): 2018/12 (print) 2018/11/12 (online)

Pages: 1109-1117

Volume: 14 Issue: 12

Journal: Nature Chemical Biology

Link: http://www.nature.com/articles/s41589-018-0154-9

URL: http://dx.doi.org/10.1038/s41589-018-0154-9

The elucidation and prediction of how changes in a protein result in altered activities and selectivities remain a major challenge in chemistry. Two hurdles have prevented accurate family-wide models: obtaining (i) diverse datasets and (ii) suitable parameter frameworks that encapsulate activities in large sets. Here, we show that a relatively small but broad activity dataset is sufficient to train algorithms for functional prediction over the entire glycosyltransferase superfamily 1 (GT1) of the plant Arabidopsis thaliana. Whereas sequence analysis alone failed for GT1 substrate utilization patterns, our chemical–bioinformatic model, GT-Predict, succeeded by coupling physicochemical features with isozyme-recognition patterns over the family. GT-Predict identified GT1 biocatalysts for novel substrates and enabled functional annotation of uncharacterized GT1s. Finally, analyses of GT-Predict decision pathways revealed structural modulators of substrate recognition, thus providing information on mechanisms. This multifaceted approach to enzyme prediction may guide the streamlined utilization (and design) of biocatalysts and the discovery of other family-wide protein functions. Bioinformatic analysis coupled to substrate-reactivity profiling for the glycosyltransferase (GT) enzyme superfamily supports the development of ‘GT-Predict’ as a tool for functional prediction of GT–substrate relationships.

Name Description Publised
GT-Predict We utilized informatics and machine learning to build predictive softwar... 2018
41589_2018_154_MOESM1_ESM.pdf Supl. data for Functional and informatics analysis enables glycosyltrans... 2018
41589_2018_154_MOESM2_ESM.pdf Supl. data for Functional and informatics analysis enables glycosyltrans... 2018
41589_2018_154_MOESM3_ESM.mp4 Supl. data for Functional and informatics analysis enables glycosyltrans... 2018


Back