Record Details

GlycoPP: a webserver for prediction of N- and O-glycosites in prokaryotic protein sequences.

DIR@IMTECH: CSIR-Institute of Microbial Technology

Field Value
 
Title GlycoPP: a webserver for prediction of N- and O-glycosites in prokaryotic protein sequences.
 
Creator Chauhan, Jagat S
Bhat, Adil H
Raghava, G.P.S.
Rao, Alka
 
Subject QR Microbiology
 
Description Glycosylation is one of the most abundant post-translational modifications (PTMs) required for various structure/function modulations of proteins in a living cell. Although elucidated only recently in prokaryotes, this type of PTM is present across all three domains of life. In prokaryotes, two types of protein-glycan linkage are more widespread, namely N-linked, where a glycan moiety is attached to the amide group of Asn, and O-linked, where a glycan moiety is attached to the hydroxyl group of Ser/Thr/Tyr. Because of their biological ubiquity, significance, and technological applications, the study of prokaryotic glycoproteins is a fast-emerging area of research. Here we describe new Support Vector Machine (SVM) based algorithms (models) developed for predicting glycosylated residues (glycosites) with high accuracy in prokaryotic protein sequences. The models are based on binary profile of patterns, composition profile of patterns, and position-specific scoring matrix profile of patterns as training features. The study employs an extensive dataset of 107 N-linked and 116 O-linked glycosites extracted from 59 experimentally characterized glycoproteins of prokaryotes. This dataset includes validated N-glycosites from the phyla Crenarchaeota and Euryarchaeota (domain Archaea) and Proteobacteria (domain Bacteria), and validated O-glycosites from the phyla Actinobacteria, Bacteroidetes, Firmicutes and Proteobacteria (domain Bacteria). In view of the current understanding that glycosylation occurs on folded proteins in bacteria, hybrid models have been developed using information on predicted secondary structures and accessible surface area in various combinations with the training features. Using these models, N-glycosites and O-glycosites could be predicted with an accuracy of 82.71% (MCC 0.65) and 73.71% (MCC 0.48), respectively.
An evaluation of the best performing models with 28 independent prokaryotic glycoproteins confirms the suitability of these models for predicting N- and O-glycosites in potential glycoproteins from the aforementioned organisms with reasonably high confidence. A web server, GlycoPP, implementing these models is freely available at http://www.imtech.res.in/raghava/glycopp/.
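The "binary profile of patterns" feature named in the description can be illustrated with a minimal sketch. It is not the authors' implementation: the flank size of 10, the `X` padding symbol, the 21-symbol alphabet, and the function names `extract_windows` and `binary_profile` are all assumptions chosen for illustration. The sketch cuts a fixed-length window around each candidate residue (Asn for N-linked sites) and one-hot encodes it into a vector that an SVM could take as input.

```python
# Illustrative sketch only; window size, padding symbol and names are assumptions.
AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"
PAD = "X"  # hypothetical symbol for positions beyond the sequence termini
ALPHABET = AMINO_ACIDS + PAD


def extract_windows(sequence, targets="N", flank=10):
    """Yield (1-based position, window) for every candidate residue.

    The window spans `flank` residues on each side of the candidate and is
    padded with PAD where it runs past either end of the sequence.
    """
    seq = sequence.upper()
    padded = PAD * flank + seq + PAD * flank
    for i, residue in enumerate(seq):
        if residue in targets:
            # In `padded`, the candidate sits at index i + flank.
            yield i + 1, padded[i : i + 2 * flank + 1]


def binary_profile(window):
    """One-hot encode a window: len(ALPHABET) binary values per position."""
    vector = []
    for residue in window:
        one_hot = [0] * len(ALPHABET)
        index = ALPHABET.find(residue)
        if index == -1:
            # Map non-standard residue codes onto the padding symbol.
            index = ALPHABET.index(PAD)
        one_hot[index] = 1
        vector.extend(one_hot)
    return vector


if __name__ == "__main__":
    # Toy sequence, not taken from the GlycoPP dataset.
    for position, window in extract_windows("MKNATGSNLT", targets="N", flank=3):
        features = binary_profile(window)
        print(position, window, len(features))
```

For O-linked candidates the same sketch could be called with `targets="STY"`. The composition and PSSM profiles mentioned in the description would replace `binary_profile` with a different encoding of the same window.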
 
Publisher Public Library of Science
 
Date 2012
 
Type Article
PeerReviewed
 
Format application/pdf
 
Identifier http://crdd.osdd.net/open/1229/1/alka2012.pdf
Chauhan, Jagat S and Bhat, Adil H and Raghava, G.P.S. and Rao, Alka (2012) GlycoPP: a webserver for prediction of N- and O-glycosites in prokaryotic protein sequences. PLoS ONE, 7 (7). e40155. ISSN 1932-6203
 
Relation http://www.plosone.org/article/info%3Adoi%2F10.1371%2Fjournal.pone.0040155
http://crdd.osdd.net/open/1229/