Record Details

Functional sites in protein families uncovered via an objective and automated graph theoretic approach

DSpace at IIT Bombay

View Archive Info
 
 
Field Value
 
Title Functional sites in protein families uncovered via an objective and automated graph theoretic approach
 
Creator WANGIKAR, PRAMOD P
TENDULKAR, ASHISH V
RAMYA, S
MALI, DEEPALI N
SARAWAGI, SUNITA
 
Subject algorithms
structure
backtracking
proteins
 
Description We report a method for detection of recurring side-chain patterns (DRESPAT) using an unbiased and automated graph theoretic approach. We first list all structural patterns as sub-graphs where the protein is represented as a graph. The patterns from proteins are compared pair-wise to detect patterns common to a protein pair based on content and geometry criteria. The recurring pattern is then detected using an automated search algorithm from the all-against-all pair-wise comparison data of proteins. Intra-protein pattern comparison data are used to enable detection of patterns recurring within a protein. A method has been proposed for empirical calculation of statistical significance of recurring pattern. The method was tested on 17 protein sets of varying size, composed of non-redundant representatives from SCOP superfamilies. Recurring patterns in serine proteases, cysteine proteases, lipases, cupredoxin, ferredoxin, ferritin, cytochrome c, aspartoyl proteases, peroxidases, phospholipase A2, endonuclease, SH3 domain, EF-hand and lectins show additional residues conserved in the vicinity of the known functional sites. On the basis of the recurring patterns in ferritin, EF-hand and lectins, we could separate proteins or domains that are structurally similar yet different in metal ion-binding characteristics. In addition, novel recurring patterns were observed in glutathione-S-transferase, phospholipase A2 and ferredoxin with potential structural/functional roles. The results are discussed in relation to the known functional sites in each family. Between 2000 and 50,000 patterns were enumerated from each protein with between ten and 500 patterns detected as common to an evolutionarily related protein pair. Our results show that unbiased extraction of functional site pattern is not feasible from an evolutionarily related protein pair but is feasible from protein sets comprising five or more proteins. The DRESPAT method does not require a user-defined pattern, size or location of the pattern and therefore, has the potential to uncover new functional sites in protein families.
 
Publisher Elsevier
 
Date 2009-03-17T10:03:50Z
2011-11-25T19:11:15Z
2011-12-26T13:07:25Z
2011-12-27T05:55:27Z
2009-03-17T10:03:50Z
2011-11-25T19:11:15Z
2011-12-26T13:07:25Z
2011-12-27T05:55:27Z
2003
 
Type Article
 
Identifier Journal of Molecular Biology 326(3), 955-978
0022-2836
http://dx.doi.org/10.1016/S0022-2836(02)01384-0
http://hdl.handle.net/10054/948
http://dspace.library.iitb.ac.in/xmlui/handle/10054/948
 
Language en