Record Details

Prediction of polyadenylation signals in human DNA sequences using nucleotide frequencies.

DIR@IMTECH: CSIR-Institute of Microbial Technology

View Archive Info
 
 
Field Value
 
Title Prediction of polyadenylation signals in human DNA sequences using nucleotide frequencies.
 
Creator Ahmed, Firoz
Kumar, Manish
Raghava, G.P.S.
 
Subject QH301 Biology
 
Description The polyadenylation signal plays a key role in determining the site for addition of a polyadenylated tail to nascent mRNA and its mutation(s) are reported in many diseases. Thus, identifying poly(A) sites is important for understanding the regulation and stability of mRNA. In this study, Support Vector Machine (SVM) models have been developed for predicting poly(A) signals in a DNA sequence using 100 nucleotides, each upstream and downstream of this signal. Here, we introduced a novel split nucleotide frequency technique, and the models thus developed achieved maximum Matthews correlation coefficients (MCC) of 0.58, 0.69, 0.70 and 0.69 using mononucleotide, dinucleotide, trinucleotide, and tetranucleotide frequencies, respectively. Finally, a hybrid model developed using a combination of dinucleotide, 2nd order dinucleotide and tetranucleotide frequencies, achieved a maximum MCC of 0.72. Moreover, for independent datasets this model achieved a precision ranging from 75.8-95.7% with a sensitivity of 57%, which is better than any other known methods.
 
Publisher Bioinformation Systems e.V.
 
Date 2009
 
Type Article
PeerReviewed
 
Format text/html
 
Identifier http://crdd.osdd.net/open/547/1/raghvasilico2.mht
Ahmed, Firoz and Kumar, Manish and Raghava, G.P.S. (2009) Prediction of polyadenylation signals in human DNA sequences using nucleotide frequencies. In silico biology, 9 (3). pp. 135-48. ISSN 1386-6338
 
Relation http://www.bioinfo.de/isb/2009/09/0012/
http://crdd.osdd.net/open/547/