KRISHI
ICAR RESEARCH DATA REPOSITORY FOR KNOWLEDGE MANAGEMENT
(An Institutional Publication and Data Inventory Repository)
"Not Available": Please do not remove the default option "Not Available" for the fields where metadata information is not available
"1001-01-01": Date not available or not applicable for filling metadata infromation
"1001-01-01": Date not available or not applicable for filling metadata infromation
Please use this identifier to cite or link to this item:
http://krishi.icar.gov.in/jspui/handle/123456789/52770
Title: | Structural SCOP super family level classification using unsupervised machine learning |
Other Titles: | Not Available Structural SCOP superfamily level classification using unsupervised machine learning , IEEE Transaction on Computational Biology and Bioinformatics, 9: 601-608 |
Authors: | Angadi UB Venkatesulu M |
ICAR Data Use Licennce: | http://krishi.icar.gov.in/PDF/ICAR_Data_Use_Licence.pdf |
Author's Affiliated institute: | ICAR-NIANP |
Published/ Complete Date: | 2011-08-04 |
Project Code: | Not Available |
Keywords: | machine learning |
Publisher: | Europe PMC |
Citation: | Angadi UB, Venkatesulu M. Structural SCOP superfamily level classification using unsupervised machine learning. IEEE/ACM Trans Comput Biol Bioinform. 2012 Mar-Apr;9(2) 601-608. doi:10.1109/tcbb.2011.114. PMID: 21844638.j |
Series/Report no.: | Not Available; |
Abstract/Description: | One of the major research directions in bioinformatics is that of assigning superfamily classification to a given set of proteins. The classification reflects the structural, evolutionary, and functional relatedness. These relationships are embodied in a hierarchical classification, such as the Structural Classification of Protein (SCOP), which is mostly manually curated. Such a classification is essential for the structural and functional analyses of proteins. Yet a large number of proteins remain unclassified. In this study, we have proposed an unsupervised machine learning approach to classify and assign a given set of proteins to SCOP superfamilies. In the method, we have constructed a database and similarity matrix using P-values obtained from an all-against-all BLAST run and trained the network with the ART2 unsupervised learning algorithm using the rows of the similarity matrix as input vectors, enabling the trained network to classify the proteins from 0.82 to 0.97 f-measure accuracy. The performance of ART2 has been compared with that of spectral clustering, Random forest, SVM, and HHpred. ART2 performs better than the others except HHpred. HHpred performs better than ART2 and the sum of errors is smaller than that of the other methods evaluated. |
Description: | Not Available |
ISSN: | Not Available |
Type(s) of content: | Journal |
Sponsors: | Not Available |
Language: | English |
Name of Journal: | IEEE/ACM Trans Comput Biol Bioinform. |
Volume No.: | 9(2) |
Page Number: | 601-608 |
Name of the Division/Regional Station: | Knowledge Management and Bio statistic Section |
Source, DOI or any other URL: | doi:10.1109/tcbb.2011.114. PMID: 21844638. |
URI: | http://krishi.icar.gov.in/jspui/handle/123456789/52770 |
Appears in Collections: | AS-NIANP-Publication |
Files in This Item:
There are no files associated with this item.
Items in KRISHI are protected by copyright, with all rights reserved, unless otherwise indicated.