KRISHI
ICAR RESEARCH DATA REPOSITORY FOR KNOWLEDGE MANAGEMENT
(An Institutional Publication and Data Inventory Repository)
"Not Available": Please do not remove the default option "Not Available" for the fields where metadata information is not available
"1001-01-01": Date not available or not applicable for filling metadata infromation
"1001-01-01": Date not available or not applicable for filling metadata infromation
Please use this identifier to cite or link to this item:
http://krishi.icar.gov.in/jspui/handle/123456789/73731
Title: | Identification of species based on DNA barcode using k-mer feature vector and Random forest classifier |
Other Titles: | Not Available |
Authors: | Prabina Kumar Meher Tanmaya Kumar Sahu A R Rao |
ICAR Data Use Licennce: | http://krishi.icar.gov.in/PDF/ICAR_Data_Use_Licence.pdf |
Author's Affiliated institute: | ICAR::Indian Agricultural Statistics Research Institute |
Published/ Complete Date: | 2016-11-05 |
Project Code: | Not Available |
Keywords: | BOLD systems DNA barcode Oligomer frequency Random forest SPIDBAR |
Publisher: | Not Available |
Citation: | Meher PK, Sahu TK, Rao AR. (2016). Identification of species based on DNA barcode using k-mer feature vector and Random forest classifier. Gene.;592(2):316-24. doi: 10.1016/j.gene.2016.07.010. Epub 2016 Jul 5. PMID: 27393648. |
Series/Report no.: | Not Available; |
Abstract/Description: | DNA barcoding is a molecular diagnostic method that allows automated and accurate identification of species based on a short and standardized fragment of DNA. To this end, an attempt has been made in this study to develop a computational approach for identifying the species by comparing its barcode with the barcode sequence of known species present in the reference library. Each barcode sequence was first mapped onto a numeric feature vector based on k-mer frequencies and then Random forest methodology was employed on the transformed dataset for species identification. The proposed approach outperformed similarity-based, tree-based, diagnostic-based approaches and found comparable with existing supervised learning based approaches in terms of species identification success rate, while compared using real and simulated datasets. Based on the proposed approach, an online web interface SPIDBAR has also been developed and made freely available at http://cabgrid.res.in:8080/spidbar/ for species identification by the taxonomists. |
Description: | Not Available |
ISSN: | Not Available |
Type(s) of content: | Research Paper |
Sponsors: | Not Available |
Language: | English |
Name of Journal: | Gene |
Volume No.: | 592 |
Page Number: | 316–324 |
Name of the Division/Regional Station: | Not Available |
Source, DOI or any other URL: | 10.1016/j.gene.2016.07.010 |
URI: | http://krishi.icar.gov.in/jspui/handle/123456789/73731 |
Appears in Collections: | AEdu-IASRI-Publication |
Files in This Item:
There are no files associated with this item.
Items in KRISHI are protected by copyright, with all rights reserved, unless otherwise indicated.