KRISHI
ICAR RESEARCH DATA REPOSITORY FOR KNOWLEDGE MANAGEMENT
(An Institutional Publication and Data Inventory Repository)
"Not Available": Please do not remove the default option "Not Available" for the fields where metadata information is not available
"1001-01-01": Date not available or not applicable for filling metadata infromation
"1001-01-01": Date not available or not applicable for filling metadata infromation
Please use this identifier to cite or link to this item:
http://krishi.icar.gov.in/jspui/handle/123456789/71653
Title: | Development of Methodology for Trait Specific Genes Identification |
Other Titles: | Not Available |
Authors: | M.S. Farooqi D.C. Mishra K.K. Chaturvedi S.Srivastava |
ICAR Data Use Licennce: | http://krishi.icar.gov.in/PDF/ICAR_Data_Use_Licence.pdf |
Author's Affiliated institute: | ICAR::Indian Agricultural Statistics Research Institute |
Published/ Complete Date: | 2021 |
Project Code: | AGEDIASRISIL201900300149 |
Keywords: | RNA-seq feature selection SVM Genetic Algorithm informative genes gene expression |
Publisher: | ICAR-IASRI |
Citation: | Not Available |
Series/Report no.: | I.A.S.R.I./P.R. - 06/2021; |
Abstract/Description: | Comprehensive profiling of biological system is being continuously done using expression data obtained through high-throughput technologies, such as gene expression (GE) data, protein expression data and medical imaging data. The resulting data sets obtained through these technologies are huge in size and have several common characteristics making their analysis challenging. Many times the number of genes is much larger than the number of sample and the relevant informative genes which are associated with the outcome are less in the data sets. It is important to select most relevant genes related to condition class from thousands of genes with the help of appropriate statistical and computational techniques. These high dimensional data can be transformed into a meaningful representation of reduced dimensions (informative data) using dimensionality reduction techniques. Informative gene selection plays a bigger role in removing irrelevant and redundant genes from the data set. In this study methodology for obtaining relevant set of trait specific genes from gene expression data by applying combination of two conventional machine learning algorithms, support vector machine (SVM) and a genetic algorithm (GA) has been developed. Using SVM as the classifier performance and the Genetic algorithm for feature selection, a set of informative genes set can be obtained. The classification accuracy of the obtained genes set from the developed methodology was compared with the genes set obtained from methods such as Boot-MRMR, MRMR, t-score and F-score of R- package “GSAQ”. It has been observed that the performance of the developed methodology is better as compared to above given techniques for selecting robust set of informative genes. Based on this proposed approach, an R package, i.e., TSGS (https://cran.r-project.org/package=TSGS) has also been developed. |
Description: | Not Available |
ISSN: | Not Available |
Type(s) of content: | Project Report |
Sponsors: | Not Available |
Language: | English |
Volume No.: | Not Available |
Page Number: | 1-43 |
Name of the Division/Regional Station: | Division of Agricultural Bioinformatics |
Source, DOI or any other URL: | Not Available |
URI: | http://krishi.icar.gov.in/jspui/handle/123456789/71653 |
Appears in Collections: | AEdu-IASRI-Publication |
Files in This Item:
File | Description | Size | Format | |
---|---|---|---|---|
TSGS proj-report-05.04.22.pdf | 4.13 MB | Adobe PDF | View/Open |
Items in KRISHI are protected by copyright, with all rights reserved, unless otherwise indicated.