Development of Methodology for Trait Specific Genes Identification

M.S. Farooqi; D.C. Mishra; K.K. Chaturvedi; S.Srivastava

KRISHI

ICAR RESEARCH DATA REPOSITORY FOR KNOWLEDGE MANAGEMENT
(An Institutional Publication and Data Inventory Repository)

"Not Available": Please do not remove the default option "Not Available" for the fields where metadata information is not available
"1001-01-01": Date not available or not applicable for filling metadata infromation

Please use this identifier to cite or link to this item: http://krishi.icar.gov.in/jspui/handle/123456789/71653

Title:	Development of Methodology for Trait Specific Genes Identification
Other Titles:	Not Available
Authors:	M.S. Farooqi D.C. Mishra K.K. Chaturvedi S.Srivastava
ICAR Data Use Licennce:	http://krishi.icar.gov.in/PDF/ICAR_Data_Use_Licence.pdf
Author's Affiliated institute:	ICAR::Indian Agricultural Statistics Research Institute
Published/ Complete Date:	2021
Project Code:	AGEDIASRISIL201900300149
Keywords:	RNA-seq feature selection SVM Genetic Algorithm informative genes gene expression
Publisher:	ICAR-IASRI
Citation:	Not Available
Series/Report no.:	I.A.S.R.I./P.R. - 06/2021;
Abstract/Description:	Comprehensive profiling of biological system is being continuously done using expression data obtained through high-throughput technologies, such as gene expression (GE) data, protein expression data and medical imaging data. The resulting data sets obtained through these technologies are huge in size and have several common characteristics making their analysis challenging. Many times the number of genes is much larger than the number of sample and the relevant informative genes which are associated with the outcome are less in the data sets. It is important to select most relevant genes related to condition class from thousands of genes with the help of appropriate statistical and computational techniques. These high dimensional data can be transformed into a meaningful representation of reduced dimensions (informative data) using dimensionality reduction techniques. Informative gene selection plays a bigger role in removing irrelevant and redundant genes from the data set. In this study methodology for obtaining relevant set of trait specific genes from gene expression data by applying combination of two conventional machine learning algorithms, support vector machine (SVM) and a genetic algorithm (GA) has been developed. Using SVM as the classifier performance and the Genetic algorithm for feature selection, a set of informative genes set can be obtained. The classification accuracy of the obtained genes set from the developed methodology was compared with the genes set obtained from methods such as Boot-MRMR, MRMR, t-score and F-score of R- package “GSAQ”. It has been observed that the performance of the developed methodology is better as compared to above given techniques for selecting robust set of informative genes. Based on this proposed approach, an R package, i.e., TSGS (https://cran.r-project.org/package=TSGS) has also been developed.
Description:	Not Available
ISSN:	Not Available
Type(s) of content:	Project Report
Sponsors:	Not Available
Language:	English
Volume No.:	Not Available
Page Number:	1-43
Name of the Division/Regional Station:	Division of Agricultural Bioinformatics
Source, DOI or any other URL:	Not Available
URI:	http://krishi.icar.gov.in/jspui/handle/123456789/71653
Appears in Collections:	AEdu-IASRI-Publication

Files in This Item:

File	Description	Size	Format
TSGS proj-report-05.04.22.pdf		4.13 MB	Adobe PDF	View/Open

Show full item record

KRISHI

ICAR RESEARCH DATA REPOSITORY FOR KNOWLEDGE MANAGEMENT (An Institutional Publication and Data Inventory Repository)

ICAR RESEARCH DATA REPOSITORY FOR KNOWLEDGE MANAGEMENT
(An Institutional Publication and Data Inventory Repository)