KRISHI
ICAR RESEARCH DATA REPOSITORY FOR KNOWLEDGE MANAGEMENT
(An Institutional Publication and Data Inventory Repository)
"Not Available": Please do not remove the default option "Not Available" for the fields where metadata information is not available
"1001-01-01": Date not available or not applicable for filling metadata infromation
"1001-01-01": Date not available or not applicable for filling metadata infromation
Please use this identifier to cite or link to this item:
http://krishi.icar.gov.in/jspui/handle/123456789/76506
Title: | Segmentation of Genomic Data: Comparative Analysis among Different Multivariate Statistical Approaches |
Other Titles: | Not Available |
Authors: | Anjum Arfa, Lall Shwetank, Verghesh Eldho, Rai Anil, Bhowmik Arpan, Mishra Dwijesh Chandra, Jaggi Seema |
ICAR Data Use Licennce: | http://krishi.icar.gov.in/PDF/ICAR_Data_Use_Licence.pdf |
Author's Affiliated institute: | ICAR::Indian Agricultural Statistics Research Institute ICAR::Central Marine Fisheries Research Institute |
Published/ Complete Date: | 2022-03-30 |
Project Code: | Not Available |
Keywords: | Genome; Segmentation; Multivariate analysis; Sequential clustering |
Publisher: | Not Available |
Citation: | Not Available |
Series/Report no.: | Not Available; |
Abstract/Description: | Segmenting a series of measurements along a genome into regions with distinct characteristics is widely used to identify functional components of a genome. The majority of the research on biological data segmentation focuses on the statistical problem of identifying break or change-points in a simulated scenario using a single variable. Despite the fact that various strategies for finding change-points in a multivariate setup through simulation are available, work on segmenting actual multivariate genomic data is limited. This is due to the fact that genomic data is huge in size and contains a lot of variation within it. Therefore, a study was carried out at the ICAR-Indian Agricultural Statistics Research Institute, New Delhi during 2021 to know the best multivariate statistical method to segment the sequences which may influence the properties or function of a sequence into homogeneous segments. This will reduce the volume of data and ease the analysis of these segments further to know the actual properties of these segments. The genomic data of Rice (Oryza sativa L.) was considered for the comparative analysis of several multivariate approaches and was found that agglomerative sequential clustering was the most acceptable due to its low computational cost and feasibility. |
Description: | Not Available |
ISSN: | Not Available |
Type(s) of content: | Research Paper |
Sponsors: | Not Available |
Language: | English |
Name of Journal: | Indian Journal of Agricultural Sciences |
Journal Type: | NAAS Journal |
NAAS Rating: | 6.37 |
Volume No.: | 92(7) |
Page Number: | 92-96 |
Name of the Division/Regional Station: | Not Available |
Source, DOI or any other URL: | http://eprints.cmfri.org.in/id/eprint/16126 |
URI: | http://krishi.icar.gov.in/jspui/handle/123456789/76506 |
Appears in Collections: | AEdu-IASRI-Publication |
Files in This Item:
There are no files associated with this item.
Items in KRISHI are protected by copyright, with all rights reserved, unless otherwise indicated.