Record Details

G-DIRT: a web server for identification and removal of duplicate germplasms based on identity-by-state analysis using single nucleotide polymorphism genotyping data

OAR@ICRISAT

Field Value
 
Relation http://oar.icrisat.org/12286/
https://academic.oup.com/bib/article-abstract/23/5/bbac348/6678959
https://doi.org/10.1093/bib/bbac348
 
Title G-DIRT: a web server for identification and removal of duplicate germplasms based on identity-by-state analysis using single nucleotide polymorphism genotyping data
 
Creator Sahu, T K
Singh, A K
Mittal, S
Jha, S K
Kumar, S
Jacob, S R
Singh, K
 
Subject Germplasm
Gene Bank
 
Description Maintaining duplicate germplasms in genebanks hampers effective conservation and utilization of genebank resources. Redundant germplasm adds to the cost of germplasm conservation by directing a large proportion of genebank financial resources towards conservation rather than enriching diversity. Moreover, genome-wide association analysis using an association panel with over-represented germplasms can be biased, resulting in spurious marker-trait associations. Conventional methods of germplasm duplicate removal based on passport information suffer from incomplete or missing passport information and from data handling errors at various stages of germplasm enrichment. These limitations are less likely with genotypic data. Therefore, we developed a web-based tool, the Germplasm Duplicate Identification and Removal Tool (G-DIRT), which identifies germplasm duplicates through identity-by-state analysis of single-nucleotide polymorphism genotyping information and also pre-processes the genotypic data. A homozygous genotypic difference threshold of 0.1% for germplasm duplicates was determined using tetraploid wheat genotypic data, with 94.97% accuracy. Based on the genotypic difference, the tool also builds a dendrogram that visually depicts the relationship between genotypes. To overcome the constraint of high-dimensional genotypic data, an offline version of G-DIRT in the interface of R has also been developed. G-DIRT is expected to help genebank curators, breeders and other researchers across the world identify germplasm duplicates in global genebank collections using only easily sharable genotypic data instead of physically exchanging seeds or propagating materials. The web server will complement the existing methods of germplasm duplicate identification based on passport or phenotypic information, being freely accessible at
 
Publisher Oxford University Press
 
Date 2022-08-30
 
Type Article
PeerReviewed
 
Identifier Sahu, T K and Singh, A K and Mittal, S and Jha, S K and Kumar, S and Jacob, S R and Singh, K (2022) G-DIRT: a web server for identification and removal of duplicate germplasms based on identity-by-state analysis using single nucleotide polymorphism genotyping data. Briefings in Bioinformatics, 23 (5). ISSN 1477-4054
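
The duplicate screening summarized in the Description (pairwise homozygous genotypic difference compared against a 0.1% threshold) can be illustrated with a minimal Python sketch. This is not G-DIRT's code. The genotype encoding (two-letter calls, `None` for missing), the function names, the rule that only loci homozygous in both samples are compared, and the choice to treat the threshold as inclusive are all assumptions made for illustration; the accession data are synthetic.

```python
from itertools import combinations

# 0.1% homozygous genotypic difference, the threshold quoted in the Description.
THRESHOLD = 0.001


def is_homozygous(call):
    """True for a two-allele call whose alleles match, e.g. "AA"."""
    return call is not None and len(call) == 2 and call[0] == call[1]


def homozygous_difference(a, b):
    """Fraction of comparable loci where two samples have different homozygous calls.

    Assumption: a locus is comparable only when both samples are homozygous
    there; missing and heterozygous calls are skipped. Returns None when no
    locus is comparable.
    """
    compared = 0
    differing = 0
    for call_a, call_b in zip(a, b):
        if is_homozygous(call_a) and is_homozygous(call_b):
            compared += 1
            if call_a != call_b:
                differing += 1
    return differing / compared if compared else None


def find_duplicate_pairs(genotypes, threshold=THRESHOLD):
    """Return (name_a, name_b, difference) for pairs at or below the threshold."""
    pairs = []
    for (name_a, a), (name_b, b) in combinations(sorted(genotypes.items()), 2):
        diff = homozygous_difference(a, b)
        if diff is not None and diff <= threshold:
            pairs.append((name_a, name_b, diff))
    return pairs


# Synthetic example: 2000 SNP loci per hypothetical accession.
base = ["AA", "GG", "CC", "TT"] * 500

acc2 = list(base)
acc2[0] = "GG"   # one differing homozygous call
acc2[1] = None   # missing call, skipped
acc2[2] = "CT"   # heterozygous call, skipped

acc3 = list(base)
for i in range(0, 40, 4):
    acc3[i] = "TT"   # ten differing homozygous calls

genotypes = {"ACC1": base, "ACC2": acc2, "ACC3": acc3}
pairs = find_duplicate_pairs(genotypes)
for name_a, name_b, diff in pairs:
    print(f"{name_a} and {name_b}: difference {diff:.4%}")
```

With these synthetic data only ACC1 and ACC2 fall under the threshold, since ACC3 differs from both at ten loci. The same pairwise differences could also serve as the distance matrix for a dendrogram, though how G-DIRT builds its dendrogram is not specified in this record.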