G-DIRT: a web server for identification and removal of duplicate germplasms based on identity-by-state analysis using single nucleotide polymorphism genotyping data
OAR@ICRISAT
View Archive InfoField | Value | |
Relation |
http://oar.icrisat.org/12286/
https://academic.oup.com/bib/article-abstract/23/5/bbac348/6678959 https://doi.org/10.1093/bib/bbac348 |
|
Title |
G-DIRT: a web server for identification and removal of duplicate germplasms based on identity-by-state analysis using single nucleotide polymorphism genotyping data
|
|
Creator |
Sahu, T K
Singh, A K Mittal, S Jha, S K Kumar, S Jacob, S R Singh, K |
|
Subject |
Germplasm
Gene Bank |
|
Description |
Maintaining duplicate germplasms in genebanks hampers effective conservation and utilization of genebank resources. The redundant germplasm adds to the cost of germplasm conservation by requiring a large proportion of the genebank financial resources towards conservation rather than enriching the diversity. Besides, genome-wide-association analysis using an association panel with over-represented germplasms can be biased resulting in spurious marker-trait associations. The conventional methods of germplasm duplicate removal using passport information suffer from incomplete or missing passport information and data handling errors at various stages of germplasm enrichment. This limitation is less likely in the case of genotypic data. Therefore, we developed a web-based tool, Germplasm Duplicate Identification and Removal Tool (G-DIRT), which allows germplasm duplicate identification based on identity-by-state analysis using single-nucleotide polymorphism genotyping information along with pre-processing of genotypic data. A homozygous genotypic difference threshold of 0.1% for germplasm duplicates has been determined using tetraploid wheat genotypic data with 94.97% of accuracy. Based on the genotypic difference, the tool also builds a dendrogram that can visually depict the relationship between genotypes. To overcome the constraint of high-dimensional genotypic data, an offline version of G-DIRT in the interface of R has also been developed. The G-DIRT is expected to help genebank curators, breeders and other researchers across the world in identifying germplasm duplicates from the global genebank collections by only using the easily sharable genotypic data instead of physically exchanging the seeds or propagating materials. The web server will complement the existing methods of germplasm duplicate identification based on passport or phenotypic information being freely accessible at
|
|
Publisher |
Oxford University Press
|
|
Date |
2022-08-30
|
|
Type |
Article
PeerReviewed |
|
Identifier |
Sahu, T K and Singh, A K and Mittal, S and Jha, S K and Kumar, S and Jacob, S R and Singh, K (2022) G-DIRT: a web server for identification and removal of duplicate germplasms based on identity-by-state analysis using single nucleotide polymorphism genotyping data. Briefings in Bioinformatics, 23 (5). ISSN 1477-4054
|
|