Optimization of de novo short read assembly of seabuckthorn (Hippophae rhamnoides L.) transcriptome
NIPGR Digital Knowledge Repository (NDKR)
View Archive InfoField | Value | |
Title |
Optimization of de novo short read assembly of seabuckthorn (Hippophae rhamnoides L.) transcriptome
|
|
Creator |
Ghangal, Rajesh
Chaudhary, Saurabh Jain, Mukesh Purty, Ram Singh Sharma, Prakash Chand |
|
Subject |
De Novo Short Read Assembly
Seabuckthorn Hippophae rhamnoides L. Transcriptome |
|
Description |
Accepted date: July 9, 2013
Seabuckthorn (Hippophaerhamnoides L.) is known for its medicinal, nutritional and environmental importance since ancient times. However, very limited efforts have been made to characterize the genome and transcriptome of this wonder plant. Here, we report the use of next generation massive parallel sequencing technology (Illumina platform) and de novo assembly to gain a comprehensive view of the seabuckthorn transcriptome. We assembled 86,253,874 high quality short reads using six assembly tools. At our hand, assembly of non-redundant short reads following a two-step procedure was found to be the best considering various assembly quality parameters. Initially, ABySS tool was used following an additive k-mer approach. The assembled transcripts were subsequently subjected to TGICL suite. Finally, de novo short read assembly yielded 88,297 transcripts (> 100 bp), representing about 53 Mb of seabuckthorn transcriptome. The average length of transcripts was 610 bp, N50 length 1198 BP and 91% of the short reads uniquely mapped back to seabuckthorn transcriptome. A total of 41,340 (46.8%) transcripts showed significant similarity with sequences present in nr protein databases of NCBI (E-value < 1E-06). We also screened the assembled transcripts for the presence of transcription factors and simple sequence repeats. Our strategy involving the use of short read assembler (ABySS) followed by TGICL will be useful for the researchers working with a non-model organism's transcriptome in terms of saving time and reducing complexity in data management. The seabuckthorn transcriptome data generated here provide a valuable resource for gene discovery and development of functional molecular markers. |
|
Date |
2015-11-03T09:10:21Z
2015-11-03T09:10:21Z 2013 |
|
Type |
Article
|
|
Identifier |
PLoS One, 8(8): e72516
1932-6203 http://journals.plos.org/plosone/article?id=10.1371/journal.pone.0072516 http://172.16.0.77:8080/jspui/handle/123456789/309 |
|
Language |
en_US
|
|
Publisher |
PLOS
|
|