Record Details

Optimization of de novo short read assembly of seabuckthorn (Hippophae rhamnoides L.) transcriptome

NIPGR Digital Knowledge Repository (NDKR)

View Archive Info
 
 
Field Value
 
Title Optimization of de novo short read assembly of seabuckthorn (Hippophae rhamnoides L.) transcriptome
 
Creator Ghangal, Rajesh
Chaudhary, Saurabh
Jain, Mukesh
Purty, Ram Singh
Sharma, Prakash Chand
 
Subject De Novo Short Read Assembly
Seabuckthorn
Hippophae rhamnoides L.
Transcriptome
 
Description Accepted date: July 9, 2013
Seabuckthorn (Hippophaerhamnoides L.) is known for its medicinal, nutritional and environmental importance since ancient times. However, very limited efforts have been made to characterize the genome and transcriptome of this wonder plant. Here, we report the use of next generation massive parallel sequencing technology (Illumina platform) and de novo assembly to gain a comprehensive view of the seabuckthorn transcriptome. We assembled 86,253,874 high quality short reads using six assembly tools. At our hand, assembly of non-redundant short reads following a two-step procedure was found to be the best considering various assembly quality parameters. Initially, ABySS tool was used following an additive k-mer approach. The assembled transcripts were subsequently subjected to TGICL suite. Finally, de novo short read assembly yielded 88,297 transcripts (> 100 bp), representing about 53 Mb of seabuckthorn transcriptome. The average length of transcripts was 610 bp, N50 length 1198 BP and 91% of the short reads uniquely mapped back to seabuckthorn transcriptome. A total of 41,340 (46.8%) transcripts showed significant similarity with sequences present in nr protein databases of NCBI (E-value < 1E-06). We also screened the assembled transcripts for the presence of transcription factors and simple sequence repeats. Our strategy involving the use of short read assembler (ABySS) followed by TGICL will be useful for the researchers working with a non-model organism's transcriptome in terms of saving time and reducing complexity in data management. The seabuckthorn transcriptome data generated here provide a valuable resource for gene discovery and development of functional molecular markers.
 
Date 2015-11-03T09:10:21Z
2015-11-03T09:10:21Z
2013
 
Type Article
 
Identifier PLoS One, 8(8): e72516
1932-6203
http://journals.plos.org/plosone/article?id=10.1371/journal.pone.0072516
http://172.16.0.77:8080/jspui/handle/123456789/309
 
Language en_US
 
Publisher PLOS