Title :
Computer simulation reveals selection of library is related to read length of solexa sequencing in genome projects
Author :
You-Jie Zhao ; Fei Xiong ; Kai-Lai Zhou ; Kun-Rong Hu ; Yong-Ke Sun ; Wei-Li Kou ; Yue-Yu Dong ; Zheng-Ping Qiang ; Xiao-Rui Wang ; Guang-Zhi Di ; Yan Zhang ; Qing-Hui Zhang ; Tong-Lin Zhao ; Yong Cao
Author_Institution :
Dept. of Comput. & Inf. Sci., Southwest Forestry Univ., Kunming, China
Abstract :
In order to find the difference role of 200bp and 500bp libraries for genome assembly, a tool named SRSL was developed. It can simulate random solexa libraries based on reference sequence. Different depth and different read length of 200bp and 500bp libraries were produced by SRSL in four model species (rice, Arabidopsis, fruit fly and yeast). After assembling these sequences and calculating their contig N50 and scaffold N50, 200bp and 500bp libraries were compared for genome assembly. It is suggested that selection of 200bp or 500bp is closely related to read length of solexa sequence in the four genomes. When the read length of solexa is 50bp, it is shown 200bp and 500bp libraries should be selected together. When the read length of solexa is 100bp, it is sugguested 200bp library is not necessary to be sequenced. It is obviously different with past experience of solexa sequencing in genome projects. It would provide effective guide for solexa sequencing in the future genome projects.
Keywords :
biology computing; genomics; 200bp library; 500bp library; Arabidopsis; SRSL; computer simulation; fruit fly; genome assembly; genome projects; model species; random solexa library; reference sequence; rice; scaffold N50; solexa sequencing; yeast; Assembly; Bioinformatics; Error analysis; Gaussian distribution; Genomics; Libraries; Sequential analysis; Computer simulation; Genome project; Next-generation sequencing; Solexa library;
Conference_Titel :
Information Science, Electronics and Electrical Engineering (ISEEE), 2014 International Conference on
Conference_Location :
Sapporo
Print_ISBN :
978-1-4799-3196-5
DOI :
10.1109/InfoSEEE.2014.6948102