TY - JOUR
T1 - DeepSimulator1.5: a more powerful, quicker and lighter simulator for Nanopore sequencing.
AU - Li, Yu
AU - wang, sheng
AU - Bi, Chongwei
AU - Qiu, Zhaowen
AU - Li, Mo
AU - Gao, Xin
N1 - KAUST Repository Item: Exported on 2020-10-01
Acknowledgements: We would like to thank all the users that have discussed with us on Github, which have definitely improved DeepSimulator greatly, resulting in DeepSimulator1.5. We want to thank Jia-Xing Yue especially for his constructive feedbacks.
PY - 2020/1/8
Y1 - 2020/1/8
N2 - MOTIVATION:Nanopore sequencing is one of the leading third-generation sequencing technologies. A number of computational tools have been developed to facilitate the processing and analysis of the Nanopore data. Previously, we have developed DeepSimulator1.0 (DS1.0), which is the first simulator for Nanopore sequencing to produce both the raw electrical signals and the reads. However, although DS1.0 can produce high-quality reads, for some sequences, the divergence between the simulated raw signals and the real signals can be large. Furthermore, the Nanopore sequencing technology has evolved greatly since DS1.0 was released. It is thus necessary to update DS1.0 to accommodate those changes. RESULTS:We propose DeepSimulator1.5 (DS1.5), all three modules of which have been updated substantially from DS1.0. As for the sequence generator, we updated the sample read length distribution to reflect the newest real reads' features. In terms of the signal generator, which is the core of DeepSimulator, we added one more pore model, the context-independent pore model, which is much faster than the previous context-dependent one. Furthermore, to make the generated signals more similar to the real ones, we added a low-pass filter to post-process the pore model signals. Regarding the basecaller, we added the support for the newest official basecaller, Guppy, which can support both GPU and CPU. In addition, multiple optimizations, related to multiprocessing control, memory, and storage management, have been implemented to make DS1.5 a much more amenable and lighter simulator than DS1.0. AVAILABILITY:The main program and the data are available at https://github.com/lykaust15/DeepSimulator.
AB - MOTIVATION:Nanopore sequencing is one of the leading third-generation sequencing technologies. A number of computational tools have been developed to facilitate the processing and analysis of the Nanopore data. Previously, we have developed DeepSimulator1.0 (DS1.0), which is the first simulator for Nanopore sequencing to produce both the raw electrical signals and the reads. However, although DS1.0 can produce high-quality reads, for some sequences, the divergence between the simulated raw signals and the real signals can be large. Furthermore, the Nanopore sequencing technology has evolved greatly since DS1.0 was released. It is thus necessary to update DS1.0 to accommodate those changes. RESULTS:We propose DeepSimulator1.5 (DS1.5), all three modules of which have been updated substantially from DS1.0. As for the sequence generator, we updated the sample read length distribution to reflect the newest real reads' features. In terms of the signal generator, which is the core of DeepSimulator, we added one more pore model, the context-independent pore model, which is much faster than the previous context-dependent one. Furthermore, to make the generated signals more similar to the real ones, we added a low-pass filter to post-process the pore model signals. Regarding the basecaller, we added the support for the newest official basecaller, Guppy, which can support both GPU and CPU. In addition, multiple optimizations, related to multiprocessing control, memory, and storage management, have been implemented to make DS1.5 a much more amenable and lighter simulator than DS1.0. AVAILABILITY:The main program and the data are available at https://github.com/lykaust15/DeepSimulator.
UR - http://hdl.handle.net/10754/661086
UR - https://academic.oup.com/bioinformatics/advance-article/doi/10.1093/bioinformatics/btz963/5698265
UR - http://www.scopus.com/inward/record.url?scp=85084025101&partnerID=8YFLogxK
U2 - 10.1093/bioinformatics/btz963
DO - 10.1093/bioinformatics/btz963
M3 - Article
C2 - 31913436
SN - 1367-4803
VL - 36
SP - 2578
EP - 2580
JO - Bioinformatics (Oxford, England)
JF - Bioinformatics (Oxford, England)
IS - 8
ER -