TY - JOUR
T1 - De novo assembly of a new Solanum pennellii accession using nanopore sequencing
AU - Schmidt, Maximilian H.W.
AU - Vogel, Alexander
AU - Denton, Alisandra K.
AU - Istace, Benjamin
AU - Wormit, Alexandra
AU - van de Geest, Henri
AU - Bolger, Marie E.
AU - Alseekh, Saleh
AU - Maß, Janina
AU - Pfaff, Christian
AU - Schurr, Ulrich
AU - Chetelat, Roger
AU - Maumus, Florian
AU - Aury, Jean Marc
AU - Koren, Sergey
AU - Fernie, Alisdair R.
AU - Zamir, Dani
AU - Bolger, Anthony M.
AU - Usadel, Björn
N1 - Publisher Copyright:
© 2017 The author(s).
PY - 2017/10
Y1 - 2017/10
N2 - Updates in nanopore technology have made it possible to obtain gigabases of sequence data. Prior to this, nanopore sequencing technology was mainly used to analyze microbial samples. Here, we describe the generation of a comprehensive nanopore sequencing data set with a median read length of 11,979 bp for a self-compatible accession of the wild tomato species Solanum pennellii. We describe the assembly of its genome to a contig N50 of 2.5 MB. The assembly pipeline comprised initial read correction with Canu and assembly with SMARTdenovo. The resulting raw nanopore-based de novo genome is structurally highly similar to that of the reference S. pennellii LA716 accession but has a high error rate and was rich in homopolymer deletions. After polishing the assembly with Illumina reads, we obtained an error rate of <0.02% when assessed versus the same Illumina data. We obtained a gene completeness of 96.53%, slightly surpassing that of the reference S. pennellii. Taken together, our data indicate that such long read sequencing data can be used to affordably sequence and assemble gigabase-sized plant genomes.
AB - Updates in nanopore technology have made it possible to obtain gigabases of sequence data. Prior to this, nanopore sequencing technology was mainly used to analyze microbial samples. Here, we describe the generation of a comprehensive nanopore sequencing data set with a median read length of 11,979 bp for a self-compatible accession of the wild tomato species Solanum pennellii. We describe the assembly of its genome to a contig N50 of 2.5 MB. The assembly pipeline comprised initial read correction with Canu and assembly with SMARTdenovo. The resulting raw nanopore-based de novo genome is structurally highly similar to that of the reference S. pennellii LA716 accession but has a high error rate and was rich in homopolymer deletions. After polishing the assembly with Illumina reads, we obtained an error rate of <0.02% when assessed versus the same Illumina data. We obtained a gene completeness of 96.53%, slightly surpassing that of the reference S. pennellii. Taken together, our data indicate that such long read sequencing data can be used to affordably sequence and assemble gigabase-sized plant genomes.
UR - http://www.scopus.com/inward/record.url?scp=85033690821&partnerID=8YFLogxK
U2 - 10.1105/tpc.17.00521
DO - 10.1105/tpc.17.00521
M3 - ???researchoutput.researchoutputtypes.contributiontojournal.article???
C2 - 29025960
AN - SCOPUS:85033690821
SN - 1040-4651
VL - 29
SP - 2336
EP - 2348
JO - Plant Cell
JF - Plant Cell
IS - 10
ER -