TY - JOUR
T1 - Full-length transcriptome assembly from RNA-Seq data without a reference genome
AU - Grabherr, Manfred G.
AU - Haas, Brian J.
AU - Yassour, Moran
AU - Levin, Joshua Z.
AU - Thompson, Dawn A.
AU - Amit, Ido
AU - Adiconis, Xian
AU - Fan, Lin
AU - Raychowdhury, Raktima
AU - Zeng, Qiandong
AU - Chen, Zehua
AU - Mauceli, Evan
AU - Hacohen, Nir
AU - Gnirke, Andreas
AU - Rhind, Nicholas
AU - Di Palma, Federica
AU - Birren, Bruce W.
AU - Nusbaum, Chad
AU - Lindblad-Toh, Kerstin
AU - Friedman, Nir
AU - Regev, Aviv
PY - 2011/7
Y1 - 2011/7
N2 - Massively parallel sequencing of cDNA has enabled deep and efficient probing of transcriptomes. Current approaches for transcript reconstruction from such data often rely on aligning reads to a reference genome, and are thus unsuitable for samples with a partial or missing reference genome. Here we present the Trinity method for de novo assembly of full-length transcripts and evaluate it on samples from fission yeast, mouse and whitefly, whose reference genome is not yet available. By efficiently constructing and analyzing sets of de Bruijn graphs, Trinity fully reconstructs a large fraction of transcripts, including alternatively spliced isoforms and transcripts from recently duplicated genes. Compared with other de novo transcriptome assemblers, Trinity recovers more full-length transcripts across a broad range of expression levels, with a sensitivity similar to methods that rely on genome alignments. Our approach provides a unified solution for transcriptome reconstruction in any sample, especially in the absence of a reference genome.
AB - Massively parallel sequencing of cDNA has enabled deep and efficient probing of transcriptomes. Current approaches for transcript reconstruction from such data often rely on aligning reads to a reference genome, and are thus unsuitable for samples with a partial or missing reference genome. Here we present the Trinity method for de novo assembly of full-length transcripts and evaluate it on samples from fission yeast, mouse and whitefly, whose reference genome is not yet available. By efficiently constructing and analyzing sets of de Bruijn graphs, Trinity fully reconstructs a large fraction of transcripts, including alternatively spliced isoforms and transcripts from recently duplicated genes. Compared with other de novo transcriptome assemblers, Trinity recovers more full-length transcripts across a broad range of expression levels, with a sensitivity similar to methods that rely on genome alignments. Our approach provides a unified solution for transcriptome reconstruction in any sample, especially in the absence of a reference genome.
UR - https://www.scopus.com/pages/publications/79960264362
U2 - 10.1038/nbt.1883
DO - 10.1038/nbt.1883
M3 - ???researchoutput.researchoutputtypes.contributiontojournal.article???
C2 - 21572440
AN - SCOPUS:79960264362
SN - 1087-0156
VL - 29
SP - 644
EP - 652
JO - Nature Biotechnology
JF - Nature Biotechnology
IS - 7
ER -