TY - JOUR
T1 - Comprehensive transcriptome analysis using synthetic long-read sequencing reveals molecular co-association of distant splicing events
AU - Tilgner, Hagen
AU - Jahanbani, Fereshteh
AU - Blauwkamp, Tim
AU - Moshrefi, Ali
AU - Jaeger, Erich
AU - Chen, Feng
AU - Harel, Itamar
AU - Bustamante, Carlos D.
AU - Rasmussen, Morten
AU - Snyder, Michael P.
N1 - Publisher Copyright:
© 2015 Nature America, Inc. All rights reserved.
PY - 2015/7/13
Y1 - 2015/7/13
N2 - Alternative splicing shapes mammalian transcriptomes, with many RNA molecules undergoing multiple distant alternative splicing events. Comprehensive transcriptome analysis, including analysis of exon co-association in the same molecule, requires deep, long-read sequencing. Here we introduce an RNA sequencing method, synthetic long-read RNA sequencing (SLR-RNA-seq), in which small pools (≤ 1,000 molecules/pool, ≤ 1 molecule/gene for most genes) of full-length cDNAs are amplified, fragmented and short-read-sequenced. We demonstrate that these RNA sequences reconstructed from the short reads from each of the pools are mostly close to full length and contain few insertion and deletion errors. We report many previously undescribed isoforms (human brain: ∼ 13,800 affected genes, 14.5% of molecules; mouse brain ∼ 8,600 genes, 18% of molecules) and up to 165 human distant molecularly associated exon pairs (dMAPs) and distant molecularly and mutually exclusive pairs (dMEPs). Of 16 associated pairs detected in the mouse brain, 9 are conserved in human. Our results indicate conserved mechanisms that can produce distant but phased features on transcript and proteome isoforms.
AB - Alternative splicing shapes mammalian transcriptomes, with many RNA molecules undergoing multiple distant alternative splicing events. Comprehensive transcriptome analysis, including analysis of exon co-association in the same molecule, requires deep, long-read sequencing. Here we introduce an RNA sequencing method, synthetic long-read RNA sequencing (SLR-RNA-seq), in which small pools (≤ 1,000 molecules/pool, ≤ 1 molecule/gene for most genes) of full-length cDNAs are amplified, fragmented and short-read-sequenced. We demonstrate that these RNA sequences reconstructed from the short reads from each of the pools are mostly close to full length and contain few insertion and deletion errors. We report many previously undescribed isoforms (human brain: ∼ 13,800 affected genes, 14.5% of molecules; mouse brain ∼ 8,600 genes, 18% of molecules) and up to 165 human distant molecularly associated exon pairs (dMAPs) and distant molecularly and mutually exclusive pairs (dMEPs). Of 16 associated pairs detected in the mouse brain, 9 are conserved in human. Our results indicate conserved mechanisms that can produce distant but phased features on transcript and proteome isoforms.
UR - http://www.scopus.com/inward/record.url?scp=84929697731&partnerID=8YFLogxK
U2 - 10.1038/nbt.3242
DO - 10.1038/nbt.3242
M3 - ???researchoutput.researchoutputtypes.contributiontojournal.article???
C2 - 25985263
AN - SCOPUS:84929697731
SN - 1087-0156
VL - 33
SP - 736
EP - 742
JO - Nature Biotechnology
JF - Nature Biotechnology
IS - 7
ER -