TY - JOUR
T1 - A high quality Arabidopsis transcriptome for accurate transcript-level analysis of alternative splicing
AU - Zhang, Runxuan
AU - Calixto, Cristiane P. G.
AU - Marquez, Yamile
AU - Venhuizen, Peter
AU - Tzioutziou, Nikoleta A.
AU - Guo, Wenbin
AU - Spensley, Mark
AU - Entizne, Juan Carlos
AU - Lewandowska, Dominika
AU - ten Have, Sara
AU - Frei dit Frey, Nicolas
AU - Hirt, Heribert
AU - James, Allan B.
AU - Nimmo, Hugh G.
AU - Barta, Andrea
AU - Kalyna, Maria
AU - Brown, John W. S.
N1 - KAUST Repository Item: Exported on 2020-10-01
Acknowledgements: Biotechnology and Biological Sciences Research Council (BBSRC) [BB/K013661/1 and BB/K006568/1 to J.B.; BB/K006835/1 to H.N.]; Scottish Government Rural and Environment Science and Analytical Services division (RESAS) [to J.B. and R.Z.]; Austrian Science Fund (FWF) [P26333 to M.K., DK W1207 to A.B.]; LABEX Saclay Plant Sciences [to H.H.]; BBSRC EASTBIO PhD studentships (to N.T. and J.C.); European Alternative Splicing Network of Excellence (EURASNET) [LSHG-CT-2005-518238] for catalyzing important collaborations. Funding for open access charge: University of Dundee.
PY - 2017/4/11
Y1 - 2017/4/11
N2 - Alternative splicing generates multiple transcript and protein isoforms from the same gene and thus is important in gene expression regulation. To date, RNA-sequencing (RNA-seq) is the standard method for quantifying changes in alternative splicing on a genome-wide scale. Understanding the current limitations of RNA-seq is crucial for reliable analysis and the lack of high quality, comprehensive transcriptomes for most species, including model organisms such as Arabidopsis, is a major constraint in accurate quantification of transcript isoforms. To address this, we designed a novel pipeline with stringent filters and assembled a comprehensive Reference Transcript Dataset for Arabidopsis (AtRTD2) containing 82,190 non-redundant transcripts from 34 212 genes. Extensive experimental validation showed that AtRTD2 and its modified version, AtRTD2-QUASI, for use in Quantification of Alternatively Spliced Isoforms, outperform other available transcriptomes in RNA-seq analysis. This strategy can be implemented in other species to build a pipeline for transcript-level expression and alternative splicing analyses.
AB - Alternative splicing generates multiple transcript and protein isoforms from the same gene and thus is important in gene expression regulation. To date, RNA-sequencing (RNA-seq) is the standard method for quantifying changes in alternative splicing on a genome-wide scale. Understanding the current limitations of RNA-seq is crucial for reliable analysis and the lack of high quality, comprehensive transcriptomes for most species, including model organisms such as Arabidopsis, is a major constraint in accurate quantification of transcript isoforms. To address this, we designed a novel pipeline with stringent filters and assembled a comprehensive Reference Transcript Dataset for Arabidopsis (AtRTD2) containing 82,190 non-redundant transcripts from 34 212 genes. Extensive experimental validation showed that AtRTD2 and its modified version, AtRTD2-QUASI, for use in Quantification of Alternatively Spliced Isoforms, outperform other available transcriptomes in RNA-seq analysis. This strategy can be implemented in other species to build a pipeline for transcript-level expression and alternative splicing analyses.
UR - http://hdl.handle.net/10754/623319
UR - https://academic.oup.com/nar/article-lookup/doi/10.1093/nar/gkx267
UR - http://www.scopus.com/inward/record.url?scp=85026999986&partnerID=8YFLogxK
U2 - 10.1093/nar/gkx267
DO - 10.1093/nar/gkx267
M3 - Article
SN - 0305-1048
VL - 45
SP - 5061
EP - 5073
JO - Nucleic Acids Research
JF - Nucleic Acids Research
IS - 9
ER -