TY - JOUR
T1 - An atlas of human long non-coding RNAs with accurate 5′ ends
AU - Hon, Chung-Chau
AU - Ramilowski, Jordan A.
AU - Harshbarger, Jayson
AU - Bertin, Nicolas
AU - Rackham, Owen J. L.
AU - Gough, Julian
AU - Denisenko, Elena
AU - Schmeier, Sebastian
AU - Poulsen, Thomas M.
AU - Severin, Jessica
AU - Lizio, Marina
AU - Kawaji, Hideya
AU - Kasukawa, Takeya
AU - Itoh, Masayoshi
AU - Burroughs, A. Maxwell
AU - Noma, Shohei
AU - Djebali, Sarah
AU - Alam, Tanvir
AU - Medvedeva, Yulia A.
AU - Testa, Alison C.
AU - Lipovich, Leonard
AU - Yip, Chi-Wai
AU - Abugessaisa, Imad
AU - Mendez, Mickaël
AU - Hasegawa, Akira
AU - Tang, Dave
AU - Lassmann, Timo
AU - Heutink, Peter
AU - Babina, Magda
AU - Wells, Christine A.
AU - Kojima, Soichi
AU - Nakamura, Yukio
AU - Suzuki, Harukazu
AU - Daub, Carsten O.
AU - Hoon, Michiel J. L. de
AU - Arner, Erik
AU - Hayashizaki, Yoshihide
AU - Carninci, Piero
AU - Forrest, Alistair R. R.
N1 - KAUST Repository Item: Exported on 2020-10-01
Acknowledgements: FANTOM5 was made possible by research grants for the RIKEN Omics Science Center and the Innovative Cell Biology by Innovative Technology (Cell Innovation Program) from the MEXT to Y.H. It was also supported by research grants for the RIKEN Preventive Medicine and Diagnosis Innovation Program (RIKEN PMI) to Y.H. and the RIKEN Centre for Life Science Technologies, Division of Genomic Technologies (RIKEN CLST (DGT)) from the MEXT, Japan. A.R.R.F. is supported by a Senior Cancer Research Fellowship from the Cancer Research Trust, the MACA Ride to Conquer Cancer and the Australian Research Council’s Discovery Projects funding scheme (DP160101960). S.D. is supported by award number U54HG007004 from the National Human Genome Research Institute of the National Institutes of Health, funding from the Ministry of Economy and Competitiveness (MINECO) under grant number BIO2011-26205, and SEV-2012-0208 from the Spanish Ministry of Economy and Competitiveness. Y.A.M. is supported by the Russian Science Foundation, grant 15-14-30002. We thank RIKEN GeNAS for generation of the CAGE and RNA-seq libraries, the Netherlands Brain Bank for brain materials, the RIKEN BioResource Centre for providing cell lines and all members of the FANTOM5 consortium for discussions, in particular H. Ashoor, M. Frith, R. Guigo, A. Tanzer, E. Wood, H. Jia, K. Bailie, J. Harrow, E. Valen, R. Andersson, K. Vitting-Seerup, A. Sandelin, M. Taylor, J. Shin, R. Mori, C. Mungall and T. Meehan.
PY - 2017/3/1
Y1 - 2017/3/1
N2 - Long non-coding RNAs (lncRNAs) are largely heterogeneous and functionally uncharacterized. Here, using FANTOM5 cap analysis of gene expression (CAGE) data, we integrate multiple transcript collections to generate a comprehensive atlas of 27,919 human lncRNA genes with high-confidence 5′ ends and expression profiles across 1,829 samples from the major human primary cell types and tissues. Genomic and epigenomic classification of these lncRNAs reveals that most intergenic lncRNAs originate from enhancers rather than from promoters. Incorporating genetic and expression data, we show that lncRNAs overlapping trait-associated single nucleotide polymorphisms are specifically expressed in cell types relevant to the traits, implicating these lncRNAs in multiple diseases. We further demonstrate that lncRNAs overlapping expression quantitative trait loci (eQTL)-associated single nucleotide polymorphisms of messenger RNAs are co-expressed with the corresponding messenger RNAs, suggesting their potential roles in transcriptional regulation. Combining these findings with conservation data, we identify 19,175 potentially functional lncRNAs in the human genome.
AB - Long non-coding RNAs (lncRNAs) are largely heterogeneous and functionally uncharacterized. Here, using FANTOM5 cap analysis of gene expression (CAGE) data, we integrate multiple transcript collections to generate a comprehensive atlas of 27,919 human lncRNA genes with high-confidence 5′ ends and expression profiles across 1,829 samples from the major human primary cell types and tissues. Genomic and epigenomic classification of these lncRNAs reveals that most intergenic lncRNAs originate from enhancers rather than from promoters. Incorporating genetic and expression data, we show that lncRNAs overlapping trait-associated single nucleotide polymorphisms are specifically expressed in cell types relevant to the traits, implicating these lncRNAs in multiple diseases. We further demonstrate that lncRNAs overlapping expression quantitative trait loci (eQTL)-associated single nucleotide polymorphisms of messenger RNAs are co-expressed with the corresponding messenger RNAs, suggesting their potential roles in transcriptional regulation. Combining these findings with conservation data, we identify 19,175 potentially functional lncRNAs in the human genome.
UR - http://hdl.handle.net/10754/623789
UR - http://www.nature.com/nature/journal/v543/n7644/full/nature21374.html
UR - http://www.scopus.com/inward/record.url?scp=85015225394&partnerID=8YFLogxK
U2 - 10.1038/nature21374
DO - 10.1038/nature21374
M3 - Article
C2 - 28241135
SN - 0028-0836
VL - 543
SP - 199
EP - 204
JO - Nature
JF - Nature
IS - 7644
ER -