TY - JOUR
T1 - Building gene regulatory networks from scATAC-seq and scRNA-seq using Linked Self Organizing Maps
AU - Jansen, Camden
AU - Ramirez, Ricardo N.
AU - El-Ali, Nicole C.
AU - Gomez-Cabrero, David
AU - Tegner, Jesper
AU - Merkenschlager, Matthias
AU - Conesa, Ana
AU - Mortazavi, Ali
N1 - KAUST Repository Item: Exported on 2020-10-01
Acknowledgements: This work was supported by were supported by EU FP7 306000 STATegra.
PY - 2019/11/1
Y1 - 2019/11/1
N2 - Rapid advances in single-cell assays have outpaced methods for analysis of those data types. Different single-cell assays show extensive variation in sensitivity and signal to noise levels. In particular, scATAC-seq generates extremely sparse and noisy datasets. Existing methods developed to analyze this data require cells amenable to pseudo-time analysis or require datasets with drastically different cell-types. We describe a novel approach using self-organizing maps (SOM) to link scATAC-seq regions with scRNA-seq genes that overcomes these challenges and can generate draft regulatory networks. Our SOMatic package generates chromatin and gene expression SOMs separately and combines them using a linking function. We applied SOMatic on a mouse pre-B cell differentiation time-course using controlled Ikaros over-expression to recover gene ontology enrichments, identify motifs in genomic regions showing similar single-cell profiles, and generate a gene regulatory network that both recovers known interactions and predicts new Ikaros targets during the differentiation process. The ability of linked SOMs to detect emergent properties from multiple types of highly-dimensional genomic data with very different signal properties opens new avenues for integrative analysis of heterogeneous data.
AB - Rapid advances in single-cell assays have outpaced methods for analysis of those data types. Different single-cell assays show extensive variation in sensitivity and signal to noise levels. In particular, scATAC-seq generates extremely sparse and noisy datasets. Existing methods developed to analyze this data require cells amenable to pseudo-time analysis or require datasets with drastically different cell-types. We describe a novel approach using self-organizing maps (SOM) to link scATAC-seq regions with scRNA-seq genes that overcomes these challenges and can generate draft regulatory networks. Our SOMatic package generates chromatin and gene expression SOMs separately and combines them using a linking function. We applied SOMatic on a mouse pre-B cell differentiation time-course using controlled Ikaros over-expression to recover gene ontology enrichments, identify motifs in genomic regions showing similar single-cell profiles, and generate a gene regulatory network that both recovers known interactions and predicts new Ikaros targets during the differentiation process. The ability of linked SOMs to detect emergent properties from multiple types of highly-dimensional genomic data with very different signal properties opens new avenues for integrative analysis of heterogeneous data.
UR - http://hdl.handle.net/10754/660332
UR - https://dx.plos.org/10.1371/journal.pcbi.1006555
UR - http://www.scopus.com/inward/record.url?scp=85075091477&partnerID=8YFLogxK
U2 - 10.1371/journal.pcbi.1006555
DO - 10.1371/journal.pcbi.1006555
M3 - Article
C2 - 31682608
SN - 1553-7358
VL - 15
SP - e1006555
JO - PLoS computational biology
JF - PLoS computational biology
IS - 11
ER -