TY - JOUR
T1 - Conditional Disease Development extracted from Longitudinal Health Care Cohort Data using Layered Network Construction
AU - Kannan, Venkateshan
AU - Swartz, Fredrik
AU - Kiani, Narsis A.
AU - Silberberg, Gilad
AU - Tsipras, Giorgos
AU - Gomez-Cabrero, David
AU - Alexanderson, Kristina
AU - Tegnèr, Jesper
N1 - Generated from Scopus record by KAUST IRTS on 2021-02-16
PY - 2016/5/23
Y1 - 2016/5/23
N2 - Health care data holds great promise to be used in clinical decision support systems. However, frequent near-synonymous diagnoses recorded separately, as well as the sheer magnitude and complexity of the disease data makes it challenging to extract non-trivial conclusions beyond confirmatory associations from such a web of interactions. Here we present a systematic methodology to derive statistically valid conditional development of diseases. To this end we utilize a cohort of 5,512,469 individuals followed over 13 years at inpatient care, including data on disability pension and cause of death. By introducing a causal information fraction measure and taking advantage of the composite structure in the ICD codes, we extract an effective directed lower dimensional network representation (100 nodes and 130 edges) of our cohort. Unpacking composite nodes into bipartite graphs retrieves, for example, that individuals with behavioral disorders are more likely to be followed by prescription drug poisoning episodes, whereas women with leiomyoma were more likely to subsequently experience endometriosis. The conditional disease development represent putative causal relations, indicating possible novel clinical relationships and pathophysiological associations that have not been explored yet.
AB - Health care data holds great promise to be used in clinical decision support systems. However, frequent near-synonymous diagnoses recorded separately, as well as the sheer magnitude and complexity of the disease data makes it challenging to extract non-trivial conclusions beyond confirmatory associations from such a web of interactions. Here we present a systematic methodology to derive statistically valid conditional development of diseases. To this end we utilize a cohort of 5,512,469 individuals followed over 13 years at inpatient care, including data on disability pension and cause of death. By introducing a causal information fraction measure and taking advantage of the composite structure in the ICD codes, we extract an effective directed lower dimensional network representation (100 nodes and 130 edges) of our cohort. Unpacking composite nodes into bipartite graphs retrieves, for example, that individuals with behavioral disorders are more likely to be followed by prescription drug poisoning episodes, whereas women with leiomyoma were more likely to subsequently experience endometriosis. The conditional disease development represent putative causal relations, indicating possible novel clinical relationships and pathophysiological associations that have not been explored yet.
UR - http://www.nature.com/articles/srep26170
UR - http://www.scopus.com/inward/record.url?scp=84971268466&partnerID=8YFLogxK
U2 - 10.1038/srep26170
DO - 10.1038/srep26170
M3 - Article
C2 - 27211115
SN - 2045-2322
VL - 6
JO - Scientific Reports
JF - Scientific Reports
ER -