TY - JOUR
T1 - DeepIDA: predicting isoform-disease associations by data fusion and deep neural networks
AU - Yu, Guoxian
AU - Yang, Yeqian
AU - Yan, Yangyang
AU - Guo, Maozu
AU - Zhang, Xiangliang
AU - Wang, Jun
N1 - KAUST Repository Item: Exported on 2021-02-15
Acknowledgements: This work is supported by Natural Science Foundation of China (61872300, 62031003 and 62072380), and Qilu Scholarship of Shandong University. G. Yu and Y. Yang are with the School of Software and the Joint SDU-NTU Centre for Artificial Intelligence Research, Shandong University, Jinan 250101, China.
PY - 2021
Y1 - 2021
N2 - Alternative splicing produces different isoforms from the same gene locus. Although the prediction of gene(miRNA)-disease associations have been extensively studied, few (or no) computational solutions have been proposed for the prediction of isoform-disease association (IDA) at a large scale, mainly due to the lack of disease annotations of isoforms. However, increasing evidences confirm the close connections between diseases and isoforms, which can more precisely uncover the pathology of complex diseases. Therefore, it is highly desirable to predict IDAs. To bridge this gap, we propose a deep neural network based solution (DeepIDA) to fuse multi-type genomics and transcriptomics data to predict IDAs. Particularly, DeepIDA uses gene-isoform relations to dispatch gene-disease associations to isoforms. In addition, it utilizes two DNN sub-networks with different structures to capture nucleotide and expression features of isoforms, Gene Ontology data and miRNA target data, respectively. After that, these two sub-networks are merged in a dense layer to predict IDAs. The experimental results on public datasets show that DeepIDA can effectively predict IDAs with AUPRC of 0.9141 and macro F-measure of 0.9155, which are much higher than those of competitive methods. Further study on sixteen isoform-disease association cases again corroborate the superiority of DeepIDA.
AB - Alternative splicing produces different isoforms from the same gene locus. Although the prediction of gene(miRNA)-disease associations have been extensively studied, few (or no) computational solutions have been proposed for the prediction of isoform-disease association (IDA) at a large scale, mainly due to the lack of disease annotations of isoforms. However, increasing evidences confirm the close connections between diseases and isoforms, which can more precisely uncover the pathology of complex diseases. Therefore, it is highly desirable to predict IDAs. To bridge this gap, we propose a deep neural network based solution (DeepIDA) to fuse multi-type genomics and transcriptomics data to predict IDAs. Particularly, DeepIDA uses gene-isoform relations to dispatch gene-disease associations to isoforms. In addition, it utilizes two DNN sub-networks with different structures to capture nucleotide and expression features of isoforms, Gene Ontology data and miRNA target data, respectively. After that, these two sub-networks are merged in a dense layer to predict IDAs. The experimental results on public datasets show that DeepIDA can effectively predict IDAs with AUPRC of 0.9141 and macro F-measure of 0.9155, which are much higher than those of competitive methods. Further study on sixteen isoform-disease association cases again corroborate the superiority of DeepIDA.
UR - http://hdl.handle.net/10754/667398
UR - https://ieeexplore.ieee.org/document/9353272/
U2 - 10.1109/TCBB.2021.3058801
DO - 10.1109/TCBB.2021.3058801
M3 - Article
C2 - 33571094
SN - 2374-0043
SP - 1
EP - 1
JO - IEEE/ACM Transactions on Computational Biology and Bioinformatics
JF - IEEE/ACM Transactions on Computational Biology and Bioinformatics
ER -