TY - JOUR
T1 - Neuro-symbolic representation learning on biological knowledge graphs
AU - AlShahrani, Mona
AU - Khan, Mohammed Asif
AU - Maddouri, Omar
AU - Kinjo, Akira R
AU - Queralt-Rosinach, Núria
AU - Hoehndorf, Robert
N1 - KAUST Repository Item: Exported on 2020-10-01
Acknowledgements: This work was supported by funding from King Abdullah University of Science and Technology (KAUST).
PY - 2017/4/25
Y1 - 2017/4/25
N2 - Biological data and knowledge bases increasingly rely on Semantic Web technologies and the use of knowledge graphs for data integration, retrieval and federated queries. In the past years, feature learning methods that are applicable to graph-structured data are becoming available, but have not yet widely been applied and evaluated on structured biological knowledge.We develop a novel method for feature learning on biological knowledge graphs. Our method combines symbolic methods, in particular knowledge representation using symbolic logic and automated reasoning, with neural networks to generate embeddings of nodes that encode for related information within knowledge graphs. Through the use of symbolic logic, these embeddings contain both explicit and implicit information. We apply these embeddings to the prediction of edges in the knowledge graph representing problems of function prediction, finding candidate genes of diseases, protein-protein interactions, or drug target relations, and demonstrate performance that matches and sometimes outperforms traditional approaches based on manually crafted features. Our method can be applied to any biological knowledge graph, and will thereby open up the increasing amount of SemanticWeb based knowledge bases in biology to use in machine learning and data analytics.https://github.com/bio-ontology-research-group/[email protected] data are available at Bioinformatics online.
AB - Biological data and knowledge bases increasingly rely on Semantic Web technologies and the use of knowledge graphs for data integration, retrieval and federated queries. In the past years, feature learning methods that are applicable to graph-structured data are becoming available, but have not yet widely been applied and evaluated on structured biological knowledge.We develop a novel method for feature learning on biological knowledge graphs. Our method combines symbolic methods, in particular knowledge representation using symbolic logic and automated reasoning, with neural networks to generate embeddings of nodes that encode for related information within knowledge graphs. Through the use of symbolic logic, these embeddings contain both explicit and implicit information. We apply these embeddings to the prediction of edges in the knowledge graph representing problems of function prediction, finding candidate genes of diseases, protein-protein interactions, or drug target relations, and demonstrate performance that matches and sometimes outperforms traditional approaches based on manually crafted features. Our method can be applied to any biological knowledge graph, and will thereby open up the increasing amount of SemanticWeb based knowledge bases in biology to use in machine learning and data analytics.https://github.com/bio-ontology-research-group/[email protected] data are available at Bioinformatics online.
UR - http://hdl.handle.net/10754/623293
UR - https://academic.oup.com/bioinformatics/article-lookup/doi/10.1093/bioinformatics/btx275
UR - http://www.scopus.com/inward/record.url?scp=85042544877&partnerID=8YFLogxK
U2 - 10.1093/bioinformatics/btx275
DO - 10.1093/bioinformatics/btx275
M3 - Article
C2 - 28449114
SN - 1367-4803
VL - 33
SP - 2723
EP - 2730
JO - Bioinformatics
JF - Bioinformatics
IS - 17
ER -