TY - GEN
T1 - T-PAIR: Temporal Node pair Embedding for Automatic Biomedical Hypothesis Generation (Extended abstract)
AU - Akujuobi, Uchenna
AU - Spranger, Michael
AU - Palaniappan, Sucheendra K.
AU - Zhang, Xiangliang
N1 - Generated from Scopus record by KAUST IRTS on 2023-09-20
PY - 2023/1/1
Y1 - 2023/1/1
N2 - In this paper, we study an automatic hypothesis generation (HG) problem, which refers to the discovery of meaningful implicit connections between scientific terms, including but not limited to diseases, chemicals, drugs, and genes extracted from databases of biomedical publications. Most prior studies of this problem focused on using static information of terms and largely ignored the temporal dynamics of scientific term relations. Even when the dynamics were considered in a few recent studies, they learned the representations for the scientific terms rather than focusing on the term-pair relations. Since the HG problem is to predict term-pair connections, it is not enough to know with whom the terms are connected; it is more important to know how the connections have been formed (in a dynamic process). We formulate this HG problem as a future connectivity prediction in a dynamic attributed graph and propose an inductive edge (node pair) embedding method named T-PAIR, utilizing both the graphical structure and node attribute to encode the temporal node pair relationship. We demonstrate the efficiency of the proposed model on real-world biomedical datasets in predicting future term-pair relations between millions of seen terms (in the transductive setting), as well as on the relations involving unseen terms (in the inductive setting).
AB - In this paper, we study an automatic hypothesis generation (HG) problem, which refers to the discovery of meaningful implicit connections between scientific terms, including but not limited to diseases, chemicals, drugs, and genes extracted from databases of biomedical publications. Most prior studies of this problem focused on using static information of terms and largely ignored the temporal dynamics of scientific term relations. Even when the dynamics were considered in a few recent studies, they learned the representations for the scientific terms rather than focusing on the term-pair relations. Since the HG problem is to predict term-pair connections, it is not enough to know with whom the terms are connected; it is more important to know how the connections have been formed (in a dynamic process). We formulate this HG problem as a future connectivity prediction in a dynamic attributed graph and propose an inductive edge (node pair) embedding method named T-PAIR, utilizing both the graphical structure and node attribute to encode the temporal node pair relationship. We demonstrate the efficiency of the proposed model on real-world biomedical datasets in predicting future term-pair relations between millions of seen terms (in the transductive setting), as well as on the relations involving unseen terms (in the inductive setting).
UR - https://ieeexplore.ieee.org/document/10184755/
UR - http://www.scopus.com/inward/record.url?scp=85167714951&partnerID=8YFLogxK
U2 - 10.1109/ICDE55515.2023.00348
DO - 10.1109/ICDE55515.2023.00348
M3 - Conference contribution
SN - 9798350322279
SP - 3843
EP - 3844
BT - Proceedings - International Conference on Data Engineering
PB - IEEE Computer Society
ER -