TY - GEN
T1 - A Distribution Preserving Model for Molecular Graph Generation
AU - Ma, Changsheng
AU - Yang, Qiang
AU - Liang, Shangsong
AU - Gao, Xin
N1 - Publisher Copyright:
© 2023 IEEE.
PY - 2023
Y1 - 2023
N2 - Generating molecular graphs using deep graph generative models is a challenging task that involves optimizing a given target within an enormous search space while adhering to chemical valence rules. Despite promising results, existing models mainly focus on learning molecular graph structures at the individual level while ignoring inter-molecular relationships regarding molecular characterization features and molecular activity. This can lead to the generation of molecules that are unresponsive to their true neighbors possessing similar characterization features, resulting in a divergence between the learned generation distribution and the actual molecular distribution. In this paper, we propose a distribution preserving model, designed to maintain the inter-molecular relationships of the original distribution within the generated space. Specifically, the model operates on a student-teacher paradigm, where the teacher module learns the inter-molecular relationship dynamics of the original distribution, and imparts this knowledge to the student module, which is responsible for generating molecules. Experimental results on two benchmark datasets demonstrate that our model outperforms the state-of-the-art models in generating valid, novel and unique molecules. Moreover, our model is verified on preserving molecule distribution in the generation space.
AB - Generating molecular graphs using deep graph generative models is a challenging task that involves optimizing a given target within an enormous search space while adhering to chemical valence rules. Despite promising results, existing models mainly focus on learning molecular graph structures at the individual level while ignoring inter-molecular relationships regarding molecular characterization features and molecular activity. This can lead to the generation of molecules that are unresponsive to their true neighbors possessing similar characterization features, resulting in a divergence between the learned generation distribution and the actual molecular distribution. In this paper, we propose a distribution preserving model, designed to maintain the inter-molecular relationships of the original distribution within the generated space. Specifically, the model operates on a student-teacher paradigm, where the teacher module learns the inter-molecular relationship dynamics of the original distribution, and imparts this knowledge to the student module, which is responsible for generating molecules. Experimental results on two benchmark datasets demonstrate that our model outperforms the state-of-the-art models in generating valid, novel and unique molecules. Moreover, our model is verified on preserving molecule distribution in the generation space.
KW - contrastive learning
KW - molecular graph generation
KW - student-teacher framework
UR - http://www.scopus.com/inward/record.url?scp=85184864007&partnerID=8YFLogxK
U2 - 10.1109/BIBM58861.2023.10385921
DO - 10.1109/BIBM58861.2023.10385921
M3 - Conference contribution
AN - SCOPUS:85184864007
T3 - Proceedings - 2023 2023 IEEE International Conference on Bioinformatics and Biomedicine, BIBM 2023
SP - 379
EP - 386
BT - Proceedings - 2023 2023 IEEE International Conference on Bioinformatics and Biomedicine, BIBM 2023
A2 - Jiang, Xingpeng
A2 - Wang, Haiying
A2 - Alhajj, Reda
A2 - Hu, Xiaohua
A2 - Engel, Felix
A2 - Mahmud, Mufti
A2 - Pisanti, Nadia
A2 - Cui, Xuefeng
A2 - Song, Hong
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 2023 IEEE International Conference on Bioinformatics and Biomedicine, BIBM 2023
Y2 - 5 December 2023 through 8 December 2023
ER -