Learning Hierarchical Document Graphs From Multilevel Sentence Relations

Hao Zhang, Chaojie Wang, Zhengjue Wang, Zhibin Duan, Bo Chen, Mingyuan Zhou, Ricardo Henao, Lawrence Carin

Research output: Contribution to journalArticlepeer-review

3 Scopus citations

Abstract

Organizing the implicit topology of a document as a graph, and further performing feature extraction via the graph convolutional network (GCN), has proven effective in document analysis. However, existing document graphs are often restricted to expressing single-level relations, which are predefined and independent of downstream learning. A set of learnable hierarchical graphs are built to explore multilevel sentence relations, assisted by a hierarchical probabilistic topic model. Based on these graphs, multiple parallel GCNs are used to extract multilevel semantic features, which are aggregated by an attention mechanism for different document-comprehension tasks. Equipped with variational inference, the graph construction and GCN are learned jointly, allowing the graphs to evolve dynamically to better match the downstream task. The effectiveness and efficiency of the proposed multilevel sentence relation graph convolutional network (MuserGCN) is demonstrated via experiments on document classification, abstractive summarization, and matching.
Original languageEnglish (US)
JournalIEEE Transactions on Neural Networks and Learning Systems
DOIs
StatePublished - Jan 1 2021
Externally publishedYes

Fingerprint

Dive into the research topics of 'Learning Hierarchical Document Graphs From Multilevel Sentence Relations'. Together they form a unique fingerprint.

Cite this