TY - GEN
T1 - Deep Multi-type Objects Multi-view Multi-instance Multi-label Learning
AU - Yang, Yuanlin
AU - Yu, Guoxian
AU - Domeniconi, Carlotta
AU - Zhang, Xiangliang
N1 - KAUST Repository Item: Exported on 2021-05-04
Acknowledgements: Supported by NSFC (61872300, 62031003 and 62072380)
PY - 2021/4/26
Y1 - 2021/4/26
N2 - Multi-view multi-instance multi-label learning (M3L) can model complex objects (bags) that are composed of multiple instances, represented with heterogeneous feature views and annotated with multiple related semantic labels. Although significant progress has been made toward M3L tasks, the current solutions still focus on a single-type of complex objects, and cannot effectively mine the widely-witnessed interconnected objects of multi-types. To bridge this gap, we propose a Deep Multi-type objects Multi-view Multi-instance Multi-label Learning solution (DeepM4L) based on heterogeneous network embedding. DeepM4L first encodes the inter- and intra-relations among multi-type objects using a heterogeneous network, and performs instance neighbor embedding to learn the representation vectors of instances. Next, it obtains the instance-label score tensor for each view and uses a max pooling operation to induce the bag-label score tensor for each bag. After that, it combines bag-label scores by multi-view learning to guarantee the semantic consistency between bags of different views. Our empirical study on benchmark datasets shows that DeepM4L is significantly superior to the recent advanced baselines.
AB - Multi-view multi-instance multi-label learning (M3L) can model complex objects (bags) that are composed of multiple instances, represented with heterogeneous feature views and annotated with multiple related semantic labels. Although significant progress has been made toward M3L tasks, the current solutions still focus on a single-type of complex objects, and cannot effectively mine the widely-witnessed interconnected objects of multi-types. To bridge this gap, we propose a Deep Multi-type objects Multi-view Multi-instance Multi-label Learning solution (DeepM4L) based on heterogeneous network embedding. DeepM4L first encodes the inter- and intra-relations among multi-type objects using a heterogeneous network, and performs instance neighbor embedding to learn the representation vectors of instances. Next, it obtains the instance-label score tensor for each view and uses a max pooling operation to induce the bag-label score tensor for each bag. After that, it combines bag-label scores by multi-view learning to guarantee the semantic consistency between bags of different views. Our empirical study on benchmark datasets shows that DeepM4L is significantly superior to the recent advanced baselines.
UR - http://hdl.handle.net/10754/669050
UR - https://epubs.siam.org/doi/10.1137/1.9781611976700.55
U2 - 10.1137/1.9781611976700.55
DO - 10.1137/1.9781611976700.55
M3 - Conference contribution
SN - 9781611976700
SP - 486
EP - 494
BT - Proceedings of the 2021 SIAM International Conference on Data Mining (SDM)
PB - Society for Industrial and Applied Mathematics
ER -