TY - GEN
T1 - Attention-Based Multimodal Entity Linking with High-Quality Images
AU - Zhang, Li
AU - Li, Zhixu
AU - Yang, Qiang
N1 - KAUST Repository Item: Exported on 2021-05-06
Acknowledgements: This research is supported by National Key R&D Program of China (No. 2018-AAA0101900), the Priority Academic Program Development of Jiangsu Higher Education Institutions, National Natural Science Foundation of China (Grant No. 62072323, 61632016), Natural Science Foundation of Jiangsu Province (No. BK20191420), and the Suda-Toycloud Data Intelligence Joint Laboratory.
PY - 2021/4/6
Y1 - 2021/4/6
N2 - Multimodal entity linking (MEL) is an emerging research field which uses both textual and visual information to map an ambiguous mention to an entity in a knowledge base (KB). However, images do not always help, which may also backfire if they are irrelevant to the textual content at all. Besides, the existing efforts mainly focus on learning a representation of both mentions and entities from their textual and visual contexts, without considering the negative impact brought by noisy irrelevant images, which happens frequently with social media posts. In this paper, we propose a novel MEL model, which not only removes the negative impact of noisy images, but also uses multiple attention mechanism to better capture the connection between mention representation and its corresponding entity representation. Our empirical study on a large real data collection demonstrates the effectiveness of our approach.
AB - Multimodal entity linking (MEL) is an emerging research field which uses both textual and visual information to map an ambiguous mention to an entity in a knowledge base (KB). However, images do not always help, which may also backfire if they are irrelevant to the textual content at all. Besides, the existing efforts mainly focus on learning a representation of both mentions and entities from their textual and visual contexts, without considering the negative impact brought by noisy irrelevant images, which happens frequently with social media posts. In this paper, we propose a novel MEL model, which not only removes the negative impact of noisy images, but also uses multiple attention mechanism to better capture the connection between mention representation and its corresponding entity representation. Our empirical study on a large real data collection demonstrates the effectiveness of our approach.
UR - http://hdl.handle.net/10754/669086
UR - http://link.springer.com/10.1007/978-3-030-73197-7_35
UR - http://www.scopus.com/inward/record.url?scp=85104756682&partnerID=8YFLogxK
U2 - 10.1007/978-3-030-73197-7_35
DO - 10.1007/978-3-030-73197-7_35
M3 - Conference contribution
SN - 9783030731960
SP - 533
EP - 548
BT - Database Systems for Advanced Applications
PB - Springer International Publishing
ER -