TY - JOUR
T1 - Spectrum-irrelevant fine-grained representation for visible–infrared person re-identification
AU - Gong, Jiahao
AU - Zhao, Sanyuan
AU - Lam, Kin Man
AU - Gao, Xin
AU - Shen, Jianbing
N1 - KAUST Repository Item: Exported on 2023-05-02
Acknowledgements: This work was supported by the National Natural Science Foundation of China under Grant 61902027.
PY - 2023/4/26
Y1 - 2023/4/26
N2 - Visible–infrared person re-identification (VI-ReID) is an important and practical task for full-time intelligent surveillance systems. Compared to visible person re-identification, it is more challenging due to the large cross-modal discrepancy. Existing VI-ReID methods suffer from heterogeneous structures and the different spectra of visible and infrared images. In this work, we propose the Spectrum-Insensitive Data Augmentation (SIDA) strategy, which effectively alleviates the disturbance in the visible and infrared spectra and forces the network to learn spectrum-irrelevant features. The network also compares samples with both global and local features. We devise a Feature Relation Reasoning (FRR) module to learn discriminative fine-grained representations according to the graph reasoning principle. Compared to the most commonly used uniform partition, our FRR better adopts to the case of VI-ReID, in which human bodies are difficult to align. Furthermore, we design the dual center loss for learning the global feature in order to maintain the intra-modality relations, while learning the cross-modal similarities. Our method achieves better convergence in training. Extensive experiments demonstrate that our method achieves state-of-the-art performance on two visible–infrared cross-modal Re-ID datasets.
AB - Visible–infrared person re-identification (VI-ReID) is an important and practical task for full-time intelligent surveillance systems. Compared to visible person re-identification, it is more challenging due to the large cross-modal discrepancy. Existing VI-ReID methods suffer from heterogeneous structures and the different spectra of visible and infrared images. In this work, we propose the Spectrum-Insensitive Data Augmentation (SIDA) strategy, which effectively alleviates the disturbance in the visible and infrared spectra and forces the network to learn spectrum-irrelevant features. The network also compares samples with both global and local features. We devise a Feature Relation Reasoning (FRR) module to learn discriminative fine-grained representations according to the graph reasoning principle. Compared to the most commonly used uniform partition, our FRR better adopts to the case of VI-ReID, in which human bodies are difficult to align. Furthermore, we design the dual center loss for learning the global feature in order to maintain the intra-modality relations, while learning the cross-modal similarities. Our method achieves better convergence in training. Extensive experiments demonstrate that our method achieves state-of-the-art performance on two visible–infrared cross-modal Re-ID datasets.
UR - http://hdl.handle.net/10754/691354
UR - https://linkinghub.elsevier.com/retrieve/pii/S1077314223000838
UR - http://www.scopus.com/inward/record.url?scp=85153504159&partnerID=8YFLogxK
U2 - 10.1016/j.cviu.2023.103703
DO - 10.1016/j.cviu.2023.103703
M3 - Article
SN - 1090-235X
VL - 232
SP - 103703
JO - Computer Vision and Image Understanding
JF - Computer Vision and Image Understanding
ER -