Abstract
Visible-infrared person re-identification (VI Re-ID) is challenging work due to huge inter-modality discrepancies and high similarity among inter-identity infrared images. Current methods aim to alleviate the modality discrepancies by using attention mechanisms and identity learning. However, most of these methods are too complex or fine-grained, which can instead destroy the integrity of subtle information and unavoidably diminishes the distinctiveness of features. Different from existing methods, we propose a novel Dual-Attentive Cascade Clustering Learning Network (DA\(C^2\)LNet) to alleviate inter-modality differences and reduce inter-identity similarities. DA\(C^2\)LNet focuses on learning key and useful information by discovering subtle information distributed in each part of the person’s body, which includes the channel attention module (CAM) and part-based attention module (PbAM). Specifically, we first apply CAM to alleviate modality discrepancies and enhance feature discrimination. Then, we design a PbAM, which is different from spatial attention in pixels, it generates several part pattern maps corresponding to different parts of the person’s body to mine overall nuances for minimizing inter-identity similarities. The two modules are cascaded together to learn distinguishing features. Finally, we introduce a center cluster learning manner to reduce intra-identity inter-modality discrepancies and increase inter-identity variances. Extensive experimental results on two public datasets (SYSU-MM01 and RegDB) demonstrate that DA\(C^2\)LNet outperforms state-of-the-art methods.
Similar content being viewed by others
References
Aggarwal AK, Jaidka P (2022) Segmentation of crop images for crop yield prediction. International Journal of Biology and Biomedicine, 7
Chen B, Deng W, Hu J (2019) Mixed high-order attention network for person re-identification. In: Proceedings of the IEEE/CVF international conference on computer vision, pp 371–381
Chen Y, Wan L, Li Z et al (2021) Neural Feature Search for RGB-Infrared Person Re-Identification. In: Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp 587–597
Choi S, Lee S, Kim Y et al (2020) Hi-cmd: Hierarchical cross-modality disentanglement for visible-infrared person re-identification. In: Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp 10257–10266
Dai P, Ji R, Wang H et al (2018) Cross-modality person re-identification with generative adversarial training. In: IJCAI, pp 1: 2
Farooq A, Awais M, Kittler J et al (2021) AXM-Net: Cross-Modal Context Sharing Attention Network for Person Re-ID. arXiv preprint arXiv:2101.08238
Fu C, Hu Y, Wu X et al (2021) CM-NAS: Cross-modality neural architecture search for visible-infrared person re-identification. arXiv preprint arXiv:2101.08467
Gao G, Shao H, Yu Y et al (2021) Leaning compact and representative features for cross-modality person re-identification. arXiv preprint arXiv:2103.14210
Gu X, Chang H, Ma B et al (2020) Appearance-preserving 3d convolution for video-based person re-identification. European conference on computer vision. Springer, Cham, pp 228–243
Hao Y, Wang N, Gao X et al (2019) Dual-alignment feature embedding for cross-modality person re-identification. In: Proceedings of the 27th ACM international conference on multimedia, pp 57–65
Hao Y, Wang N, Li J, Gao X (2019) HSME: hypersphere manifold embedding for visible thermal person re- identification. In: AAAI, pp 8385–8392
Hao Y, Wang N, Li J, Gao X (2019) Hsme: Hypersphere manifold embedding for visible thermal person re-identification. In: AAAI, pp 8385–8392
Hermans A, Beyer L, Leibe B (2017) In defense of the triplet loss for person re-identification. arXiv preprint arXiv:1703.07737
Hu J, Shen L, Sun G (2018) Squeeze-and-excitation networks. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 7132–7141
Liu H, Cheng J, Wang W et al (2020) Enhancing the discriminative feature learning for visible-thermal cross-modality person re-identification. Neurocomputing 398:11–19
Liu H, Shi W, Huang W et al (2018) A discriminatively learned feature embedding based on multi-loss fusion for person search. In: 2018 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). IEEE pp 1668–1672
Liu H, Tan X, Zhou X (2020) Parameter sharing exploration and hetero-center triplet loss for visible-thermal person re-identification. IEEE Trans Multimedia
Liu H, Tan X, Zhou X (2020) Parameter sharing exploration and hetero-center triplet loss for visible-thermal person re-identification. IEEE Trans on Multimedia
Li D, Wei X, Hong X, et al (2020) Infrared-visible cross-modal person re-identification with an x modality. In: Proceedings of the AAAI conference on artificial intelligence. 34(04), pp 4610–4617
Lu Y, Wu Y, Liu B et al (2020) Cross-modality person re-identification with shared-specific feature transfer. In: Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp 13379–13389
Miao J, Wu Y, Liu P et al (2019) Pose-guided feature alignment for occluded person re-identification. In: Proceedings of the IEEE/CVF international conference on computer vision, pp 542-551
Nguyen DT, Hong HG, Kim KW et al (2017) Person recognition system based on a combination of body images from visible light and thermal cameras. Sensors 17(3):605
Song C, Huang Y, Ouyang W, Wang L (2018) Mask-guided contrastive attention model for person re-identification. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 1179–1188
Su C, Zhang S, Xing J, Gao W, Tian Q (2016) Deep attributes driven multi-camera person re-identification. In: ECCV, pp 475–491
Tay CP, Roy S, Yap KH (2019) Aanet: Attribute attention network for person re-identifications. In: Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp 7134–7143
Wan C, Wu Y, Tian X et al (2019) Concentrated local Part Discovery with fine-grained Part Representation for person Re-identification. IEEE Trans Multimedia 22(6):1605–1618
Wang G A, Zhang T, Yang Y, et al (2020) Cross-modality paired-images generation for RGB-infrared person re-identification. In: Proceedings of the AAAI conference on artificial intelligence, 34(07), pp 12144–12151
Wang X, Cordova RS (2022) Global and part feature fusion for cross-modality person re-identification. IEEE Access 10:122038–122046
Wang X, Chen C, Zhu Y, Chen S (2022) Feature fusion and center aggregation for visible-infrared person re-identification. IEEE Access 10:30949–30958
Wang X, Cordova RS (2022) Heterogenous center alignment of dual-path features for text-image person re-identification. In: 2022 International conference on artificial intelligence, information processing and cloud computing (AIIPCC), IEEE. pp 145–148
Wang X, Girshick R, Gupta A et al (2018) Non-local neural networks. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 7794–7803
Wang Z, Wang Z, Zheng Y et al (2019) Learning to reduce dual-level discrepancy for infrared-visible person re-identification. In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp 618–626
Wang G, Yuan Y, Chen X et al (2018) Learning discriminative features with multiple granularities for person re-identification. In: Proceedings of the 26th ACM international conference on multimedia, pp 274–282
Wang G, Zhang T, Cheng J et al (2019) Rgb-infrared cross-modality person re-identification via joint pixel and feature alignment. In: Proceedings of the IEEE/CVF international conference on computer vision, pp 3623–3632
Wang G, Zhang T, Cheng J et al (2019) Rgb-infrared cross-modality person re-identification via joint pixel and feature alignment. In: Proceedings of the IEEE/CVF international conference on computer vision, pp 3623–3632
Wang J, Zhu X, Gong S et al (2018) Transferable joint attribute-identity deep learning for unsupervised person re-identification. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 2275–2284
Wan L, Sun Z, Jing Q et al (2021) G \(^ 2\) DA: Geometry-Guided Dual-Alignment Learning for RGB-Infrared Person Re-Identification. arXiv preprint arXiv:2106.07853
Wei X, Li D, Hong X et al (2020) Co-attentive lifting for infrared-visible person re-identification. In: Proceedings of the 28th ACM international conference on multimedia, pp 1028–1037
Wojke N, Bewley A (2018) Deep cosine metric learning for person re-identification. In: 2018 IEEE winter conference on applications of computer vision (WACV). IEEE, pp 748–756
Woo S, Park J, Lee JY et al (2018) Cbam: Convolutional block attention module. In: Proceedings of the European conference on computer vision (ECCV), pp 3–19
Wu A, Zheng WS, Gong S et al (2020) RGB-IR person re-identification by cross-modality similarity preservation. Int J Comput Vis 128(6):1765–1785
Wu Q, Dai P, Chen J et al (2021) Discover cross-modality nuances for visible-infrared person re-identification. In: Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp 4330–4339
Wu A, Zheng WS, Lai JH (2019) Unsupervised person re-identification by camera-aware similarity consistency learning. In: Proceedings of the IEEE/CVF international conference on computer vision, pp 6922–6931
Wu A, Zheng W-S, Yu H-X, Gong S, Lai J (2017) Rgb-infrared cross-modality person re-identification. In ICCV, pp 5390–5399
Yao H, Zhang S, Hong R et al (2019) Deep representation learning with part loss for person re-identification. IEEE Trans Image Process 28(6):2860–2871
Ye M, Lan X, Wang Z et al (2019) Bi-directional center-constrained top-ranking for visible thermal person re-identification. IEEE Trans Inf Forensics Secur 15:407–419
Ye M, Shen J, Shao L (2020) Visible-infrared person re-identification via homogeneous augmented tri-modal learning. IEEE Trans Inf Forensics Secur 16:728–739
Ye M, Lan X, Li J et al (2018) Hierarchical discriminative learning for visible thermal person re-identification. In: Proceedings of the AAAI conference on artificial intelligence 32(1)
Ye M, Lan X, Li J et al (2018) Hierarchical discriminative learning for visible thermal person re-identification. In: Proceedings of the AAAI conference on artificial intelligence, 32(1)
Ye M, Shen J, Crandall D J et al (2020) Dynamic dual-attentive aggregation learning for visible-infrared person re-identification. In: Computer Vision-ECCV 2020: 16th European Conference, Glasgow, UK, August 23-28, 2020, Proceedings, Part XVII 16. Springer International Publishing, pp 229–247
Ye M, Shen J, Crandall D J et al (2020) Dynamic dual-attentive aggregation learning for visible-infrared person re-identification. In: Computer Vision-ECCV 2020: 16th European Conference, Glasgow, UK, August 23-28, 2020, Proceedings, Part XVII 16. Springer International Publishing, pp 229–247
Ye M, Shen J, Lin G et al (2021) Deep learning for person re-identification: A survey and outlook. IEEE Trans Pattern Anal Mach Intell
Ye M, Wang Z, Lan X et al (2018) Visible thermal person re-identification via dual-constrained top-ranking. In: IJCAI, pp 1: 2
Yin J, Wu A, Zheng WS (2020) Fine-grained person re-identification. Int J Comput Vis 128(6):1654–1672
Yin J, Ma Z, Xie J et al (2021) D\(F^2\)AM: Dual-level feature fusion and affinity modeling for rgb-infrared cross-modality person re-identification. arXiv preprint arXiv:2104.00226
Yu HX, Zheng WS (2020) Weakly supervised discriminative feature learning with state information for person identification. In: Proceedings of the IEEE/CVF conference on computer vision and pattern recognition. pp 5528–5538
Zhang S, Chen C, Song W, Gan Z (2020) Deep feature learning with attributes for cross-modality person re-identification. J Electron Imaging, vol. 29, no. 3
Zhang Q, Lai C, Liu J, Huang N, Han J (2022) Fmcnet: Feature-level modality compensation for visible-infrared person re-identification. In: Proceedings of the IEEE/CVF conference on computer vision and pattern recognition pp 7349–7358
Zhang Z, Lan C, Zeng W et al (2020) Relation-aware global attention for person re-identification. In: Proceedings of the ieee/cvf conference on computer vision and pattern recognition, pp 3186–3195
Zhao C, Lv X, Zhang Z et al (2020) Deep fusion feature representation learning with hard mining center-triplet loss for person re-identification. IEEE Trans Multimedia 22(12):3180–3195
Zhao C, Wang X, Zuo W et al (2020) Similarity learning with joint transfer constraints for person re-identification. Pattern Recog 97:107014
Zhao Y, Shen X, Jin Z et al (2019) Attribute-driven feature disentangling and temporal aggregation for video person re-identification. In: Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp 4913–4922
Zheng L, Zhang H, Sun S et al (2017) Person re-identification in the wild. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 1367–1376
Zhou Q, Zhong B, Lan X et al (2020) Fine-grained spatial alignment model for person re-identification with focal triplet loss. IEEE Tran Image Process 29:7578–7589
Zhu Y, Yang Z, Wang L et al (2020) Hetero-center loss for cross-modality person re-identification. Neurocomputing 386:97–109
Acknowledgements
This work was supported by Anhui Science and Technology Department Project (Grant No. 202004a05020030),Anhui Photovoltaic Industry Generic Technology Research Center (Granted No. 2022AHPV000001) ,and Natural Science Foundation Project of Anhui Province(Granted No. 2022AH040200).
Author information
Authors and Affiliations
Corresponding author
Ethics declarations
Conflicts of interest
The authors have no competing interests to declare that are relevant to the content of this article.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Chen Cuiqun, Zhu Yong, and Chen Shuguang contributed equally to this work.
Rights and permissions
Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Wang, X., Chen, C., Zhu, Y. et al. Dual-attentive cascade clustering learning for visible-infrared person re-identification. Multimed Tools Appl 83, 19729–19746 (2024). https://doi.org/10.1007/s11042-023-16260-6
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11042-023-16260-6