Dual-attentive cascade clustering learning for visible-infrared person re-identification

Wang, Xianju; Chen, Cuiqun; Zhu, Yong; Chen, Shuguang

doi:10.1007/s11042-023-16260-6

Dual-attentive cascade clustering learning for visible-infrared person re-identification

Published: 28 July 2023

Volume 83, pages 19729–19746, (2024)
Cite this article

Multimedia Tools and Applications Aims and scope Submit manuscript

Xianju Wang ORCID: orcid.org/0000-0001-9303-8574^1,2,
Cuiqun Chen³,
Yong Zhu¹ &
…
Shuguang Chen¹

188 Accesses
Explore all metrics

Abstract

Visible-infrared person re-identification (VI Re-ID) is challenging work due to huge inter-modality discrepancies and high similarity among inter-identity infrared images. Current methods aim to alleviate the modality discrepancies by using attention mechanisms and identity learning. However, most of these methods are too complex or fine-grained, which can instead destroy the integrity of subtle information and unavoidably diminishes the distinctiveness of features. Different from existing methods, we propose a novel Dual-Attentive Cascade Clustering Learning Network (DA$C^2$LNet) to alleviate inter-modality differences and reduce inter-identity similarities. DA$C^2$LNet focuses on learning key and useful information by discovering subtle information distributed in each part of the person’s body, which includes the channel attention module (CAM) and part-based attention module (PbAM). Specifically, we first apply CAM to alleviate modality discrepancies and enhance feature discrimination. Then, we design a PbAM, which is different from spatial attention in pixels, it generates several part pattern maps corresponding to different parts of the person’s body to mine overall nuances for minimizing inter-identity similarities. The two modules are cascaded together to learn distinguishing features. Finally, we introduce a center cluster learning manner to reduce intra-identity inter-modality discrepancies and increase inter-identity variances. Extensive experimental results on two public datasets (SYSU-MM01 and RegDB) demonstrate that DA$C^2$LNet outperforms state-of-the-art methods.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Cross-Modality Visible-Infrared Person Re-Identification with Multi-scale Attention and Part Aggregation

Mask-guided dual attention-aware network for visible-infrared person re-identification

Article 10 February 2021

Position Attention-Guided Learning for Infrared-Visible Person Re-identification

References

Aggarwal AK, Jaidka P (2022) Segmentation of crop images for crop yield prediction. International Journal of Biology and Biomedicine, 7
Chen B, Deng W, Hu J (2019) Mixed high-order attention network for person re-identification. In: Proceedings of the IEEE/CVF international conference on computer vision, pp 371–381
Chen Y, Wan L, Li Z et al (2021) Neural Feature Search for RGB-Infrared Person Re-Identification. In: Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp 587–597
Choi S, Lee S, Kim Y et al (2020) Hi-cmd: Hierarchical cross-modality disentanglement for visible-infrared person re-identification. In: Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp 10257–10266
Dai P, Ji R, Wang H et al (2018) Cross-modality person re-identification with generative adversarial training. In: IJCAI, pp 1: 2
Farooq A, Awais M, Kittler J et al (2021) AXM-Net: Cross-Modal Context Sharing Attention Network for Person Re-ID. arXiv preprint arXiv:2101.08238
Fu C, Hu Y, Wu X et al (2021) CM-NAS: Cross-modality neural architecture search for visible-infrared person re-identification. arXiv preprint arXiv:2101.08467
Gao G, Shao H, Yu Y et al (2021) Leaning compact and representative features for cross-modality person re-identification. arXiv preprint arXiv:2103.14210
Gu X, Chang H, Ma B et al (2020) Appearance-preserving 3d convolution for video-based person re-identification. European conference on computer vision. Springer, Cham, pp 228–243
Google Scholar
Hao Y, Wang N, Gao X et al (2019) Dual-alignment feature embedding for cross-modality person re-identification. In: Proceedings of the 27th ACM international conference on multimedia, pp 57–65
Hao Y, Wang N, Li J, Gao X (2019) HSME: hypersphere manifold embedding for visible thermal person re- identification. In: AAAI, pp 8385–8392
Hao Y, Wang N, Li J, Gao X (2019) Hsme: Hypersphere manifold embedding for visible thermal person re-identification. In: AAAI, pp 8385–8392
Hermans A, Beyer L, Leibe B (2017) In defense of the triplet loss for person re-identification. arXiv preprint arXiv:1703.07737
Hu J, Shen L, Sun G (2018) Squeeze-and-excitation networks. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 7132–7141
Liu H, Cheng J, Wang W et al (2020) Enhancing the discriminative feature learning for visible-thermal cross-modality person re-identification. Neurocomputing 398:11–19
Article Google Scholar
Liu H, Shi W, Huang W et al (2018) A discriminatively learned feature embedding based on multi-loss fusion for person search. In: 2018 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). IEEE pp 1668–1672
Liu H, Tan X, Zhou X (2020) Parameter sharing exploration and hetero-center triplet loss for visible-thermal person re-identification. IEEE Trans Multimedia
Liu H, Tan X, Zhou X (2020) Parameter sharing exploration and hetero-center triplet loss for visible-thermal person re-identification. IEEE Trans on Multimedia
Li D, Wei X, Hong X, et al (2020) Infrared-visible cross-modal person re-identification with an x modality. In: Proceedings of the AAAI conference on artificial intelligence. 34(04), pp 4610–4617
Lu Y, Wu Y, Liu B et al (2020) Cross-modality person re-identification with shared-specific feature transfer. In: Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp 13379–13389
Miao J, Wu Y, Liu P et al (2019) Pose-guided feature alignment for occluded person re-identification. In: Proceedings of the IEEE/CVF international conference on computer vision, pp 542-551
Nguyen DT, Hong HG, Kim KW et al (2017) Person recognition system based on a combination of body images from visible light and thermal cameras. Sensors 17(3):605
Article ADS PubMed PubMed Central Google Scholar
Song C, Huang Y, Ouyang W, Wang L (2018) Mask-guided contrastive attention model for person re-identification. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 1179–1188
Su C, Zhang S, Xing J, Gao W, Tian Q (2016) Deep attributes driven multi-camera person re-identification. In: ECCV, pp 475–491
Tay CP, Roy S, Yap KH (2019) Aanet: Attribute attention network for person re-identifications. In: Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp 7134–7143
Wan C, Wu Y, Tian X et al (2019) Concentrated local Part Discovery with fine-grained Part Representation for person Re-identification. IEEE Trans Multimedia 22(6):1605–1618
Article Google Scholar
Wang G A, Zhang T, Yang Y, et al (2020) Cross-modality paired-images generation for RGB-infrared person re-identification. In: Proceedings of the AAAI conference on artificial intelligence, 34(07), pp 12144–12151
Wang X, Cordova RS (2022) Global and part feature fusion for cross-modality person re-identification. IEEE Access 10:122038–122046
Article Google Scholar
Wang X, Chen C, Zhu Y, Chen S (2022) Feature fusion and center aggregation for visible-infrared person re-identification. IEEE Access 10:30949–30958
Article Google Scholar
Wang X, Cordova RS (2022) Heterogenous center alignment of dual-path features for text-image person re-identification. In: 2022 International conference on artificial intelligence, information processing and cloud computing (AIIPCC), IEEE. pp 145–148
Wang X, Girshick R, Gupta A et al (2018) Non-local neural networks. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 7794–7803
Wang Z, Wang Z, Zheng Y et al (2019) Learning to reduce dual-level discrepancy for infrared-visible person re-identification. In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp 618–626
Wang G, Yuan Y, Chen X et al (2018) Learning discriminative features with multiple granularities for person re-identification. In: Proceedings of the 26th ACM international conference on multimedia, pp 274–282
Wang G, Zhang T, Cheng J et al (2019) Rgb-infrared cross-modality person re-identification via joint pixel and feature alignment. In: Proceedings of the IEEE/CVF international conference on computer vision, pp 3623–3632
Wang G, Zhang T, Cheng J et al (2019) Rgb-infrared cross-modality person re-identification via joint pixel and feature alignment. In: Proceedings of the IEEE/CVF international conference on computer vision, pp 3623–3632
Wang J, Zhu X, Gong S et al (2018) Transferable joint attribute-identity deep learning for unsupervised person re-identification. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 2275–2284
Wan L, Sun Z, Jing Q et al (2021) G $^ 2$ DA: Geometry-Guided Dual-Alignment Learning for RGB-Infrared Person Re-Identification. arXiv preprint arXiv:2106.07853
Wei X, Li D, Hong X et al (2020) Co-attentive lifting for infrared-visible person re-identification. In: Proceedings of the 28th ACM international conference on multimedia, pp 1028–1037
Wojke N, Bewley A (2018) Deep cosine metric learning for person re-identification. In: 2018 IEEE winter conference on applications of computer vision (WACV). IEEE, pp 748–756
Woo S, Park J, Lee JY et al (2018) Cbam: Convolutional block attention module. In: Proceedings of the European conference on computer vision (ECCV), pp 3–19
Wu A, Zheng WS, Gong S et al (2020) RGB-IR person re-identification by cross-modality similarity preservation. Int J Comput Vis 128(6):1765–1785
Article MathSciNet Google Scholar
Wu Q, Dai P, Chen J et al (2021) Discover cross-modality nuances for visible-infrared person re-identification. In: Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp 4330–4339
Wu A, Zheng WS, Lai JH (2019) Unsupervised person re-identification by camera-aware similarity consistency learning. In: Proceedings of the IEEE/CVF international conference on computer vision, pp 6922–6931
Wu A, Zheng W-S, Yu H-X, Gong S, Lai J (2017) Rgb-infrared cross-modality person re-identification. In ICCV, pp 5390–5399
Yao H, Zhang S, Hong R et al (2019) Deep representation learning with part loss for person re-identification. IEEE Trans Image Process 28(6):2860–2871
Article ADS MathSciNet Google Scholar
Ye M, Lan X, Wang Z et al (2019) Bi-directional center-constrained top-ranking for visible thermal person re-identification. IEEE Trans Inf Forensics Secur 15:407–419
Article Google Scholar
Ye M, Shen J, Shao L (2020) Visible-infrared person re-identification via homogeneous augmented tri-modal learning. IEEE Trans Inf Forensics Secur 16:728–739
Article Google Scholar
Ye M, Lan X, Li J et al (2018) Hierarchical discriminative learning for visible thermal person re-identification. In: Proceedings of the AAAI conference on artificial intelligence 32(1)
Ye M, Lan X, Li J et al (2018) Hierarchical discriminative learning for visible thermal person re-identification. In: Proceedings of the AAAI conference on artificial intelligence, 32(1)
Ye M, Shen J, Crandall D J et al (2020) Dynamic dual-attentive aggregation learning for visible-infrared person re-identification. In: Computer Vision-ECCV 2020: 16th European Conference, Glasgow, UK, August 23-28, 2020, Proceedings, Part XVII 16. Springer International Publishing, pp 229–247
Ye M, Shen J, Crandall D J et al (2020) Dynamic dual-attentive aggregation learning for visible-infrared person re-identification. In: Computer Vision-ECCV 2020: 16th European Conference, Glasgow, UK, August 23-28, 2020, Proceedings, Part XVII 16. Springer International Publishing, pp 229–247
Ye M, Shen J, Lin G et al (2021) Deep learning for person re-identification: A survey and outlook. IEEE Trans Pattern Anal Mach Intell
Ye M, Wang Z, Lan X et al (2018) Visible thermal person re-identification via dual-constrained top-ranking. In: IJCAI, pp 1: 2
Yin J, Wu A, Zheng WS (2020) Fine-grained person re-identification. Int J Comput Vis 128(6):1654–1672
Article Google Scholar
Yin J, Ma Z, Xie J et al (2021) D$F^2$AM: Dual-level feature fusion and affinity modeling for rgb-infrared cross-modality person re-identification. arXiv preprint arXiv:2104.00226
Yu HX, Zheng WS (2020) Weakly supervised discriminative feature learning with state information for person identification. In: Proceedings of the IEEE/CVF conference on computer vision and pattern recognition. pp 5528–5538
Zhang S, Chen C, Song W, Gan Z (2020) Deep feature learning with attributes for cross-modality person re-identification. J Electron Imaging, vol. 29, no. 3
Zhang Q, Lai C, Liu J, Huang N, Han J (2022) Fmcnet: Feature-level modality compensation for visible-infrared person re-identification. In: Proceedings of the IEEE/CVF conference on computer vision and pattern recognition pp 7349–7358
Zhang Z, Lan C, Zeng W et al (2020) Relation-aware global attention for person re-identification. In: Proceedings of the ieee/cvf conference on computer vision and pattern recognition, pp 3186–3195
Zhao C, Lv X, Zhang Z et al (2020) Deep fusion feature representation learning with hard mining center-triplet loss for person re-identification. IEEE Trans Multimedia 22(12):3180–3195
Article ADS Google Scholar
Zhao C, Wang X, Zuo W et al (2020) Similarity learning with joint transfer constraints for person re-identification. Pattern Recog 97:107014
Article Google Scholar
Zhao Y, Shen X, Jin Z et al (2019) Attribute-driven feature disentangling and temporal aggregation for video person re-identification. In: Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp 4913–4922
Zheng L, Zhang H, Sun S et al (2017) Person re-identification in the wild. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 1367–1376
Zhou Q, Zhong B, Lan X et al (2020) Fine-grained spatial alignment model for person re-identification with focal triplet loss. IEEE Tran Image Process 29:7578–7589
Article ADS Google Scholar
Zhu Y, Yang Z, Wang L et al (2020) Hetero-center loss for cross-modality person re-identification. Neurocomputing 386:97–109
Article Google Scholar

Download references

Acknowledgements

This work was supported by Anhui Science and Technology Department Project (Grant No. 202004a05020030),Anhui Photovoltaic Industry Generic Technology Research Center (Granted No. 2022AHPV000001) ,and Natural Science Foundation Project of Anhui Province(Granted No. 2022AH040200).

Author information

Authors and Affiliations

School of Physics and Electronic Engineering, Fuyang Normal University, Qinghe, Fuyang, 236037, Anhui, China
Xianju Wang, Yong Zhu & Shuguang Chen
Graduate School, Angeles University Foundation, Angeles, 2009, Philippines
Xianju Wang
School of Computer Science and Information Engineering, Hefei University of Technology, Tunxi, Hefei, 230009, Anhui, China
Cuiqun Chen

Authors

Xianju Wang
View author publications
You can also search for this author in PubMed Google Scholar
Cuiqun Chen
View author publications
You can also search for this author in PubMed Google Scholar
Yong Zhu
View author publications
You can also search for this author in PubMed Google Scholar
Shuguang Chen
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Xianju Wang.

Ethics declarations

Conflicts of interest

The authors have no competing interests to declare that are relevant to the content of this article.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Chen Cuiqun, Zhu Yong, and Chen Shuguang contributed equally to this work.

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Cite this article

Wang, X., Chen, C., Zhu, Y. et al. Dual-attentive cascade clustering learning for visible-infrared person re-identification. Multimed Tools Appl 83, 19729–19746 (2024). https://doi.org/10.1007/s11042-023-16260-6

Download citation

Received: 12 January 2022
Revised: 01 May 2023
Accepted: 04 July 2023
Published: 28 July 2023
Issue Date: February 2024
DOI: https://doi.org/10.1007/s11042-023-16260-6

Keywords

Access this article

Log in via an institution

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Dual-attentive cascade clustering learning for visible-infrared person re-identification

Abstract

Access this article

Subscribe and save

Buy Now

Similar content being viewed by others

Cross-Modality Visible-Infrared Person Re-Identification with Multi-scale Attention and Part Aggregation

Mask-guided dual attention-aware network for visible-infrared person re-identification

Position Attention-Guided Learning for Infrared-Visible Person Re-identification

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Ethics declarations

Conflicts of interest

Additional information

Publisher's Note

Rights and permissions

About this article

Cite this article

Keywords

Subscribe and save

Buy Now

Navigation

Dual-attentive cascade clustering learning for visible-infrared person re-identification

Abstract

Access this article

Subscribe and save

Buy Now

Similar content being viewed by others

Cross-Modality Visible-Infrared Person Re-Identification with Multi-scale Attention and Part Aggregation

Mask-guided dual attention-aware network for visible-infrared person re-identification

Position Attention-Guided Learning for Infrared-Visible Person Re-identification

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Ethics declarations

Conflicts of interest

Additional information

Publisher's Note

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Subscribe and save

Buy Now

Search

Navigation