iBet uBet web content aggregator. Adding the entire web to your favor.
iBet uBet web content aggregator. Adding the entire web to your favor.



Link to original content: https://unpaywall.org/10.1007/S11042-023-16260-6
Dual-attentive cascade clustering learning for visible-infrared person re-identification | Multimedia Tools and Applications Skip to main content
Log in

Dual-attentive cascade clustering learning for visible-infrared person re-identification

  • Published:
Multimedia Tools and Applications Aims and scope Submit manuscript

Abstract

Visible-infrared person re-identification (VI Re-ID) is challenging work due to huge inter-modality discrepancies and high similarity among inter-identity infrared images. Current methods aim to alleviate the modality discrepancies by using attention mechanisms and identity learning. However, most of these methods are too complex or fine-grained, which can instead destroy the integrity of subtle information and unavoidably diminishes the distinctiveness of features. Different from existing methods, we propose a novel Dual-Attentive Cascade Clustering Learning Network (DA\(C^2\)LNet) to alleviate inter-modality differences and reduce inter-identity similarities. DA\(C^2\)LNet focuses on learning key and useful information by discovering subtle information distributed in each part of the person’s body, which includes the channel attention module (CAM) and part-based attention module (PbAM). Specifically, we first apply CAM to alleviate modality discrepancies and enhance feature discrimination. Then, we design a PbAM, which is different from spatial attention in pixels, it generates several part pattern maps corresponding to different parts of the person’s body to mine overall nuances for minimizing inter-identity similarities. The two modules are cascaded together to learn distinguishing features. Finally, we introduce a center cluster learning manner to reduce intra-identity inter-modality discrepancies and increase inter-identity variances. Extensive experimental results on two public datasets (SYSU-MM01 and RegDB) demonstrate that DA\(C^2\)LNet outperforms state-of-the-art methods.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Subscribe and save

Springer+ Basic
$34.99 /Month
  • Get 10 units per month
  • Download Article/Chapter or eBook
  • 1 Unit = 1 Article or 1 Chapter
  • Cancel anytime
Subscribe now

Buy Now

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3

Similar content being viewed by others

References

  1. Aggarwal AK, Jaidka P (2022) Segmentation of crop images for crop yield prediction. International Journal of Biology and Biomedicine, 7

  2. Chen B, Deng W, Hu J (2019) Mixed high-order attention network for person re-identification. In: Proceedings of the IEEE/CVF international conference on computer vision, pp 371–381

  3. Chen Y, Wan L, Li Z et al (2021) Neural Feature Search for RGB-Infrared Person Re-Identification. In: Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp 587–597

  4. Choi S, Lee S, Kim Y et al (2020) Hi-cmd: Hierarchical cross-modality disentanglement for visible-infrared person re-identification. In: Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp 10257–10266

  5. Dai P, Ji R, Wang H et al (2018) Cross-modality person re-identification with generative adversarial training. In: IJCAI, pp 1: 2

  6. Farooq A, Awais M, Kittler J et al (2021) AXM-Net: Cross-Modal Context Sharing Attention Network for Person Re-ID. arXiv preprint arXiv:2101.08238

  7. Fu C, Hu Y, Wu X et al (2021) CM-NAS: Cross-modality neural architecture search for visible-infrared person re-identification. arXiv preprint arXiv:2101.08467

  8. Gao G, Shao H, Yu Y et al (2021) Leaning compact and representative features for cross-modality person re-identification. arXiv preprint arXiv:2103.14210

  9. Gu X, Chang H, Ma B et al (2020) Appearance-preserving 3d convolution for video-based person re-identification. European conference on computer vision. Springer, Cham, pp 228–243

    Google Scholar 

  10. Hao Y, Wang N, Gao X et al (2019) Dual-alignment feature embedding for cross-modality person re-identification. In: Proceedings of the 27th ACM international conference on multimedia, pp 57–65

  11. Hao Y, Wang N, Li J, Gao X (2019) HSME: hypersphere manifold embedding for visible thermal person re- identification. In: AAAI, pp 8385–8392

  12. Hao Y, Wang N, Li J, Gao X (2019) Hsme: Hypersphere manifold embedding for visible thermal person re-identification. In: AAAI, pp 8385–8392

  13. Hermans A, Beyer L, Leibe B (2017) In defense of the triplet loss for person re-identification. arXiv preprint arXiv:1703.07737

  14. Hu J, Shen L, Sun G (2018) Squeeze-and-excitation networks. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 7132–7141

  15. Liu H, Cheng J, Wang W et al (2020) Enhancing the discriminative feature learning for visible-thermal cross-modality person re-identification. Neurocomputing 398:11–19

    Article  Google Scholar 

  16. Liu H, Shi W, Huang W et al (2018) A discriminatively learned feature embedding based on multi-loss fusion for person search. In: 2018 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). IEEE pp 1668–1672

  17. Liu H, Tan X, Zhou X (2020) Parameter sharing exploration and hetero-center triplet loss for visible-thermal person re-identification. IEEE Trans Multimedia

  18. Liu H, Tan X, Zhou X (2020) Parameter sharing exploration and hetero-center triplet loss for visible-thermal person re-identification. IEEE Trans on Multimedia

  19. Li D, Wei X, Hong X, et al (2020) Infrared-visible cross-modal person re-identification with an x modality. In: Proceedings of the AAAI conference on artificial intelligence. 34(04), pp 4610–4617

  20. Lu Y, Wu Y, Liu B et al (2020) Cross-modality person re-identification with shared-specific feature transfer. In: Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp 13379–13389

  21. Miao J, Wu Y, Liu P et al (2019) Pose-guided feature alignment for occluded person re-identification. In: Proceedings of the IEEE/CVF international conference on computer vision, pp 542-551

  22. Nguyen DT, Hong HG, Kim KW et al (2017) Person recognition system based on a combination of body images from visible light and thermal cameras. Sensors 17(3):605

    Article  ADS  PubMed  PubMed Central  Google Scholar 

  23. Song C, Huang Y, Ouyang W, Wang L (2018) Mask-guided contrastive attention model for person re-identification. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 1179–1188

  24. Su C, Zhang S, Xing J, Gao W, Tian Q (2016) Deep attributes driven multi-camera person re-identification. In: ECCV, pp 475–491

  25. Tay CP, Roy S, Yap KH (2019) Aanet: Attribute attention network for person re-identifications. In: Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp 7134–7143

  26. Wan C, Wu Y, Tian X et al (2019) Concentrated local Part Discovery with fine-grained Part Representation for person Re-identification. IEEE Trans Multimedia 22(6):1605–1618

    Article  Google Scholar 

  27. Wang G A, Zhang T, Yang Y, et al (2020) Cross-modality paired-images generation for RGB-infrared person re-identification. In: Proceedings of the AAAI conference on artificial intelligence, 34(07), pp 12144–12151

  28. Wang X, Cordova RS (2022) Global and part feature fusion for cross-modality person re-identification. IEEE Access 10:122038–122046

    Article  Google Scholar 

  29. Wang X, Chen C, Zhu Y, Chen S (2022) Feature fusion and center aggregation for visible-infrared person re-identification. IEEE Access 10:30949–30958

    Article  Google Scholar 

  30. Wang X, Cordova RS (2022) Heterogenous center alignment of dual-path features for text-image person re-identification. In: 2022 International conference on artificial intelligence, information processing and cloud computing (AIIPCC), IEEE. pp 145–148

  31. Wang X, Girshick R, Gupta A et al (2018) Non-local neural networks. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 7794–7803

  32. Wang Z, Wang Z, Zheng Y et al (2019) Learning to reduce dual-level discrepancy for infrared-visible person re-identification. In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp 618–626

  33. Wang G, Yuan Y, Chen X et al (2018) Learning discriminative features with multiple granularities for person re-identification. In: Proceedings of the 26th ACM international conference on multimedia, pp 274–282

  34. Wang G, Zhang T, Cheng J et al (2019) Rgb-infrared cross-modality person re-identification via joint pixel and feature alignment. In: Proceedings of the IEEE/CVF international conference on computer vision, pp 3623–3632

  35. Wang G, Zhang T, Cheng J et al (2019) Rgb-infrared cross-modality person re-identification via joint pixel and feature alignment. In: Proceedings of the IEEE/CVF international conference on computer vision, pp 3623–3632

  36. Wang J, Zhu X, Gong S et al (2018) Transferable joint attribute-identity deep learning for unsupervised person re-identification. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 2275–2284

  37. Wan L, Sun Z, Jing Q et al (2021) G \(^ 2\) DA: Geometry-Guided Dual-Alignment Learning for RGB-Infrared Person Re-Identification. arXiv preprint arXiv:2106.07853

  38. Wei X, Li D, Hong X et al (2020) Co-attentive lifting for infrared-visible person re-identification. In: Proceedings of the 28th ACM international conference on multimedia, pp 1028–1037

  39. Wojke N, Bewley A (2018) Deep cosine metric learning for person re-identification. In: 2018 IEEE winter conference on applications of computer vision (WACV). IEEE, pp 748–756

  40. Woo S, Park J, Lee JY et al (2018) Cbam: Convolutional block attention module. In: Proceedings of the European conference on computer vision (ECCV), pp 3–19

  41. Wu A, Zheng WS, Gong S et al (2020) RGB-IR person re-identification by cross-modality similarity preservation. Int J Comput Vis 128(6):1765–1785

    Article  MathSciNet  Google Scholar 

  42. Wu Q, Dai P, Chen J et al (2021) Discover cross-modality nuances for visible-infrared person re-identification. In: Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp 4330–4339

  43. Wu A, Zheng WS, Lai JH (2019) Unsupervised person re-identification by camera-aware similarity consistency learning. In: Proceedings of the IEEE/CVF international conference on computer vision, pp 6922–6931

  44. Wu A, Zheng W-S, Yu H-X, Gong S, Lai J (2017) Rgb-infrared cross-modality person re-identification. In ICCV, pp 5390–5399

  45. Yao H, Zhang S, Hong R et al (2019) Deep representation learning with part loss for person re-identification. IEEE Trans Image Process 28(6):2860–2871

    Article  ADS  MathSciNet  Google Scholar 

  46. Ye M, Lan X, Wang Z et al (2019) Bi-directional center-constrained top-ranking for visible thermal person re-identification. IEEE Trans Inf Forensics Secur 15:407–419

    Article  Google Scholar 

  47. Ye M, Shen J, Shao L (2020) Visible-infrared person re-identification via homogeneous augmented tri-modal learning. IEEE Trans Inf Forensics Secur 16:728–739

    Article  Google Scholar 

  48. Ye M, Lan X, Li J et al (2018) Hierarchical discriminative learning for visible thermal person re-identification. In: Proceedings of the AAAI conference on artificial intelligence 32(1)

  49. Ye M, Lan X, Li J et al (2018) Hierarchical discriminative learning for visible thermal person re-identification. In: Proceedings of the AAAI conference on artificial intelligence, 32(1)

  50. Ye M, Shen J, Crandall D J et al (2020) Dynamic dual-attentive aggregation learning for visible-infrared person re-identification. In: Computer Vision-ECCV 2020: 16th European Conference, Glasgow, UK, August 23-28, 2020, Proceedings, Part XVII 16. Springer International Publishing, pp 229–247

  51. Ye M, Shen J, Crandall D J et al (2020) Dynamic dual-attentive aggregation learning for visible-infrared person re-identification. In: Computer Vision-ECCV 2020: 16th European Conference, Glasgow, UK, August 23-28, 2020, Proceedings, Part XVII 16. Springer International Publishing, pp 229–247

  52. Ye M, Shen J, Lin G et al (2021) Deep learning for person re-identification: A survey and outlook. IEEE Trans Pattern Anal Mach Intell

  53. Ye M, Wang Z, Lan X et al (2018) Visible thermal person re-identification via dual-constrained top-ranking. In: IJCAI, pp 1: 2

  54. Yin J, Wu A, Zheng WS (2020) Fine-grained person re-identification. Int J Comput Vis 128(6):1654–1672

    Article  Google Scholar 

  55. Yin J, Ma Z, Xie J et al (2021) D\(F^2\)AM: Dual-level feature fusion and affinity modeling for rgb-infrared cross-modality person re-identification. arXiv preprint arXiv:2104.00226

  56. Yu HX, Zheng WS (2020) Weakly supervised discriminative feature learning with state information for person identification. In: Proceedings of the IEEE/CVF conference on computer vision and pattern recognition. pp 5528–5538

  57. Zhang S, Chen C, Song W, Gan Z (2020) Deep feature learning with attributes for cross-modality person re-identification. J Electron Imaging, vol. 29, no. 3

  58. Zhang Q, Lai C, Liu J, Huang N, Han J (2022) Fmcnet: Feature-level modality compensation for visible-infrared person re-identification. In: Proceedings of the IEEE/CVF conference on computer vision and pattern recognition pp 7349–7358

  59. Zhang Z, Lan C, Zeng W et al (2020) Relation-aware global attention for person re-identification. In: Proceedings of the ieee/cvf conference on computer vision and pattern recognition, pp 3186–3195

  60. Zhao C, Lv X, Zhang Z et al (2020) Deep fusion feature representation learning with hard mining center-triplet loss for person re-identification. IEEE Trans Multimedia 22(12):3180–3195

    Article  ADS  Google Scholar 

  61. Zhao C, Wang X, Zuo W et al (2020) Similarity learning with joint transfer constraints for person re-identification. Pattern Recog 97:107014

    Article  Google Scholar 

  62. Zhao Y, Shen X, Jin Z et al (2019) Attribute-driven feature disentangling and temporal aggregation for video person re-identification. In: Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp 4913–4922

  63. Zheng L, Zhang H, Sun S et al (2017) Person re-identification in the wild. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 1367–1376

  64. Zhou Q, Zhong B, Lan X et al (2020) Fine-grained spatial alignment model for person re-identification with focal triplet loss. IEEE Tran Image Process 29:7578–7589

    Article  ADS  Google Scholar 

  65. Zhu Y, Yang Z, Wang L et al (2020) Hetero-center loss for cross-modality person re-identification. Neurocomputing 386:97–109

    Article  Google Scholar 

Download references

Acknowledgements

This work was supported by Anhui Science and Technology Department Project (Grant No. 202004a05020030),Anhui Photovoltaic Industry Generic Technology Research Center (Granted No. 2022AHPV000001) ,and Natural Science Foundation Project of Anhui Province(Granted No. 2022AH040200).

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Xianju Wang.

Ethics declarations

Conflicts of interest

The authors have no competing interests to declare that are relevant to the content of this article.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Chen Cuiqun, Zhu Yong, and Chen Shuguang contributed equally to this work.

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Wang, X., Chen, C., Zhu, Y. et al. Dual-attentive cascade clustering learning for visible-infrared person re-identification. Multimed Tools Appl 83, 19729–19746 (2024). https://doi.org/10.1007/s11042-023-16260-6

Download citation

  • Received:

  • Revised:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s11042-023-16260-6

Keywords

Navigation