Abstract
Gait recognition is gaining attention as an emerging biotechnology for identifying subjects from a remote distance. Existing works have confirmed that modeling the motion trajectory of the subject is the key to gait recognition. Considering the motion continuity in the gait sequences, 3D CNN is a suitable and powerful tool for extracting spatiotemporal features. However, directly stacking 3D convolutions to extract sequence features will increase the model complexity and the number of parameters. To address the above issues, we propose a lightweight separated asynchronous 3D convolutional neural network (LSA3D) for gait recognition. Unlike common stacked 3D architecture, our network adopts the reverse pyramid design pattern, i.e., the top uses 3D convolution to extract internal features with rich spatiotemporal information, and the bottom uses separated asynchronous convolution to exploit deeper interactions between features. Experiments on the CASIA-B and OUMVLP gait recognition datasets show that LSA3D outperforms the state-of-the-art gait recognition methods.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
References
Chao, H., Wang, K., He, Y., Zhang, J., Feng, J.: Gaitset: cross-view gait recognition through utilizing gait as a deep set. IEEE Trans. Pattern Anal. Mach. Intell. 44(7), 3467–3478 (2022)
Chen, J., Wang, Z., Yi, P., Zeng, K., He, Z., Zou, Q.: Gait pyramid attention network: toward silhouette semantic relation learning for gait recognition. IEEE Trans. Biometrics Behav. Identity Sci. 4(4), 582–595 (2022). https://doi.org/10.1109/TBIOM.2022.3213545
Chen, X., Luo, X., Weng, J., Luo, W., Li, H., Tian, Q.: Multi-view gait image generation for cross-view gait recognition. IEEE Trans. Image Process. 30, 3041–3055 (2021)
Fan, C., et al.: Gaitpart: temporal part-based model for gait recognition. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 14225–14233 (2020)
Huang, Z., et al.: 3d local convolutional neural networks for gait recognition. In: Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 14920–14929 (2021)
Lin, B., Zhang, S., Bao, F.: Gait recognition with multiple-temporal-scale 3d convolutional neural network. In: Proceedings of the 28th ACM International conference on Multimedia, pp. 3054–3062 (2020)
Lin, B., Zhang, S., Yu, X.: Gait recognition via effective global-local feature representation and local temporal aggregation. In: Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 14648–14656 (2021)
Van der Maaten, L., Hinton, G.: Visualizing data using t-SNE. J. Mach. Learn. Res. 9(11), 2579–2605 (2008)
Qin, H., Chen, Z., Guo, Q., Jonathan, Wu., Q.M., Lu, M.: RPNet: gait recognition with relationships between each body-parts. IEEE Trans. Circ. Syst. Video Technol. 32(5), 2990–3000 (2022)
Qiu, Z., Yao, T., Mei, T.: Learning spatio-temporal representation with pseudo-3d residual networks. In: Proceedings of the IEEE International Conference on Computer Vision, pp. 5533–5541 (2017)
Shiraga, K., Makihara, Y., Muramatsu, D., Echigo, T., Yagi, Y.: Geinet: view-invariant gait recognition using a convolutional neural network. In: 2016 International Conference on Biometrics, pp. 1–8. IEEE (2016)
Takemura, N., Makihara, Y., Muramatsu, D., Echigo, T., Yagi, Y.: Multi-view large population gait dataset and its performance evaluation for cross-view gait recognition. IPSJ Trans. Comput. Vis. Appl. 10(1), 1–14 (2018). https://doi.org/10.1186/s41074-018-0039-6
Xie, S., Sun, C., Huang, J., Tu, Z., Murphy, K.: Rethinking spatiotemporal feature learning: speed-accuracy trade-offs in video classification. In: Proceedings of the European Conference on Computer Vision (ECCV), pp. 305–321 (2018)
Yang, H., et al.: Asymmetric 3d convolutional neural networks for action recognition. Pattern Recognit. 85, 1–12 (2019)
Yu, S., Tan, D., Tan, T.: A framework for evaluating the effect of view angle, clothing and carrying condition on gait recognition. In: 18th International Conference on Pattern Recognition, vol. 4, pp. 441–444 (2006)
Zhang, Z., Tran, L., Liu, F., Liu, X.: On learning disentangled representations for gait recognition. IEEE Trans. Pattern Anal. Mach. Intell. 44(1), 345–360 (2022)
Acknowledgements
This work was supposed in part by the National Natural Science Foundation of China (U1903214, 62071339, 61872277, 62072347) and in part by the Open Research Fund from Guangdong Laboratory of Artificial Intelligence and Digital Economy (SZ) (GML-KF-22-16).
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2023 The Author(s), under exclusive license to Springer Nature Switzerland AG
About this paper
Cite this paper
Chen, J., Wang, Z., Zeng, K., Xiao, J., Han, Z. (2023). LSA3D: Lightweight Separate Asynchronous 3D Convolutional Neural Network for Gait Recognition. In: Iliadis, L., Papaleonidas, A., Angelov, P., Jayne, C. (eds) Artificial Neural Networks and Machine Learning – ICANN 2023. ICANN 2023. Lecture Notes in Computer Science, vol 14263. Springer, Cham. https://doi.org/10.1007/978-3-031-44204-9_23
Download citation
DOI: https://doi.org/10.1007/978-3-031-44204-9_23
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-44203-2
Online ISBN: 978-3-031-44204-9
eBook Packages: Computer ScienceComputer Science (R0)