Abstract
A wheelchair users detector is presented to extend people detection, providing a more general solution to detect people in environments such as houses adapted for independent and assisted living, hospitals, healthcare centers and senior residences. A wheelchair user model is incorporated in a detector whose detections are afterwards combined with the ones obtained using traditional people detectors (we define these as standing people detectors). We have trained a model for classical (DPM) and two for modern (Faster-RCNN and YOLOv3) detection algorithms, to compare their performance. Besides the extensibility proposed with respect to people detection, a dataset of video sequences has been recorded in a real in-door senior residence environment containing wheelchairs users and standing people and it has been released together with the associated ground-truth.
Similar content being viewed by others
References
Alonso IP, Llorca DF, Sotelo MA, Bergasa LM, de Toro PR, Nuevo J, Ocaña M, Garrido MAG (2007) Combination of feature extraction methods for svm pedestrian detection. IEEE Trans Intell Transp Syst 8(2):292–307
Andriluka M, Roth S, Schiele B (2008) People-tracking-by-detection and people-detection-by-tracking. In: Proceedings of computer vision and pattern recognition, pp 1–8
Andriluka M, Roth S, Schiele B (2009) Pictorial structures revisited: People detection and articulated pose estimation. In: Computer vision and pattern recognition, pp 1014–1021
Auvinet E, Multon F, Saint-Arnaud A, Rousseau J, Meunier J (2011) Fall detection with multiple cameras: an occlusion-resistant method based on 3-d silhouette vertical distribution. Inf Technol Biomed 15(2):290–300
Bian Z-P, Hou J, Chau L-P, Magnenat-Thalmann N (2015) Fall detection based on body part tracking using a depth camera. J Biomed Health Inf 19(2):430–439
Dalal N, Triggs B (2005) Histograms of oriented gradients for human detection. In: Proceedings of computer vision and pattern recognition, pp 886–893
Davis J, Goadrich M (2006) The relationship between precision-recall and roc curves. In: International conference on machine learning, pp 233–240
de Chaumont F, Marhic B, Delahoche L, Cauchois C (2004) Generic method for recognition of a wheelchair, even with a low resolution-effective sensor. In: International conference on industrial technology, pp 56–60
Dollàr P, Appel R, Kienzle W (2012) Crosstalk cascades for frame-rate pedestrian detection. In: European conference on computer vision, pp 645–659
Dollàr P, Wojek C, Schiele B, Perona P (2012) Pedestrian detection: an evaluation of the state of the art. IEEE Trans Pattern Anal Mach Intell 34(4):743–761
Enzweiler M, Gavrila DM (2009) Monocular pedestrian detection: survey and experiments. IEEE Trans Pattern Anal Mach Intell 31(12):2179–2195
Felzenszwalb PF, Girshick RB, McAllester D, Ramanan D (2010) Object detection with discriminatively trained part-based models. IEEE Trans Pattern Anal Mach Intell 32(9):1627–1645
Felzenszwalb PF, Girshick R, McAllester D (2010) Discriminatively trained deformable part models, release 4. http://people.cs.uchicago.edu/pff/latent-release4/
Garcia-Martin A, Martinez JM (2010) Obust real time moving people detection in surveillance scenarios. In: international conference on advanced video and signal based surveillance, pp 241– 247
Garcia-Martin A, Martinez JM (2015) Post-processing approaches for improving people detection performance. Comput Vis Image Underst 133:76–89
Gerónimo D, López AM, Sappa AD, Graf T (2010) Survey of pedestrian detection for advanced driver assistance systems. IEEE Trans Pattern Anal Mach Intell 32(7):1239–1258
Girshick R (2015) Fast R-CNN. In: International conference on computer vision, pp 1440–1448. arXiv:1504.08083
Girshick R, Donahue J, Darrell T, Malik J (2013) Rich feature hierarchies for accurate object detection and semantic segmentation. In: Conference on computer vision and pattern recognition, pp 580–587. arXiv:1311.2524
Girshick RB, Iandola FN, Darrell T, Malik J (2014) Deformable part models are convolutional neural networks. In: Proceedings of computer vision and pattern recognition. arXiv:1409.5403
Hare S, Golodetz S, Saffari A, Vineet V, Cheng M, Hicks SL, Torr PHS (2016) Struck: structured output tracking with kernels. IEEE Trans Pattern Anal Mach Intell 38(10):2096–2109
Hosotani D, Yoda I, Sakaue K (2009) Wheelchair recognition by using stereo vision and histogram of oriented gradients (hog) in real environments. In: Workshop on applications of computer vision, pp 1–6
Hu W, Tan T, Wang L, Maybank S (2004) A survey on visual surveillance of object motion and behaviors. IEEE Trans Syst Man Cybern Part C Appl Rev 34 (3):334–352
Huang C-R, Chung P-C, Lin K-W, Tseng S-C (2010) Wheelchair detection using cascaded decision tree. Inf Technol Biomed 14(2):292–300
Huang P-J, yu Chen D (2012) Robust wheelchair pedestrian detection using sparse representation. In: Visual Communications and Image Processing, pp 1–5
Huang Y-X, Hsu S-P, Yu C-C, Chung Y-N, Lin C-T (2013) Applying image technology to detect and track the wheelchair patient safety. In: World conference on e-learning in corporate, government, healthcare, and higher education, pp 2333–2415
Jia X, Lu H, Yang M (2016) Visual tracking via coarse and fine structural local sparse appearance models. IEEE Trans Image Process 25(10):4555–4564
Kilambi P, Ribnick E, Joshi AJ, Masoud O, Papanikolopoulos N (2008) Estimating pedestrian counts in groups. Comput Vis Image Underst 110(1):43–59
Kristan M, Matas J, Leonardis A, Felsberg M, Cehovin L, Fernandez G, Vojir T, Hager G, Nebehay G, Pflugfelder R, Gupta A, Bibi A, Lukezic A, Garcia-Martin A, Saffari A, Petrosino A, Montero AS (2015) The visual object tracking vot2015 challenge results. In: 2015 IEEE international conference on computer vision workshop (ICCVW), pp 564–586
Leibe B, Leonardis A, Schiele B (2008) Robust object detection with interleaved categorization and segmentation. Proc Int J Comput Vis 77(1-3):259–289
Leibe B, Seemann E, Schiele B (2005) Pedestrian detection in crowded scenes. In: Proceedings of Computer Vision and Pattern Recognition, pp 878–885
Li P, Wang D, Wang L, Lu H (2018) Deep visual tracking: review and experimental comparison. Pattern Recogn 76(1):323–338. [Online]. Available: http://www.sciencedirect.com/science/article/pii/S0031320317304612
Myles A, da Vitoria Lobo N, Shah M (2002) Wheelchair detection in a calibrated environment. In: Asian conference on computer vision, pp 1–7
Nam H, Han B (2016) Learning multi-domain convolutional neural networks for visual tracking. In: 2016 IEEE conference on computer vision and pattern recognition (CVPR), pp 4293–4302
Ouyang W, Luo P, Zeng X, Qiu S, Tian Y, Li H, Yang S, Wang Z, Xiong Y, Qian C, Zhu Z, Wang R, Loy C C, Wang X, Tang X (2014) Deepid-net: multi-stage and deformable deep convolutional neural networks for object detection. In: Proceedings of computer vision and pattern recognition. [Online]. Available: arXiv:1409.3505
Redmon J Darknet: Open source neural networks in c. http://pjreddie.com/darknet/
Redmon J, Divvala SK, Girshick RB, Farhadi A (2015) You only look once: Unified, real-time object detection. arXiv:1506.02640
Redmon J, Farhadi A (2016) YOLO9000: Better, faster, stronger. arXiv:1612.08242
Redmon J, Farhadi A (2018) Olov3: An incremental improvement. arXiv:1804.02767
Ren S, He K, Girshick R, Sun J (2015) Faster r-cnn: towards real-time object detection with region proposal networks. Adv Neural Inf Proces Syst 28:91–99
Simonnet D, Velastin S, Turkbeyler E, Orwell J (2012) Backgroundless detection of pedestrians in cluttered conditions based on monocular images: a review. IET Comput Vis 6(6):540–550
Simonyan K, Zisserman A (2014) Very deep convolutional networks for large-scale image recognition. In Proceedings of computer vision and pattern recognition
Valera M, Velastin SA (2005) Intelligent distributed surveillance systems: a review. Visual Image Signal Process 152(2):192–204
Viola P, Jones M (2004) Robust real-time face detection. Int J Comput Vis 57(2):137–154
Wang L, Ouyang W, Wang X, Lu H (2015) Visual tracking with fully convolutional networks,” in 2015 IEEE International Conference on Computer Vision (ICCV), pp 3119–3127
Wojek C, Walk S, Schiele B (June 2009) Multi-cue onboard pedestrian detection. In: Proceedings of computer vision and pattern recognition, pp 794–801
Wu C-W, Liu C-D, Chung P-C (2010) Assistance instruments detection using geometry constrained knowledge for health care centers. In: International conference on future information technology, pp 1–5
Yang C-A, Chung P-C (2007) Recovery of 3-d location and orientation of a wheelchair in a calibrated environment by using single perspective geometry. In: Region 10 Conference, pp 1–4
Yun S, Choi J, Yoo Y, Yun K, Choi J (2017) Action-decision networks for visual tracking with deep reinforcement learning. In: 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp 1349–1358
Acknowledgments
This work has been partially supported by the Spanish government under the project TEC2014-53176-R (HAVideo) and by the Spanish Government FPU grant programme (Ministerio de Educación, Cultura y Deporte).
Author information
Authors and Affiliations
Corresponding author
Additional information
Publisher’s Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
About this article
Cite this article
Martín-Nieto, R., García-Martín, A. & Martínez, J.M. Incorporating wheelchair users in people detection. Multimed Tools Appl 78, 14109–14127 (2019). https://doi.org/10.1007/s11042-018-6822-7
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11042-018-6822-7