Abstract
Pedestrian detection in real time to avoid collision for unmanned vehicles is an interesting and challenging problem in computer vision. This paper proposes a new pedestrian detection method by using an appearance-based multi-channel features. The method involves only the monocular environment since most cameras have only a single lens. A pedestrian is represented by the combination of channel features partly weighted according to the clearly visual appearance and occlusion. Handling partial occlusion is carried out by constructing a hierarchical region reduction structure. A full pedestrian image is disintegrated into several horizontal and vertical regions. Each region captures the outstanding appearance of pedestrian’s body. The features extracted from all regions are hierarchically combined to perform the concurrent detection of pedestrian’s occurrence. The experiment yielded good results using standard benchmark dataset. The performance evaluations on miss rate, average false positive per image, and the trade-off between running time and performance show that the proposed detection framework is a reasonable option for real world applications.
Similar content being viewed by others
References
Enzweiler, M., & Gavrila, D.M. (2009). Monocular pedestrian detection: survey and experiments. IEEE Transactions on Pattern Analysis and Machine Intelligence, 31(12), 2179–2195.
Gandhi, T., & Trivedi, M.M. (2007). Pedestrian protection systems: issues, survey, and challenges. IEEE Transactions on Intelligent Transportation Systems, 8(3), 413–429.
Gavrila, D.M. (1999). The visual analysis of human movement: a survey. Computer Vision and Image Understanding, 73(1), 82– 98.
Geronimo, D., Lopez, A.M., Sappa, A.D., Graf, T. (2010). Survey of pedestrian detection for advanced driver assistance systems. IEEE Transactions on Pattern Analysis and Machine Intelligence, 32(7), 1239–1258.
Benenson, R., Omran, M., Hosang, J., Schiele, B. (2014). Ten years of pedestrian detection, what have we learned?. In European conference on computer vision (pp. 613–627).
Viola, P., Jones, M., Snow, D. (2003). Detecting pedestrians using patterns of motion and appearance. IEEE International Conference on Computer Vision, 2, 734–741.
Dalal, N., Triggs, B., Schmid, C. (2006). Human detection using oriented histograms of flow and appearance. In European conference on computer vision (pp. 428–441).
Enzweiler, M., Kanter, P., Gavrila, D.M. (2008). Monocular pedestrian recognition using motion parallax. In IEEE intelligent vehicles symposium (pp. 792–797).
Cao, X. B., Qiao, H., Keane, J. (2008). A low-cost pedestrian-detection system with a single optical camera. IEEE Transactions on Intelligent Transportation Systems, 9(1), 58–67.
Shashua, A., Gdalyahu, Y., Hayun, G. (2004). Pedestrian detection for driving assistance systems: Single-frame classification and system level performance. In Intelligent vehicles symposium (pp. 1–6).
Gavrila, D.M., & Munder, S. (2007). Multi-cue pedestrian detection and tracking from a moving vehicle. International Journal of Computer Vision, 73, 41–59.
Nedevschi, S., Bota, S., Tomiuc, C. (2009). Stereo-based pedestrian detection for collision-avoidance applications. IEEE Transactions on Intelligent Transportation Systems, 10(3), 380–391.
Leibe, B., Cornelis, N., Cornelis, K., Van Gool, L. (2007). Dynamic 3d scene analysis from a moving vehicle. In IEEE conference on computer vision and pattern recognition (pp. 1–8).
Bertozzi, M., Broggi, A., Fascioli, A., Graf, T., Meinecke, M. (2004). Pedestrian detection for driver assistance using multiresolution infrared vision. IEEE Transactions on Vehicular Technology, 53(6), 1666–1678.
Ge, J., Luo, Y., Tei, G. (2009). Real-time pedestrian detection and tracking at night time for driver-assistance systems. IEEE Transactions on Intelligent Transportation Systems, 10, 283– 298.
Dalal, N., & Triggs, B. (2005). Histograms of oriented gradients for human detection. In IEEE conference on computer vision and pattern recognition (pp. 886–893).
Ojala, T., Pietikainen, M., Harwood, D. (1996). A comparative study of texture measures with classification based on featured distributions. Pattern Recognition, 29(1), 51–59.
Tuzel, O., Porikli, F., Meer, P. (2006). Region covariance: A fast descriptor for detection and classification. In European conference on computer vision (pp. 589–600).
Munder, S., & Gavrila, D.M. (2006). An experimental study on pedestrian classification. IEEE Transactions on Pattern Analysis and Machine Intelligence, 28, 1863–1868.
Wohler, C., & Anlauf, J. (1999). An adaptable time-delay neural-network algorithm for image sequence analysis. IEEE Transactions on Neural Networks, 10(6), 1531–1536.
Enzweiler, M., Eigenstetter, A., Schiele, B., Gavrila, D.M. (2010). Multi-cue pedestrian classification with partial occlusion handling. In IEEE conference on computer vision and pattern recognition (pp. 990–997).
Park, D., Zitnick, C. L., Ramanan, D., Dollár, P. (2013). Exploring weak stabilization for motion feature extraction. In IEEE conference on computer vision and pattern recognition (pp. 2882–2889).
Enzweiler, M., & Gavrila, D.M. (2011). A multilevel mixture-of-experts framework for pedestrian classification. IEEE Transactions on Image Processing, 20(10), 2967–2979.
Junior, O.L., Delgado, D., Gonçalves, V., Nunes, U. (2009). Trainable classifier-fusion schemes: an application to pedestrian detection. In International ieee conference on intelligent transportation systems (pp. 1–6).
Xia, X., Yang, W., Li, H., Zhang, S. (2009). Part-based object detection using cascades of boosted classifiers. In Asian conference on computer vision (pp. 556–565).
Zhu, Q., Yeh, M.C., Cheng, K.T., Avidan, S. (2006). Fast human detection using a cascade of histograms of oriented gradients. In IEEE conference on computer vision and pattern recognition (pp. 1491–1498).
Dollár, P., Babenko, B., Belongie, S., Perona, P., Tu, Z. (2008). Multiple component learning for object detection. In European conference on computer vision (pp. 211–224).
Rao, S., Pramod, N.C., Paturu, C.K. (2008). People detection in image and video data. In Proceedings of the 1st ACM workshop on vision networks for behavior analysis (pp. 85–92).
Mohan, A., Papageorgiou, C., Poggio, T. (2001). Example-based object detection in images by components. IEEE Transaction on Pattern Analysis and Machine Intelligence, 23(4), 349– 361.
Felzenszwalb, P., McAllester, D., Ramanan, D. (2008). A discriminatively trained, multiscale, deformable part model. In IEEE conference on computer vision and pattern recognition (pp. 1–8).
Ouyang, W., & Wang, X. (2012). A discriminative deep model for pedestrian detection with occlusion handling. In IEEE conference on computer vision and pattern recognition (pp. 3258–3265).
Bar-Hillel, A., Levi, D., Krupka, E., Goldberg, C. (2010). Part-based feature synthesis for human detection. In European conference on computer vision (pp. 127–142).
Rapus, M., Munder, S., Baratoff, G., Denzler, J. (2009). Pedestrian detection by probabilistic component assembly. In Joint pattern recognition symposium (pp. 91– 100).
Wu, B., & Nevatia, R. (2005). Detection of multiple, partially occluded humans in a single image by bayesian combination of edgelet part detectors. In IEEE international conference on computer vision (pp. 90–97).
Wang, X., Han, T. X., Yan, S. (2009). An HOG-LBP Human detector with partial occlusion handling. In IEEE international conference on computer vision (pp. 32–39).
Park, D., Ramanan, D., Fowlkes, C. (2010). Multiresolution models for object detection. In European conference on computer vision (pp. 241–254).
Yan, J., Zhang, X., Lei, Z., Liao, S., Li, S.Z. (2013). Robust multi-resolution pedestrian detection in traffic scenes. In IEEE conference on computer vision and pattern recognition (pp. 3033–3040).
Ouyang, W., & Wang, X. (2013). Single-pedestrian detection aided by multi-pedestrian detection. In IEEE conference on computer vision and pattern recognition (pp. 3198–3205).
Ouyang, W., & Wang, X. (2013). Joint deep learning for pedestrian detection. In IEEE international conference on computer vision (pp. 2056–2063).
Mathias, M., Benenson, R., Timofte, R., Van Gool, L. (2013). Handling occlusions with franken-classifiers. In IEEE international conference on computer vision (pp. 1505–1512).
Dollar, P., Wojek, C., Schiele, B., Perona, P. (2012). Pedestrian detection: an evaluation of the state of the art. IEEE Transactions on Pattern Analysis and Machine Intelligence, 34(4), 743– 761.
Dollár, P., Appel, R., Belongie, S., Perona, P. (2014). Fast feature pyramids for object detection. IEEE Transactions on Pattern Analysis and Machine Intelligence, 36(8), 1532– 1545.
Viola, P., & Jones, M.J. (2004). Robust real-time face detection. International Journal of Computer Vision, 57(2), 137–154.
Paisitkriangkrai, S., Shen, C., Hengel, A.V.D. (2016). Pedestrian detection with spatially pooled features and structured ensemble learning. IEEE Transactions on Pattern Analysis and Machine Intelligence, 38(6), 1243–1257.
Nam, W., Dollár, P., Han, J.H. (2014). Local decorrelation for improved pedestrian detection. In Advances in neural information processing systems (pp. 424–432).
Luo, P., Tian, Y., Wang, X., Tang, X. (2014). Switchable deep network for pedestrian detection. In IEEE conference on computer vision and pattern recognition (pp. 899–906).
Appel, R., Fuchs, T., Dollár, P., Perona, P. (2013). Quickly boosting decision trees-pruning underachieving features early. In International conference on machine learning (pp. 594– 602).
Hosang, J., Omran, M., Benenson, R., Schiele, B. (2015). Taking a deeper look at pedestrians. In IEEE conference on computer vision and pattern recognition (pp. 4073–4082).
Zhang, S., Benenson, R., Schiele, B. (2015). Filtered channel features for pedestrian detection. In IEEE conference on computer vision and pattern recognition (pp. 1751–1760).
Cao, J., Pang, Y., Li, X. (2016). Pedestrian detection inspired by appearance constancy and shape symmetry. In IEEE conference on computer vision and pattern recognition (pp. 1316– 1324).
Acknowledgments
Wittawin is supported by the program Strategic Scholarships for Frontier Research Network for the Ph.D. Program Thai Doctoral degree from the Office of the Higher Education Commission, Ministry of Education, Thailand and Lursinsap is partially supported by Thailand Research Fund (TRF) under grant number RTA6080013.
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Susutti, W., Lursinsap, C. & Sophatsathit, P. Pedestrian Detection by Using Weighted Channel Features with Hierarchical Region Reduction. J Sign Process Syst 91, 587–608 (2019). https://doi.org/10.1007/s11265-018-1361-z
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11265-018-1361-z