Abstract
In Internet of Things (IoT) environments, visual sensors with good performance have been used to create and apply various kinds of image data. Particularly, in the field of human–computer interaction, the image sensor interface using human hands is applicable to sign language recognition, games, object operation in virtual reality, and remote surgery. With the popularization of depth cameras, there has been a new interest in the research conducted in RGB images. Nevertheless, hand pose estimation is hard. Research on hand pose estimation has multiple issues, including high-dimensional degrees of freedom, shape changes, self-occlusion, and real-time condition. To address the issues, this study proposes the random forests-based method of hierarchically estimating hand pose in depth images. In this study, the hierarchical estimation method that individually handles hand palms and fingers with the use of an inverse matrix is utilized to address high-dimensional degrees of freedom, shape changes, and self-occlusion. For real-time execution, random forests using simple characteristics are applied. As shown in the experimental results of this study, the proposed hierarchical estimation method estimates the hand pose in input depth images more robustly and quickly than other existing methods.
Similar content being viewed by others
References
Teixeira FA, Pereira FMQ, Wong H-C, Nogueira JMS, Oliveira LB (2019) SIoT: securing Internet of Things through distributed systems analysis. Future Gener Comput Syst 92:1172–1186
Zhang X, Yue WT (2019) Transformative value of the Internet of Things and pricing decisions. In: Electronic Commerce Research and Applications, vol 34, Article 100825
Costa KAP, Papa JP, Lisboa CO, Munoz R, Albuquerque VHC (2019) Internet of Things: a survey on machine learning-based intrusion detection approaches. Comput Netw 151:147–157
Kim J-J (2017) Spatio-temporal sensor data processing techniques. J Inf Process Syst 13(5):1259–1276
Bendimerad N, Kechar B (2015) Rotational wireless video sensor networks with obstacle avoidance capability for improving disaster area coverage. J Inf Process Syst 11(4):509–527
Liu L, Liu Y, Wang L, Zomaya A, Hu S (2015) Economical and balanced energy usage in the smart home infrastructure: a tutorial and new results. IEEE Trans Emerg Top Comput 3(4):556–570
Wiederhold BK, Miller IT, Wiederhold MD (2018) Using virtual reality to mobilize health care: mobile virtual reality technology for attenuation of anxiety and pain. IEEE Consum Electron Mag 7(1):106–109
Grubert J, Langlotz T, Zollmann S, Regenbrecht H (2017) Towards pervasive augmented reality: context-awareness in augmented reality. IEEE Trans Vis Comput Graph 23(6):1706–1724
Finogeev AG, Parygin DS, Finogeev AA (2017) The convergence computing model for big sensor data mining and knowledge discovery. Hum-Centric Comput Inf Sci 7(1):1–16
Xiao F, Lu M, Zhao Y, Menasria S, Meng D, Xie S, Li J, Li C (2018) An information-aware visualization for privacy-preserving accelerometer data sharing. Hum-Centric Comput Inf Sci 8(1):1–28
Xu Y, Ding C, Shu X, Gui K, Zhang D (2019) Shared control of a robotic arm using non-invasive brain–computer interface and computer vision guidance. Robot Auton Syst 115:121–129
Hassan MU, Rehmani MH, Chen J (2019) Privacy preservation in blockchain-based IoT systems: integration issues, prospects, challenges, and future research directions. Future Gener Comput Syst 97:512–529
Dinh D-L, Kim JT, Kim T-S (2014) Hand gesture recognition and interface via a depth imaging sensor for smart home appliances. Energy Procedia 62:576–582
Cantillo-Negrete J, Carino-Escobar RI, Carrillo-Mora P, Barraza-Madrigal JA, Arias-Carrion O (2019) Robotic orthosis compared to virtual hand for brain–computer interface feedback. Biocybern Biomed Eng 39(2):263–272
Erazo O, Pino JA (2018) Predicting user performance time for hand gesture interfaces. Int J Ind Ergon 65:122–138
Vuletic T, Duffy A, Hay L, McTeague C, Grealy M (2019) Systematic literature review of hand gestures used in human computer interaction interfaces. Int J Hum Comput Stud 129:74–94
Fan Q, Shen X, Hu Y, Yu C (2019) Simple very deep convolutional network for robust hand pose regression from a single depth image. Pattern Recognit Lett 119:205–213
Fang B, Sun F, Liu H, Liu C (2018) 3D human gesture capturing and recognition by the IMMU-based data glove. Neurocomputing 277:198–207
Chan ATS, Leong HV, Kong SH (2009) Real-time tracking of hand gestures for interactive game design. In: Proceedings of the IEEE International Symposium on Industrial Electronics, Seoul, Korea, pp 98–103
Smedt QD, Wannous H, Vandeborre J-P (2016) Skeleton-based dynamic hand gesture recognition. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition Workshops, Las Vegas, USA
Sridhar S, Mueller F, Oulasvirta A, Theobalt C (2015) Fast and robust hand tracking using detection-guided optimization. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Boston, USA, pp 3213–3221
Li R, Liu Z, Tan J (2019) A survey on 3D hand pose estimation: cameras, methods, and datasets. Pattern Recognit 93:251–272
Erol A, Bebis G, Nicolescu M, Boyle RD, Twombly XI (2007) Vision-based hand pose estimation: a review. Comput Vis Image Underst 108(1-2):52–73
Taylor J, Bordeaux L, Cashman T et al (2016) Efficient and precise interactive hand tracking through joint, continuous optimization of pose and correspondences. ACM Trans Graph 35(4):1–12
Remelli E, Tkach A, Tagliasacchi A, Pauly M (2017) Low-dimensionality calibration through local anisotropic scaling for robust hand model personalization. In: Proceedings of the IEEE International Conference on Computer Vision, Venice, Italy, pp 2554–2562
Tkach A, Tagliasacchi A, Remelli E, Pauly M, Fitzgibbon A (2017) Online generative model personalization for hand tracking. ACM Trans Graph 36(6):1–11
Roditakis K, Makris A, Argyros A (2017) Generative 3D hand tracking with spatially constrained pose sampling. In: Proceedings of the British Machine Vision Conference, London, England, pp 1–14
Tkach A, Pauly M, Tagliasacchi A (2016) Sphere-meshes for real-time hand modeling and tracking. ACM Trans Graph 35(6):1–11
Fleishman S, Kliger M, Lerner A, Kutliroff G (2015) ICPIK: inverse kinematics based articulated-ICP. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition Workshops, Boston, USA, pp 28–35
Tagliasacchi A, Tkach A, Bouaziz S, Botsch M, Pauly M (2015) Robust articulated-ICP for real-time hand tracking. Comput Graph Forum 34(5):101–114
Neverova N, Wolf C, Nebout F, Taylor GW (2017) Hand pose estimation through semi-supervised and weakly-supervised learning. Comput Vis Image Underst 164:56–67
Ge L, Liang H, Yuan J, Thalmann D (2017) 3D convolutional neural networks for efficient and robust hand pose estimation from single depth images. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Honolulu, USA, pp 5679–5688
Malik J, Elhayek A, Stricker D (2017) Simultaneous hand pose and skeleton bone-lengths estimation from a single depth image. In: Proceedings of the International Conference on 3D Vision, pp 557–565
Hu Z, Hu Y, Wu B, Liu J, Han D, Kurfess T (2018) Hand pose estimation with multi-scale network. Appl Intell 48(8):2501–2515
Ge L, Liang H, Yuan J, Thalmann D (2019) Real-time 3D hand pose estimation with 3D convolutional neural networks. IEEE Trans Pattern Anal Mach Intell 41(4):956–970
Poier G, Roditakis K, Schulter S, Michel D, Bischof H, Argyros AA (2015) Hybrid one-shot 3D hand pose estimation by exploiting uncertainties. In: Proceedings of the British Machine Vision Conference, pp 1–14
Oberweger M, Wohlhart P, Lepetit V (2015) Hands deep in deep learning for hand pose estimation. In: Proceedings of the Computer Vision Winter Workshop
Sanchez-Riera J, Srinivasan K, Hua KL, Cheng WH, Hossain MA, Alhamid MF (2018) Robust RGB-D hand tracking using deep learning priors. IEEE Trans Circuits Syst Video Technol 28(9):2289–2301
Krejov P, Gilbert A, Bowden R (2017) Guided optimisation through classification and regression for hand pose estimation. Comput Vis Image Underst 155:124–138
Madadi M, Escalera S, Carruesco A, Andujar C, Baro X, Gonzalez J (2017) Occlusion aware hand pose recovery from sequences of depth images. In: Proceedings of the IEEE International Conference and Workshops on Automatic Face and Gesture Recognition, pp 230–237
Chen T, Ting PW, Wu MY, Fu LC (2018) Learning a deep network with spherical part model for 3D hand pose estimation. Pattern Recognit 80:1–20
Ahmad A, Migniot C, Dipanda A (2019) Hand pose estimation and tracking in real and virtual interaction: a review. Image Vis Comput 89:35–49
Zhang S, Wang H, Gao J-G, Xing C-Q (2019) Frequency domain point cloud registration based on the Fourier transform. J Vis Commun Image Represent 61:170–177
Pujol-Miro A, Casas JR, Ruiz-Hidalgo J (2019) Correspondence matching in unorganized 3D point clouds using convolutional neural networks. Image Vis Comput 83–84:51–60
Mineo C, Pierce SG, Summan R (2019) Novel algorithms for 3D surface point cloud boundary detection and edge reconstruction. J Comput Des Eng 6(1):81–91
Pernek A, Hajder L (2013) Automatic focal length estimation as an eigenvalue problem. Pattern Recognit Lett 34(9):1108–1117
Rasouli S, Rajabi Y, Sarabi H (2013) Microlenses focal length measurement using Z-scan and parallel moire deflectometry. Opt Lasers Eng 51(12):1321–1326
Zhu Z, Wang X, Liu Q, Zhang F (2019) Camera calibration method based on optimal polarization angle. Opt Lasers Eng 112:128–135
Duan F, Wu F, Zhou M, Deng X, Tian Y (2012) Calibrating effective focal length for central catadioptric cameras using one space line. Pattern Recogn Lett 33(5):646–653
Yajai A, Rasmequan S (2017) Adaptive directional bounding box from RGB-D information for improving fall detection. J Vis Commun Image Represent 49:257–273
Sun J, Zhong G, Huang K, Dong J (2018) Banzhaf random forests: cooperative game theory based random forests with consistency. Neural Netw 106:20–29
Cao H, Bernard S, Sabourin R, Heutte L (2019) Random forest dissimilarity based multi-view learning for radiomics application. Pattern Recognit 88:185–197
Zerbini CB, Carvalho LF, Abrao T, Proenca ML (2019) Wavelet against random forest for anomaly mitigation in software-defined networking. Appl Soft Comput 80:138–153
Abellan J, Mantas CJ, Castellano JG, Moral-Garcia S (2018) Increasing diversity in random forest learning algorithm via imprecise probabilities. Expert Syst Appl 97:228–243
Shotton J, Fitzgibbon A, Cook M, Sharp T, Finocchio M, Moore R, Kipman A, Blake A (2011) Real-time human pose recognition in parts from single depth images. In: Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition, Colorado, USA, pp 1297–1304
Bohat VK, Arya KV (2019) A new heuristic for multilevel thresholding of images. Expert Syst Appl 117:176–203
Elaziz MA, Lu S (2019) Many-objectives multilevel thresholding image segmentation using knee evolutionary algorithm. Expert Syst Appl 125:305–316
Wang Y, Luo X, Ding L, Fu S, Wei X (2019) Detection based visual tracking with convolutional neural network. Knowl-Based Syst 175:62–71
Acknowledgements
This research was supported by the MSIT (Ministry of Science and ICT), Korea, under the ITRC (Information Technology Research Center) support program (IITP-2018-0-01419) supervised by the IITP (Institute for Information & communications Technology Promotion).
Author information
Authors and Affiliations
Corresponding author
Ethics declarations
Conflict of interest
The authors declare that they have no conflict of interest.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
About this article
Cite this article
Kim, SH., Jang, SW., Park, JH. et al. Robust hand pose estimation using visual sensor in IoT environment. J Supercomput 76, 5382–5401 (2020). https://doi.org/10.1007/s11227-019-03082-3
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11227-019-03082-3