Abstract
Assessing the different factors that contribute to accidents in the workplace is essential to ensure the safety and well-being of employees. Given the importance of risk identification in hazard prediction, this work proposes a comparative study between different feature selection techniques (\(\chi ^2\) test and Forward Feature Selection) combined with learning algorithms (Support Vector Machine, Random Forest, and Naive Bayes), both applied to a database of a leading company in the retail sector, in Portugal. The goal is to conclude which factors of each database have the most significant impact on the occurrence of accidents. Initial databases include accident records, ergonomic workplace analysis, hazard intervention and risk assessment, climate databases, and holiday records. Each method was evaluated based on its accuracy in the forecast of the occurrence of the accident. The results showed that the Forward Feature Selection-Random Forest pair performed better among the assessed combinations, considering the case study database. In addition, data from accident records and ergonomic workplace analysis have the largest number of features with the most significant predictive impact on accident prediction. Future studies will be carried out to evaluate factors from other databases that may have meaningful information for predicting accidents.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
References
Antão, P., Calderón, M., Puig, M., Michail, A., Wooldridge, C., Darbra, M.R.: Identification of occupational health, safety, security (OHHS) and environmental performance indicators in port areas. Saf. Sci. 85, 266–275 (2016)
Beus, J.M., McCord, A.M., Zohar, D.: Workplace safety: a review and research synthesis. Org. Psychol. Rev. 4, 352–381 (2016)
Capodaglio, E.M.: Occupational risk and prolonged standing work in apparel sales assistants. Int. J. Ind. Ergon. 60, 53–59 (2017)
Cervantes, J., Garcia-Lamont, F., Rodríguez-Mazahua, L., Lopez, A.: A comprehensive survey on support vector machine classification: applications, challenges and trends. Neurocomputing 408, 189–215 (2020)
Cioni, M., Sabioli, M.: A survey on semi-supervised feature selection methods. Work Employ. Soc. 30, 858–875 (2016)
European Commission: Communication from the commission to the European parliament, the council, the European economic and social committee and the committee of the regions (2012). https://www.eea.europa.eu/policy-documents/communication-from-the-commission-to-1. Accessed 20 Jan 2022
Dukart, J.: Basic concepts of image classification algorithms applied to study neurodegenerative diseases. Brain Mapp., 641–646 (2015). https://doi.org/10.1016/B978-0-12-397025-1.00072-5
Encarnação, J.: Identificação de perigos e avaliação de riscos nas operações de carga e descarga numa empresa de tratamento e valorização de resíduos. Ph.D. thesis, Escola Superior de Tecnologia do Instituto Politécnico de Setúbal (2014)
Eurostat Statistic Explained: Accidents at work statistics. https://ec.europa.eu/
Garcia-Herrero, S., Mariscal, M.A., Garcia-Rodrigues, J., Ritzel, O.D.: Working conditions, psychological/physical symptoms and occupational accidents. Bayesian network models. Saf. Sci. 50, 1760–1774 (2012)
Kang, K., Ryu, H.: Predicting types of occupational accidents at construction sites in Korea using random forest model. Saf. Sci. 120, 226–236 (2019)
Liaw, A., Wiener, M.: Classification and regression by RandomForest. R News 2 (2002)
Loske, D., Klumpp, M., Keil, M., Neukirchen, T.: Logistics work, ergonomics, and social sustainability: empirical musculoskeletal system strain assessment in retail intralogistics. Logistics 5, 89 (2021)
López-García, J.R., Garcia-Herrero, S., Gutiérrez, J.M., Mariscal, M.A.: Psychosocial and ergonomic conditions at work: influence on the probability of a workplace accident. Saf. Sci. 5 (2019)
Martins, D.M.D., et al.: Dynamic extraction of holiday data for use in a predictive model for workplace accidents. In: Second Symposium of Applied Science for Young Researchers - SASYR (2022, in Press)
Matías, J.M., Rivas, T., Martin, J.E., Taboada, J.: Workplace safety: a review and research synthesis. Int. J. Comput. Math. 85, 559–578 (2008)
Muhammad, L.J., Algehyne, E.A., Usman, S.S., Ahmad, A., Chakraborty, C., Mohammed, I.A.: Supervised machine learning models for prediction of COVID-19 infection using epidemiology dataset. SN Comput. Sci. 2, 11 (2021)
Pordata: Acidentes de trabalho: total e por sector de actividade económica. https://www.pordata.pt
Rivas, T., Paz, M., Martin, J.E., Matías, J.M., García, J.F., Taboada, J.: Explaining and predicting workplace accidents using data-mining techniques. Reliab. Eng. Syst. Saf. 96, 739–747 (2011)
Silva, F.G., et al.: External climate data extraction using the forward feature selection method in the context of occupational safety. In: Gervasi, O., Murgante, B., Misra, S., Rocha, A.M.A.C., Garau, C. (eds.) ICCSA 2022. LNCS, vol. 13378, pp. 3–14. Springer, Cham (2022). https://doi.org/10.1007/978-3-031-10562-3_1
Trivedi, S.K.: A study on credit scoring modeling with different feature selection and machine learning approaches. Technol. Soc. 63, 101413 (2020)
Wang, Y., Jin, Z., Deng, C., Guo, S., Wang, X., Wang, X.: Establishment of safety structure theory. Saf. Sci. 115, 265–277 (2019)
Acknowledgement
The authors are grateful to the Foundation for Science and Technology (FCT, Portugal) for financial support through national funds FCT/MCTES (PIDDAC) to CeDRI (UIDB/05757/2020 and UIDP/05757/2020) and SusTEC (LA/P/0007/2021). This work has been supported by NORTE-01-0247-FEDER-072598 iSafety: Intelligent system for occupational safety and well-being in the retail sector. Inês Sena was supported by FCT PhD grant UI/BD/153348/2022.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2022 The Author(s), under exclusive license to Springer Nature Switzerland AG
About this paper
Cite this paper
Sena, I. et al. (2022). Integrated Feature Selection and Classification Algorithm in the Prediction of Work-Related Accidents in the Retail Sector: A Comparative Study. In: Pereira, A.I., Košir, A., Fernandes, F.P., Pacheco, M.F., Teixeira, J.P., Lopes, R.P. (eds) Optimization, Learning Algorithms and Applications. OL2A 2022. Communications in Computer and Information Science, vol 1754. Springer, Cham. https://doi.org/10.1007/978-3-031-23236-7_14
Download citation
DOI: https://doi.org/10.1007/978-3-031-23236-7_14
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-23235-0
Online ISBN: 978-3-031-23236-7
eBook Packages: Computer ScienceComputer Science (R0)