Abstract
In this article, a digital twin approach is proposed for modeling a pharmaceutical drying process using machine learning techniques, driven by data from different sensors captured in-line. The current difficulty with the drying process is mainly due to the manual operator control for choosing the end-point for terminating the drying step. This results in significant variability, depending on which human operator is supervising, and due to the multi-tasking nature of the job, generally allows a longer processing time to be sure the material is completely dry. The objective is to automate the end-point identification, thus optimizing the processing duration and therefore the overall energy consumption of the process. The point at which the drying is complete is indicated by the temperature difference between the ingoing and outgoing air flow. However, the stochastic nature of the process makes the data modeling a challenge. Firstly, a wide selection of supervised statistical and machine learning algorithms was benchmarked to find the one (CatBoost) which gave the best performance with the data. Next, the set of hyper-parameters was found for CatBoost which gave the optimum performance. This gave a best performance of 0.788 (R2) fitting of the drying end-point prediction with the real values, for a large number of batches (over 700 K records). This is considered a good result taking into account the high residual of data models for these data and the stochastic nature of the process. The approach has been deployed in a real setting digital twin to control the drying process cutoff in the production plant. The results show the viability of the approach for modeling the process and automatically identifying the optimum end-point for the drying process, thus achieving significant energy savings which have been quantified as approximately 3.7 MWh per year for the pharmaceutical company.
Similar content being viewed by others
References
Barricelli BR, Casiraghi E, Fogli D (2019) A survey on digital twin: definitions, characteristics, applications, and design implications. IEEE Access 7:167653–167671. https://doi.org/10.1109/ACCESS.2019.2953499
Boschert S, Rosen R (2016) Digital twin—the simulation aspect. Mechatronic futures. Springer, Cham, pp 59–74
Burggraeve A, Monteyne T, Vervaet C, Remon JP, De Beer T (2013) Process analytical tools for monitoring, understanding, and control of pharmaceutical fluidized bed granulation: a review. Eur J Pharm Biopharm 83(1):2–15. https://doi.org/10.1016/j.ejpb.2012.09.008
Chen Y, Yang O, Sampat C, Bhalode P, Ramachandran R, Ierapetritou M (2020) Digital twins in pharmaceutical and biopharmaceutical manufacturing: a literature review. Processes 8(9):1088
Cheng D, Zhang J, Hu Z, Xu S, Fang X. A digital twin-driven approach for on-line controlling quality of marine diesel engine critical parts. Int J Precision Eng Manuf. 2020;21(10)
Colombo EF, Shougarian N, Sinha K, Cascini G, de Weck OL. Value analysis for customizable modular product platforms: theory and case study. Res Eng Design. 2020;31(1)
Fornasiero R, Nettleton D, Kiebler L, Martinez de Yuso A, De Marco CE. AI and BD in process industry: a literature review with an operational perspective. In: APMS 2021 conference, advances in production management systems, 5–9 Sept 2021, Nantes, France (online).
Herwig C, Wolbeling C, Zimmer T (2017) A holistic approach to production control: from industry 4.0 to pharma 4.0. Pharm Eng 37:44–46
Lim KYH, Zheng P, Chen CH (2019) A state-of-the-art survey of digital twin: techniques, engineering product lifecycle management and business innovation perspectives. J Intell Manuf 1–25
Lourenço V, Lochmann D, Reich G, Menezes J, Herdling T, Schewitz J (2012) A quality by design study applied to an industrial pharmaceutical fluid bed granulation. Eur J Pharm Biopharm 81(2):438–447
Markarian J (2018) Modernizing pharma manufacturing. Pharm Technol 42(4):20–25
Nettleton DF, Bugnicourt E, Wasiak C, Rosales A (2016) Towards automatic calibration of in-line machine processes. In: Proceedings of 18th International Conference on Industrial Engineering and Manufacturing (ICIEMPM) 2016, London, UK.
Nettleton DF, Wasiak C, Dorissen J, Gillen D, Tretyak A, Bugnicourt E, Rosales A. Data modeling and calibration of in-line pultrusion and laser ablation machine processes. In: International Conference on Advanced Data Mining and Applications (ICADMA), Barcelona, Spain, Aug 2018.
Park J, Ferguson M, Law KH (2018) Data driven analytics (machine learning) for system characterization, diagnostics and control optimization. https://doi.org/10.1007/978-3-319-91635-4_2
Shafqat S, Kishwer S, Rasool RU et al (2020) Big data analytics enhanced healthcare systems: a review. J Supercomput 76:1754–1799. https://doi.org/10.1007/s11227-017-2222-4
Peterson JJ, Snee RD, McAllister PR, Schoeld TL, Carella AJ (2009) Statistics in pharmaceutical development and manufacturing. J Qual Technol 41(2):111–134
Liu Z (2022) Using neural network to establish manufacture production performance forecasting in IoT environment. J Supercomput. https://doi.org/10.1007/s11227-021-04210-8
Ali ZH, Ali HA (2021) Towards sustainable smart IoT applications architectural elements and design: opportunities, challenges, and open directions. J Supercomput 77:5668–5725. https://doi.org/10.1007/s11227-020-03477-7
Prokhorenkova L, Gusev G, Vorobev A, Dorogush AV, Gulin A. CatBoost: unbiased boosting with categorical features. arXiv preprint arXiv 2017-arxiv.org
Shcherbakov MV, Artem VG, Sergey VC. Proactive and predictive maintenance of cyber-physical systems. In: Cyber-Physical Systems: Advances in Design & Modelling. Springer, Cham;2020. pp. 263–278.
Wasiak C, Nettleton D, Janssen H, Brecher C. Quantification of micro-pullwinding process as basis of data mining algorithms for predictive process model. In: 21st Int. Conf. on Composite Materials (ICCM) 2017, Xi’an, China.
Breiman L (2001) Random forests. Mach Learn 45(1):5–32
Tibshirani R (1996) Regression shrinkage and selection via the lasso. J R Stat Soc Ser B (Methodol) 58(1):267–288. https://doi.org/10.1111/j.2517-6161.1996.tb02080.x
Fix E, Hodges JL (1989) Discriminatory analysis. Nonparametric discrimination: consistency properties. Int Stat Rev 57(3):238–247. https://doi.org/10.1007/978-3-642-38652-7_2
Keskes S, Hanini S, Hentabli M, Laidi M (2020) Artificial intelligence and mathematical modelling of the drying kinetics of pharmaceutical powders. Kem Ind. https://doi.org/10.15255/KUI.2019.038
Petrović J, Chansanroj K, Meier B, Ibrić S, Betz G (2011) Analysis of fluidized bed granulation process using conventional and novel modeling techniques. Eur J Pharm Sci 44:227–234. https://doi.org/10.1016/j.ejps.2011.07.013
Wang W, Ye Z, Gao H, Ouyang D (2021) Computational pharmaceutics-A new paradigm of drug delivery. J Control Release 338:119–136
Oliveira AL (2019) Biotechnology, big data and artificial intelligence. Biotechnol J 14(8):1800613
Arden NS, Fisher AC, Tyner K, Lawrence XY, Lee SL, Kopcha M (2021) Industry 4.0 for pharmaceutical manufacturing: preparing for the smart factories of the future. Int J Pharm 602:120554
Author information
Authors and Affiliations
Corresponding author
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
About this article
Cite this article
Barriga, R., Romero, M., Nettleton, D. et al. Advanced data modeling for industrial drying machine energy optimization. J Supercomput 78, 16820–16840 (2022). https://doi.org/10.1007/s11227-022-04498-0
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11227-022-04498-0