The Use of Deep Learning and Machine Learning on Longitudinal Electronic Health Records for the Early Detection and Prevention of Diseases: Scoping Review
- PMID: 39163096
- PMCID: PMC11372333
- DOI: 10.2196/48320
The Use of Deep Learning and Machine Learning on Longitudinal Electronic Health Records for the Early Detection and Prevention of Diseases: Scoping Review
Abstract
Background: Electronic health records (EHRs) contain patients' health information over time, including possible early indicators of disease. However, the increasing amount of data hinders clinicians from using them. There is accumulating evidence suggesting that machine learning (ML) and deep learning (DL) can assist clinicians in analyzing these large-scale EHRs, as algorithms thrive on high volumes of data. Although ML has become well developed, studies mainly focus on engineering but lack medical outcomes.
Objective: This study aims for a scoping review of the evidence on how the use of ML on longitudinal EHRs can support the early detection and prevention of disease. The medical insights and clinical benefits that have been generated were investigated by reviewing applications in a variety of diseases.
Methods: This study was conducted according to the PRISMA (Preferred Reporting Items for Systematic Reviews and Meta-Analyses) guidelines. A literature search was performed in 2022 in collaboration with a medical information specialist in the following databases: PubMed, Embase, Web of Science Core Collection (Clarivate Analytics), and IEEE Xplore Digital Library and computer science bibliography. Studies were eligible when longitudinal EHRs were used that aimed for the early detection of disease via ML in a prevention context. Studies with a technical focus or using imaging or hospital admission data were beyond the scope of this review. Study screening and selection and data extraction were performed independently by 2 researchers.
Results: In total, 20 studies were included, mainly published between 2018 and 2022. They showed that a variety of diseases could be detected or predicted, particularly diabetes; kidney diseases; diseases of the circulatory system; and mental, behavioral, and neurodevelopmental disorders. Demographics, symptoms, procedures, laboratory test results, diagnoses, medications, and BMI were frequently used EHR data in basic recurrent neural network or long short-term memory techniques. By developing and comparing ML and DL models, medical insights such as a high diagnostic performance, an earlier detection, the most important predictors, and additional health indicators were obtained. A clinical benefit that has been evaluated positively was preliminary screening. If these models are applied in practice, patients might also benefit from personalized health care and prevention, with practical benefits such as workload reduction and policy insights.
Conclusions: Longitudinal EHRs proved to be helpful for support in health care. Current ML models on EHRs can support the detection of diseases in terms of accuracy and offer preliminary screening benefits. Regarding the prevention of diseases, ML and specifically DL models can accurately predict or detect diseases earlier than current clinical diagnoses. Adding personally responsible factors allows targeted prevention interventions. While ML models based on textual EHRs are still in the developmental stage, they have high potential to support clinicians and the health care system and improve patient outcomes.
Keywords: artificial intelligence; big data; detection; electronic health records; machine learning; personalized health care; prediction; prevention.
©Laura Swinckels, Frank C Bennis, Kirsten A Ziesemer, Janneke F M Scheerman, Harmen Bijwaard, Ander de Keijzer, Josef Jan Bruers. Originally published in the Journal of Medical Internet Research (https://www.jmir.org), 20.08.2024.
Conflict of interest statement
Conflicts of Interest: None declared.
Figures
Similar articles
-
Folic acid supplementation and malaria susceptibility and severity among people taking antifolate antimalarial drugs in endemic areas.Cochrane Database Syst Rev. 2022 Feb 1;2(2022):CD014217. doi: 10.1002/14651858.CD014217. Cochrane Database Syst Rev. 2022. PMID: 36321557 Free PMC article.
-
The future of Cochrane Neonatal.Early Hum Dev. 2020 Nov;150:105191. doi: 10.1016/j.earlhumdev.2020.105191. Epub 2020 Sep 12. Early Hum Dev. 2020. PMID: 33036834
-
Artificial intelligence-based methods for fusion of electronic health records and imaging data.Sci Rep. 2022 Oct 26;12(1):17981. doi: 10.1038/s41598-022-22514-4. Sci Rep. 2022. PMID: 36289266 Free PMC article.
-
Machine learning models to detect and predict patient safety events using electronic health records: A systematic review.Int J Med Inform. 2023 Dec;180:105246. doi: 10.1016/j.ijmedinf.2023.105246. Epub 2023 Oct 9. Int J Med Inform. 2023. PMID: 37837710 Review.
-
Deep representation learning of patient data from Electronic Health Records (EHR): A systematic review.J Biomed Inform. 2021 Mar;115:103671. doi: 10.1016/j.jbi.2020.103671. Epub 2020 Dec 31. J Biomed Inform. 2021. PMID: 33387683 Free PMC article. Review.
Cited by
-
Predicting survival benefits of immune checkpoint inhibitor therapy in lung cancer patients: a machine learning approach using real-world data.Int J Clin Pharm. 2024 Oct 29. doi: 10.1007/s11096-024-01818-7. Online ahead of print. Int J Clin Pharm. 2024. PMID: 39470981
References
-
- Xie F, Yuan H, Ning Y, Ong ME, Feng M, Hsu W, Chakraborty B, Liu N. Deep learning for temporal data representation in electronic health records: a systematic review of challenges and methodologies. J Biomed Inform. 2022 Mar;126:103980. doi: 10.1016/j.jbi.2021.103980. https://linkinghub.elsevier.com/retrieve/pii/S1532-0464(21)00309-9 S1532-0464(21)00309-9 - DOI - PubMed
-
- Chawla NV, Davis DA. Bringing big data to personalized healthcare: a patient-centered framework. J Gen Intern Med. 2013 Sep;28 Suppl 3(Suppl 3):S660–5. doi: 10.1007/s11606-013-2455-8. https://europepmc.org/abstract/MED/23797912 - DOI - PMC - PubMed
-
- Shickel B, Tighe PJ, Bihorac A, Rashidi P. Deep EHR: a survey of recent advances in deep learning techniques for electronic health record (EHR) analysis. IEEE J Biomed Health Inform. 2018 Sep;22(5):1589–604. doi: 10.1109/JBHI.2017.2767063. https://europepmc.org/abstract/MED/29989977 - DOI - PMC - PubMed
-
- Beath C, Becerra-Fernandez I, Ross J, Short J. Finding value in the information explosion. MIT Sloan Manag Rev. 2012;53:18–20.
Publication types
MeSH terms
LinkOut - more resources
Full Text Sources