Clinical text classification with rule-based features and knowledge-guided convolutional neural networks
- PMID: 30943960
- PMCID: PMC6448186
- DOI: 10.1186/s12911-019-0781-4
Clinical text classification with rule-based features and knowledge-guided convolutional neural networks
Abstract
Background: Clinical text classification is an fundamental problem in medical natural language processing. Existing studies have cocnventionally focused on rules or knowledge sources-based feature engineering, but only a limited number of studies have exploited effective representation learning capability of deep learning methods.
Methods: In this study, we propose a new approach which combines rule-based features and knowledge-guided deep learning models for effective disease classification. Critical Steps of our method include recognizing trigger phrases, predicting classes with very few examples using trigger phrases and training a convolutional neural network (CNN) with word embeddings and Unified Medical Language System (UMLS) entity embeddings.
Results: We evaluated our method on the 2008 Integrating Informatics with Biology and the Bedside (i2b2) obesity challenge. The results demonstrate that our method outperforms the state-of-the-art methods.
Conclusion: We showed that CNN model is powerful for learning effective hidden features, and CUIs embeddings are helpful for building clinical text representations. This shows integrating domain knowledge into CNN models is promising.
Keywords: Clinical text classification; Convolutional neural networks; Entity embeddings; Obesity challenge; Word embeddings.
Conflict of interest statement
Ethics approval and consent to participate
Not applicable.
Consent for publication
Not applicable.
Competing interests
The authors declare that they have no competing interests.
Publisher’s Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Figures
Similar articles
-
Combine Factual Medical Knowledge and Distributed Word Representation to Improve Clinical Named Entity Recognition.AMIA Annu Symp Proc. 2018 Dec 5;2018:1110-1117. eCollection 2018. AMIA Annu Symp Proc. 2018. PMID: 30815153 Free PMC article.
-
A clinical text classification paradigm using weak supervision and deep representation.BMC Med Inform Decis Mak. 2019 Jan 7;19(1):1. doi: 10.1186/s12911-018-0723-6. BMC Med Inform Decis Mak. 2019. PMID: 30616584 Free PMC article.
-
Predicting mortality in critically ill patients with diabetes using machine learning and clinical notes.BMC Med Inform Decis Mak. 2020 Dec 30;20(Suppl 11):295. doi: 10.1186/s12911-020-01318-4. BMC Med Inform Decis Mak. 2020. PMID: 33380338 Free PMC article.
-
Convolutional Neural Networks for ATC Classification.Curr Pharm Des. 2018;24(34):4007-4012. doi: 10.2174/1381612824666181112113438. Curr Pharm Des. 2018. PMID: 30417778 Review.
-
Neural network-based approaches for biomedical relation classification: A review.J Biomed Inform. 2019 Nov;99:103294. doi: 10.1016/j.jbi.2019.103294. Epub 2019 Sep 23. J Biomed Inform. 2019. PMID: 31557530 Review.
Cited by
-
Transformer-based active learning for multi-class text annotation and classification.Digit Health. 2024 Oct 17;10:20552076241287357. doi: 10.1177/20552076241287357. eCollection 2024 Jan-Dec. Digit Health. 2024. PMID: 39430702 Free PMC article.
-
Estimating the prevalence of select non-communicable diseases in Saudi Arabia using a population-based sample: econometric analysis with natural language processing.Ann Saudi Med. 2024 Sep-Oct;44(5):329-338. doi: 10.5144/0256-4947.2024.329. Epub 2024 Oct 3. Ann Saudi Med. 2024. PMID: 39368120 Free PMC article.
-
Northwestern University resource and education development initiatives to advance collaborative artificial intelligence across the learning health system.Learn Health Syst. 2024 Apr 15;8(3):e10417. doi: 10.1002/lrh2.10417. eCollection 2024 Jul. Learn Health Syst. 2024. PMID: 39036530 Free PMC article.
-
A computational clinical decision-supporting system to suggest effective anti-epileptic drugs for pediatric epilepsy patients based on deep learning models using patient's medical history.BMC Med Inform Decis Mak. 2024 May 31;24(1):149. doi: 10.1186/s12911-024-02552-w. BMC Med Inform Decis Mak. 2024. PMID: 38822293 Free PMC article.
-
Machine learning natural language processing for identifying venous thromboembolism: systematic review and meta-analysis.Blood Adv. 2024 Jun 25;8(12):2991-3000. doi: 10.1182/bloodadvances.2023012200. Blood Adv. 2024. PMID: 38522096 Free PMC article.
References
-
- Suominen H, Ginter F, Pyysalo S, Airola A, Pahikkala T, Salanter S, Salakoski T. Machine learning to automate the assignment of diagnosis codes to free-text radiology reports: a method description. In: Proceedings of the ICML/UAI/COLT Workshop on Machine Learning for Health-Care Applications: 2008.
Publication types
MeSH terms
Grants and funding
LinkOut - more resources
Full Text Sources
Other Literature Sources