iBet uBet web content aggregator. Adding the entire web to your favor.
iBet uBet web content aggregator. Adding the entire web to your favor.



Link to original content: https://pubmed.ncbi.nlm.nih.gov/33936427/
Improving the Utility of Tobacco-Related Problem List Entries Using Natural Language Processing - PubMed Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2021 Jan 25:2020:534-543.
eCollection 2020.

Improving the Utility of Tobacco-Related Problem List Entries Using Natural Language Processing

Affiliations

Improving the Utility of Tobacco-Related Problem List Entries Using Natural Language Processing

Daniel R Harris et al. AMIA Annu Symp Proc. .

Abstract

We present findings on using natural language processing to classify tobacco-related entries from problem lists found within patient's electronic health records. Problem lists describe health-related issues recorded during a patient's medical visit; these problems are typically followed up upon during subsequent visits and are updated for relevance or accuracy. The mechanics of problem lists vary across different electronic health record systems. In general, they either manifest as pre-generated generic problems that may be selected from a master list or as text boxes where a healthcare professional may enter free text describing the problem. Using commonly-available natural language processing tools, we classified tobacco-related problems into three classes: active-user, former-user, and non-user; we further demonstrate that rule-based post-processing may significantly increase precision in identifying these classes (+32%, +22%, +35% respectively). We used these classes to generate tobacco time-spans that reconstruct a patient's tobacco-use history and better support secondary data analysis. We bundle this as an open-source toolkit with flow visualizations indicating how patient tobacco-related behavior changes longitudinally, which can also capture and visualize contradicting information such as smokers being flagged as having never smoked.

PubMed Disclaimer

Figures

Figure 1:
Figure 1:
Distinct problem types and instances of problems over time (Kentucky)
Figure 2:
Figure 2:
New problem types introduced over time (Kentucky)
Figure 3:
Figure 3:
A high-level overview of the CLAMP smoking pipeline
Figure 4:
Figure 4:
Visualizing a single patient’s tobacco-related statuses over time (A) and fixing contradictions (B)
Figure 5:
Figure 5:
Confusion matrix for tobacco status classification
Figure 6:
Figure 6:
Capturing patients and their recorded tobacco-use changes over time

Similar articles

Cited by

References

    1. Weed LL. Medical records that guide and teach (concluded) Yearbook of Medical Informatics. 1968;212:1.
    1. Department of Health and Human Services Health information technology: initial set of standards, implementation specifications, and certification criteria for electronic health record technology. Final rule. Federal Register. 2010;75(144):44589–654. - PubMed
    1. Chute CG, Cohn SP, Campbell KE, Oliver DE, Campbell JR. on Codes & Structures CBPRIWG. The content coverage of clinical classifications. Journal of the American Medical Informatics Association. 1996;3(3):224–233. - PMC - PubMed
    1. Payne TH, Murphy GR, Salazar A. Proceedings of the Annual Symposium on Computer Application in Medical Care. American Medical Informatics Association; 1992. How well does ICD9 represent phrases used in the medical record problem list? p. 654. - PMC - PubMed
    1. Campbell JR, Payne T. Proceedings of the Annual Symposium on Computer Application in Medical Care. American Medical Informatics Association; 1994. A comparison of four schemes for codification of problem lists; p. 201. - PMC - PubMed

Publication types

LinkOut - more resources