Extracting social determinants of health from electronic health records using natural language processing: a systematic review. 2021

Braja G Patra, and Mohit M Sharma, and Veer Vekaria, and Prakash Adekkanattu, and Olga V Patterson, and Benjamin Glicksberg, and Lauren A Lepow, and Euijung Ryu, and Joanna M Biernacka, and Al'ona Furmanchuk, and Thomas J George, and William Hogan, and Yonghui Wu, and Xi Yang, and Jiang Bian, and Myrna Weissman, and Priya Wickramaratne, and J John Mann, and Mark Olfson, and Thomas R Campion, and Mark Weiner, and Jyotishman Pathak
Department of Population Health Sciences, Weill Cornell Medicine, New York, New York, USA.

Social determinants of health (SDoH) are nonclinical dispositions that impact patient health risks and clinical outcomes. Leveraging SDoH in clinical decision-making can potentially improve diagnosis, treatment planning, and patient outcomes. Despite increased interest in capturing SDoH in electronic health records (EHRs), such information is typically locked in unstructured clinical notes. Natural language processing (NLP) is the key technology to extract SDoH information from clinical text and expand its utility in patient care and research. This article presents a systematic review of the state-of-the-art NLP approaches and tools that focus on identifying and extracting SDoH data from unstructured clinical text in EHRs. A broad literature search was conducted in February 2021 using 3 scholarly databases (ACL Anthology, PubMed, and Scopus) following Preferred Reporting Items for Systematic Reviews and Meta-Analyses (PRISMA) guidelines. A total of 6402 publications were initially identified, and after applying the study inclusion criteria, 82 publications were selected for the final review. Smoking status (n = 27), substance use (n = 21), homelessness (n = 20), and alcohol use (n = 15) are the most frequently studied SDoH categories. Homelessness (n = 7) and other less-studied SDoH (eg, education, financial problems, social isolation and support, family problems) are mostly identified using rule-based approaches. In contrast, machine learning approaches are popular for identifying smoking status (n = 13), substance use (n = 9), and alcohol use (n = 9). NLP offers significant potential to extract SDoH data from narrative clinical notes, which in turn can aid in the development of screening tools, risk prediction models, and clinical decision support systems.

UI MeSH Term Description Entries
D009323 Natural Language Processing Computer processing of a language with rules that reflect and describe current usage rather than prescribed usage. Language Processing, Natural,Language Processings, Natural,Natural Language Processings,Processing, Natural Language,Processings, Natural Language
D006801 Humans Members of the species Homo sapiens. Homo sapiens,Man (Taxonomy),Human,Man, Modern,Modern Man
D000069550 Machine Learning A type of ARTIFICIAL INTELLIGENCE that enable COMPUTERS to independently initiate and execute LEARNING when exposed to new data. Transfer Learning,Learning, Machine,Learning, Transfer
D000079803 Data Management Processes that include acquiring, validating, storing, protecting, and processing data to ensure accessibility, reliability, and timeliness for users. Data Administration,Administration, Data,Management, Data
D057286 Electronic Health Records Media that facilitate transportability of pertinent information concerning patient's illness across varied providers and geographic locations. Some versions include direct linkages to online CONSUMER HEALTH INFORMATION that is relevant to the health conditions and treatments related to a specific patient. Electronic Health Record Data,Electronic Medical Record,Electronic Medical Records,Computerized Medical Record,Computerized Medical Records,Electronic Health Record,Medical Record, Computerized,Medical Records, Computerized,Health Record, Electronic,Health Records, Electronic,Medical Record, Electronic,Medical Records, Electronic
D064890 Social Determinants of Health The circumstances in which people are born, grow up, live, work, and age, as well as the systems put in place to deal with illness. These circumstances are in turn shaped by a wider set of forces: economics, social policies, and politics (http://www.cdc.gov/socialdeterminants/). Commercial Determinants of Health,Socio-Economic Determinants of Health,Structural Determinants of Health,Socio Economic Determinants of Health

Related Publications

Braja G Patra, and Mohit M Sharma, and Veer Vekaria, and Prakash Adekkanattu, and Olga V Patterson, and Benjamin Glicksberg, and Lauren A Lepow, and Euijung Ryu, and Joanna M Biernacka, and Al'ona Furmanchuk, and Thomas J George, and William Hogan, and Yonghui Wu, and Xi Yang, and Jiang Bian, and Myrna Weissman, and Priya Wickramaratne, and J John Mann, and Mark Olfson, and Thomas R Campion, and Mark Weiner, and Jyotishman Pathak
March 2022, Journal of biomedical informatics,
Braja G Patra, and Mohit M Sharma, and Veer Vekaria, and Prakash Adekkanattu, and Olga V Patterson, and Benjamin Glicksberg, and Lauren A Lepow, and Euijung Ryu, and Joanna M Biernacka, and Al'ona Furmanchuk, and Thomas J George, and William Hogan, and Yonghui Wu, and Xi Yang, and Jiang Bian, and Myrna Weissman, and Priya Wickramaratne, and J John Mann, and Mark Olfson, and Thomas R Campion, and Mark Weiner, and Jyotishman Pathak
April 2021, JMIR medical informatics,
Braja G Patra, and Mohit M Sharma, and Veer Vekaria, and Prakash Adekkanattu, and Olga V Patterson, and Benjamin Glicksberg, and Lauren A Lepow, and Euijung Ryu, and Joanna M Biernacka, and Al'ona Furmanchuk, and Thomas J George, and William Hogan, and Yonghui Wu, and Xi Yang, and Jiang Bian, and Myrna Weissman, and Priya Wickramaratne, and J John Mann, and Mark Olfson, and Thomas R Campion, and Mark Weiner, and Jyotishman Pathak
July 2024, JAMIA open,
Braja G Patra, and Mohit M Sharma, and Veer Vekaria, and Prakash Adekkanattu, and Olga V Patterson, and Benjamin Glicksberg, and Lauren A Lepow, and Euijung Ryu, and Joanna M Biernacka, and Al'ona Furmanchuk, and Thomas J George, and William Hogan, and Yonghui Wu, and Xi Yang, and Jiang Bian, and Myrna Weissman, and Priya Wickramaratne, and J John Mann, and Mark Olfson, and Thomas R Campion, and Mark Weiner, and Jyotishman Pathak
May 2021, JMIR medical informatics,
Braja G Patra, and Mohit M Sharma, and Veer Vekaria, and Prakash Adekkanattu, and Olga V Patterson, and Benjamin Glicksberg, and Lauren A Lepow, and Euijung Ryu, and Joanna M Biernacka, and Al'ona Furmanchuk, and Thomas J George, and William Hogan, and Yonghui Wu, and Xi Yang, and Jiang Bian, and Myrna Weissman, and Priya Wickramaratne, and J John Mann, and Mark Olfson, and Thomas R Campion, and Mark Weiner, and Jyotishman Pathak
October 2023, BMC bioinformatics,
Braja G Patra, and Mohit M Sharma, and Veer Vekaria, and Prakash Adekkanattu, and Olga V Patterson, and Benjamin Glicksberg, and Lauren A Lepow, and Euijung Ryu, and Joanna M Biernacka, and Al'ona Furmanchuk, and Thomas J George, and William Hogan, and Yonghui Wu, and Xi Yang, and Jiang Bian, and Myrna Weissman, and Priya Wickramaratne, and J John Mann, and Mark Olfson, and Thomas R Campion, and Mark Weiner, and Jyotishman Pathak
December 2023, Health services research,
Braja G Patra, and Mohit M Sharma, and Veer Vekaria, and Prakash Adekkanattu, and Olga V Patterson, and Benjamin Glicksberg, and Lauren A Lepow, and Euijung Ryu, and Joanna M Biernacka, and Al'ona Furmanchuk, and Thomas J George, and William Hogan, and Yonghui Wu, and Xi Yang, and Jiang Bian, and Myrna Weissman, and Priya Wickramaratne, and J John Mann, and Mark Olfson, and Thomas R Campion, and Mark Weiner, and Jyotishman Pathak
March 2022, The journals of gerontology. Series A, Biological sciences and medical sciences,
Braja G Patra, and Mohit M Sharma, and Veer Vekaria, and Prakash Adekkanattu, and Olga V Patterson, and Benjamin Glicksberg, and Lauren A Lepow, and Euijung Ryu, and Joanna M Biernacka, and Al'ona Furmanchuk, and Thomas J George, and William Hogan, and Yonghui Wu, and Xi Yang, and Jiang Bian, and Myrna Weissman, and Priya Wickramaratne, and J John Mann, and Mark Olfson, and Thomas R Campion, and Mark Weiner, and Jyotishman Pathak
January 2024, Studies in health technology and informatics,
Braja G Patra, and Mohit M Sharma, and Veer Vekaria, and Prakash Adekkanattu, and Olga V Patterson, and Benjamin Glicksberg, and Lauren A Lepow, and Euijung Ryu, and Joanna M Biernacka, and Al'ona Furmanchuk, and Thomas J George, and William Hogan, and Yonghui Wu, and Xi Yang, and Jiang Bian, and Myrna Weissman, and Priya Wickramaratne, and J John Mann, and Mark Olfson, and Thomas R Campion, and Mark Weiner, and Jyotishman Pathak
January 2017, Studies in health technology and informatics,
Braja G Patra, and Mohit M Sharma, and Veer Vekaria, and Prakash Adekkanattu, and Olga V Patterson, and Benjamin Glicksberg, and Lauren A Lepow, and Euijung Ryu, and Joanna M Biernacka, and Al'ona Furmanchuk, and Thomas J George, and William Hogan, and Yonghui Wu, and Xi Yang, and Jiang Bian, and Myrna Weissman, and Priya Wickramaratne, and J John Mann, and Mark Olfson, and Thomas R Campion, and Mark Weiner, and Jyotishman Pathak
March 2024, medRxiv : the preprint server for health sciences,
Copied contents to your clipboard!