Extracting seizure frequency from epilepsy clinic notes: a machine reading approach to natural language processing. 2022

Kevin Xie, and Ryan S Gallagher, and Erin C Conrad, and Chadric O Garrick, and Steven N Baldassano, and John M Bernabei, and Peter D Galer, and Nina J Ghosn, and Adam S Greenblatt, and Tara Jennings, and Alana Kornspun, and Catherine V Kulick-Soper, and Jal M Panchal, and Akash R Pattnaik, and Brittany H Scheid, and Danmeng Wei, and Micah Weitzman, and Ramya Muthukrishnan, and Joongwon Kim, and Brian Litt, and Colin A Ellis, and Dan Roth
Department of Bioengineering, School of Engineering and Applied Sciences, University of Pennsylvania, Philadelphia, Pennsylvania, USA.

Seizure frequency and seizure freedom are among the most important outcome measures for patients with epilepsy. In this study, we aimed to automatically extract this clinical information from unstructured text in clinical notes. If successful, this could improve clinical decision-making in epilepsy patients and allow for rapid, large-scale retrospective research. We developed a finetuning pipeline for pretrained neural models to classify patients as being seizure-free and to extract text containing their seizure frequency and date of last seizure from clinical notes. We annotated 1000 notes for use as training and testing data and determined how well 3 pretrained neural models, BERT, RoBERTa, and Bio_ClinicalBERT, could identify and extract the desired information after finetuning. The finetuned models (BERTFT, Bio_ClinicalBERTFT, and RoBERTaFT) achieved near-human performance when classifying patients as seizure free, with BERTFT and Bio_ClinicalBERTFT achieving accuracy scores over 80%. All 3 models also achieved human performance when extracting seizure frequency and date of last seizure, with overall F1 scores over 0.80. The best combination of models was Bio_ClinicalBERTFT for classification, and RoBERTaFT for text extraction. Most of the gains in performance due to finetuning required roughly 70 annotated notes. Our novel machine reading approach to extracting important clinical outcomes performed at or near human performance on several tasks. This approach opens new possibilities to support clinical practice and conduct large-scale retrospective clinical research. Future studies can use our finetuning pipeline with minimal training annotations to answer new clinical questions.

UI MeSH Term Description Entries
D009323 Natural Language Processing Computer processing of a language with rules that reflect and describe current usage rather than prescribed usage. Language Processing, Natural,Language Processings, Natural,Natural Language Processings,Processing, Natural Language,Processings, Natural Language
D004827 Epilepsy A disorder characterized by recurrent episodes of paroxysmal brain dysfunction due to a sudden, disorderly, and excessive neuronal discharge. Epilepsy classification systems are generally based upon: (1) clinical features of the seizure episodes (e.g., motor seizure), (2) etiology (e.g., post-traumatic), (3) anatomic site of seizure origin (e.g., frontal lobe seizure), (4) tendency to spread to other structures in the brain, and (5) temporal patterns (e.g., nocturnal epilepsy). (From Adams et al., Principles of Neurology, 6th ed, p313) Aura,Awakening Epilepsy,Seizure Disorder,Epilepsy, Cryptogenic,Auras,Cryptogenic Epilepsies,Cryptogenic Epilepsy,Epilepsies,Epilepsies, Cryptogenic,Epilepsy, Awakening,Seizure Disorders
D006801 Humans Members of the species Homo sapiens. Homo sapiens,Man (Taxonomy),Human,Man, Modern,Modern Man
D012189 Retrospective Studies Studies used to test etiologic hypotheses in which inferences about an exposure to putative causal factors are derived from data relating to characteristics of persons under study or to events or experiences in their past. The essential feature is that some of the persons under study have the disease or outcome of interest and their characteristics are compared with those of unaffected persons. Retrospective Study,Studies, Retrospective,Study, Retrospective
D012640 Seizures Clinical or subclinical disturbances of cortical function due to a sudden, abnormal, excessive, and disorganized discharge of brain cells. Clinical manifestations include abnormal motor, sensory and psychic phenomena. Recurrent seizures are usually referred to as EPILEPSY or "seizure disorder." Absence Seizure,Absence Seizures,Atonic Absence Seizure,Atonic Seizure,Clonic Seizure,Complex Partial Seizure,Convulsion,Convulsions,Convulsive Seizure,Convulsive Seizures,Epileptic Seizure,Epileptic Seizures,Generalized Absence Seizure,Generalized Tonic-Clonic Seizures,Jacksonian Seizure,Myoclonic Seizure,Non-Epileptic Seizure,Nonepileptic Seizure,Partial Seizure,Seizure,Seizures, Convulsive,Seizures, Focal,Seizures, Generalized,Seizures, Motor,Seizures, Sensory,Tonic Clonic Seizure,Tonic Seizure,Tonic-Clonic Seizure,Atonic Absence Seizures,Atonic Seizures,Clonic Seizures,Complex Partial Seizures,Convulsion, Non-Epileptic,Generalized Absence Seizures,Myoclonic Seizures,Non-Epileptic Seizures,Nonepileptic Seizures,Partial Seizures,Petit Mal Convulsion,Seizures, Auditory,Seizures, Clonic,Seizures, Epileptic,Seizures, Gustatory,Seizures, Olfactory,Seizures, Somatosensory,Seizures, Tonic,Seizures, Tonic-Clonic,Seizures, Vertiginous,Seizures, Vestibular,Seizures, Visual,Single Seizure,Tonic Seizures,Tonic-Clonic Seizures,Absence Seizure, Atonic,Absence Seizure, Generalized,Absence Seizures, Atonic,Absence Seizures, Generalized,Auditory Seizure,Auditory Seizures,Clonic Seizure, Tonic,Clonic Seizures, Tonic,Convulsion, Non Epileptic,Convulsion, Petit Mal,Convulsions, Non-Epileptic,Focal Seizure,Focal Seizures,Generalized Seizure,Generalized Seizures,Generalized Tonic Clonic Seizures,Generalized Tonic-Clonic Seizure,Gustatory Seizure,Gustatory Seizures,Motor Seizure,Motor Seizures,Non Epileptic Seizure,Non Epileptic Seizures,Non-Epileptic Convulsion,Non-Epileptic Convulsions,Olfactory Seizure,Olfactory Seizures,Partial Seizure, Complex,Partial Seizures, Complex,Seizure, Absence,Seizure, Atonic,Seizure, Atonic Absence,Seizure, Auditory,Seizure, Clonic,Seizure, Complex Partial,Seizure, Convulsive,Seizure, Epileptic,Seizure, Focal,Seizure, Generalized,Seizure, Generalized Absence,Seizure, Generalized Tonic-Clonic,Seizure, Gustatory,Seizure, Jacksonian,Seizure, Motor,Seizure, Myoclonic,Seizure, Non-Epileptic,Seizure, Nonepileptic,Seizure, Olfactory,Seizure, Partial,Seizure, Sensory,Seizure, Single,Seizure, Somatosensory,Seizure, Tonic,Seizure, Tonic Clonic,Seizure, Tonic-Clonic,Seizure, Vertiginous,Seizure, Vestibular,Seizure, Visual,Seizures, Generalized Tonic-Clonic,Seizures, Nonepileptic,Sensory Seizure,Sensory Seizures,Single Seizures,Somatosensory Seizure,Somatosensory Seizures,Tonic Clonic Seizures,Tonic-Clonic Seizure, Generalized,Tonic-Clonic Seizures, Generalized,Vertiginous Seizure,Vertiginous Seizures,Vestibular Seizure,Vestibular Seizures,Visual Seizure,Visual Seizures
D057286 Electronic Health Records Media that facilitate transportability of pertinent information concerning patient's illness across varied providers and geographic locations. Some versions include direct linkages to online CONSUMER HEALTH INFORMATION that is relevant to the health conditions and treatments related to a specific patient. Electronic Health Record Data,Electronic Medical Record,Electronic Medical Records,Computerized Medical Record,Computerized Medical Records,Electronic Health Record,Medical Record, Computerized,Medical Records, Computerized,Health Record, Electronic,Health Records, Electronic,Medical Record, Electronic,Medical Records, Electronic

Related Publications

Kevin Xie, and Ryan S Gallagher, and Erin C Conrad, and Chadric O Garrick, and Steven N Baldassano, and John M Bernabei, and Peter D Galer, and Nina J Ghosn, and Adam S Greenblatt, and Tara Jennings, and Alana Kornspun, and Catherine V Kulick-Soper, and Jal M Panchal, and Akash R Pattnaik, and Brittany H Scheid, and Danmeng Wei, and Micah Weitzman, and Ramya Muthukrishnan, and Joongwon Kim, and Brian Litt, and Colin A Ellis, and Dan Roth
October 2023, BMC bioinformatics,
Kevin Xie, and Ryan S Gallagher, and Erin C Conrad, and Chadric O Garrick, and Steven N Baldassano, and John M Bernabei, and Peter D Galer, and Nina J Ghosn, and Adam S Greenblatt, and Tara Jennings, and Alana Kornspun, and Catherine V Kulick-Soper, and Jal M Panchal, and Akash R Pattnaik, and Brittany H Scheid, and Danmeng Wei, and Micah Weitzman, and Ramya Muthukrishnan, and Joongwon Kim, and Brian Litt, and Colin A Ellis, and Dan Roth
April 2014, Journal of biomedical informatics,
Kevin Xie, and Ryan S Gallagher, and Erin C Conrad, and Chadric O Garrick, and Steven N Baldassano, and John M Bernabei, and Peter D Galer, and Nina J Ghosn, and Adam S Greenblatt, and Tara Jennings, and Alana Kornspun, and Catherine V Kulick-Soper, and Jal M Panchal, and Akash R Pattnaik, and Brittany H Scheid, and Danmeng Wei, and Micah Weitzman, and Ramya Muthukrishnan, and Joongwon Kim, and Brian Litt, and Colin A Ellis, and Dan Roth
January 2017, Studies in health technology and informatics,
Kevin Xie, and Ryan S Gallagher, and Erin C Conrad, and Chadric O Garrick, and Steven N Baldassano, and John M Bernabei, and Peter D Galer, and Nina J Ghosn, and Adam S Greenblatt, and Tara Jennings, and Alana Kornspun, and Catherine V Kulick-Soper, and Jal M Panchal, and Akash R Pattnaik, and Brittany H Scheid, and Danmeng Wei, and Micah Weitzman, and Ramya Muthukrishnan, and Joongwon Kim, and Brian Litt, and Colin A Ellis, and Dan Roth
May 2022, Brain informatics,
Kevin Xie, and Ryan S Gallagher, and Erin C Conrad, and Chadric O Garrick, and Steven N Baldassano, and John M Bernabei, and Peter D Galer, and Nina J Ghosn, and Adam S Greenblatt, and Tara Jennings, and Alana Kornspun, and Catherine V Kulick-Soper, and Jal M Panchal, and Akash R Pattnaik, and Brittany H Scheid, and Danmeng Wei, and Micah Weitzman, and Ramya Muthukrishnan, and Joongwon Kim, and Brian Litt, and Colin A Ellis, and Dan Roth
January 2003, AMIA ... Annual Symposium proceedings. AMIA Symposium,
Kevin Xie, and Ryan S Gallagher, and Erin C Conrad, and Chadric O Garrick, and Steven N Baldassano, and John M Bernabei, and Peter D Galer, and Nina J Ghosn, and Adam S Greenblatt, and Tara Jennings, and Alana Kornspun, and Catherine V Kulick-Soper, and Jal M Panchal, and Akash R Pattnaik, and Brittany H Scheid, and Danmeng Wei, and Micah Weitzman, and Ramya Muthukrishnan, and Joongwon Kim, and Brian Litt, and Colin A Ellis, and Dan Roth
December 2017, BMC medical informatics and decision making,
Kevin Xie, and Ryan S Gallagher, and Erin C Conrad, and Chadric O Garrick, and Steven N Baldassano, and John M Bernabei, and Peter D Galer, and Nina J Ghosn, and Adam S Greenblatt, and Tara Jennings, and Alana Kornspun, and Catherine V Kulick-Soper, and Jal M Panchal, and Akash R Pattnaik, and Brittany H Scheid, and Danmeng Wei, and Micah Weitzman, and Ramya Muthukrishnan, and Joongwon Kim, and Brian Litt, and Colin A Ellis, and Dan Roth
January 2023, Health informatics journal,
Kevin Xie, and Ryan S Gallagher, and Erin C Conrad, and Chadric O Garrick, and Steven N Baldassano, and John M Bernabei, and Peter D Galer, and Nina J Ghosn, and Adam S Greenblatt, and Tara Jennings, and Alana Kornspun, and Catherine V Kulick-Soper, and Jal M Panchal, and Akash R Pattnaik, and Brittany H Scheid, and Danmeng Wei, and Micah Weitzman, and Ramya Muthukrishnan, and Joongwon Kim, and Brian Litt, and Colin A Ellis, and Dan Roth
November 2014, Arthritis care & research,
Kevin Xie, and Ryan S Gallagher, and Erin C Conrad, and Chadric O Garrick, and Steven N Baldassano, and John M Bernabei, and Peter D Galer, and Nina J Ghosn, and Adam S Greenblatt, and Tara Jennings, and Alana Kornspun, and Catherine V Kulick-Soper, and Jal M Panchal, and Akash R Pattnaik, and Brittany H Scheid, and Danmeng Wei, and Micah Weitzman, and Ramya Muthukrishnan, and Joongwon Kim, and Brian Litt, and Colin A Ellis, and Dan Roth
January 2024, Studies in health technology and informatics,
Kevin Xie, and Ryan S Gallagher, and Erin C Conrad, and Chadric O Garrick, and Steven N Baldassano, and John M Bernabei, and Peter D Galer, and Nina J Ghosn, and Adam S Greenblatt, and Tara Jennings, and Alana Kornspun, and Catherine V Kulick-Soper, and Jal M Panchal, and Akash R Pattnaik, and Brittany H Scheid, and Danmeng Wei, and Micah Weitzman, and Ramya Muthukrishnan, and Joongwon Kim, and Brian Litt, and Colin A Ellis, and Dan Roth
January 2004, Studies in health technology and informatics,
Copied contents to your clipboard!