Streamlining Quality Review of Mass Spectrometry Data in the Clinical Laboratory by Use of Machine Learning. 2019

Min Yu, and Lindsay A L Bazydlo, and David E Bruns, and James H Harrison
From the Division of Laboratory Medicine, Department of Pathology, University of Virginia School of Medicine and Health System, Charlottesville. Dr Yu is currently located in the Department of Pathology and Laboratory Medicine, University of Kentucky Medical Center, Lexington.

Turnaround time and productivity of clinical mass spectrometric (MS) testing are hampered by time-consuming manual review of the analytical quality of MS data before release of patient results. To determine whether a classification model created by using standard machine learning algorithms can verify analytically acceptable MS results and thereby reduce manual review requirements. We obtained retrospective data from gas chromatography-MS analyses of 11-nor-9-carboxy-delta-9-tetrahydrocannabinol (THC-COOH) in 1267 urine samples. The data for each sample had been labeled previously as either analytically unacceptable or acceptable by manual review. The dataset was randomly split into training and test sets (848 and 419 samples, respectively), maintaining equal proportions of acceptable (90%) and unacceptable (10%) results in each set. We used stratified 10-fold cross-validation in assessing the abilities of 6 supervised machine learning algorithms to distinguish unacceptable from acceptable assay results in the training dataset. The classifier with the highest recall was used to build a final model, and its performance was evaluated against the test dataset. In comparison testing of the 6 classifiers, a model based on the Support Vector Machines algorithm yielded the highest recall and acceptable precision. After optimization, this model correctly identified all unacceptable results in the test dataset (100% recall) with a precision of 81%. Automated data review identified all analytically unacceptable assays in the test dataset, while reducing the manual review requirement by about 87%. This automation strategy can focus manual review only on assays likely to be problematic, allowing improved throughput and turnaround time without reducing quality.

UI MeSH Term Description Entries
D008401 Gas Chromatography-Mass Spectrometry A microanalytical technique combining mass spectrometry and gas chromatography for the qualitative as well as quantitative determinations of compounds. Chromatography, Gas-Liquid-Mass Spectrometry,Chromatography, Gas-Mass Spectrometry,GCMS,Spectrometry, Mass-Gas Chromatography,Spectrum Analysis, Mass-Gas Chromatography,Gas-Liquid Chromatography-Mass Spectrometry,Mass Spectrometry-Gas Chromatography,Chromatography, Gas Liquid Mass Spectrometry,Chromatography, Gas Mass Spectrometry,Chromatography, Mass Spectrometry-Gas,Chromatography-Mass Spectrometry, Gas,Chromatography-Mass Spectrometry, Gas-Liquid,Gas Chromatography Mass Spectrometry,Gas Liquid Chromatography Mass Spectrometry,Mass Spectrometry Gas Chromatography,Spectrometries, Mass-Gas Chromatography,Spectrometry, Gas Chromatography-Mass,Spectrometry, Gas-Liquid Chromatography-Mass,Spectrometry, Mass Gas Chromatography,Spectrometry-Gas Chromatography, Mass,Spectrum Analysis, Mass Gas Chromatography
D012015 Reference Standards A basis of value established for the measure of quantity, weight, extent or quality, e.g. weight standards, standard solutions, methods, techniques, and procedures used in diagnosis and therapy. Standard Preparations,Standards, Reference,Preparations, Standard,Standardization,Standards,Preparation, Standard,Reference Standard,Standard Preparation,Standard, Reference
D006801 Humans Members of the species Homo sapiens. Homo sapiens,Man (Taxonomy),Human,Man, Modern,Modern Man
D000069550 Machine Learning A type of ARTIFICIAL INTELLIGENCE that enable COMPUTERS to independently initiate and execute LEARNING when exposed to new data. Transfer Learning,Learning, Machine,Learning, Transfer
D000465 Algorithms A procedure consisting of a sequence of algebraic formulas and/or logical steps to calculate or determine a given task. Algorithm
D012189 Retrospective Studies Studies used to test etiologic hypotheses in which inferences about an exposure to putative causal factors are derived from data relating to characteristics of persons under study or to events or experiences in their past. The essential feature is that some of the persons under study have the disease or outcome of interest and their characteristics are compared with those of unaffected persons. Retrospective Study,Studies, Retrospective,Study, Retrospective
D013058 Mass Spectrometry An analytical method used in determining the identity of a chemical based on its mass using mass analyzers/mass spectrometers. Mass Spectroscopy,Spectrometry, Mass,Spectroscopy, Mass,Spectrum Analysis, Mass,Analysis, Mass Spectrum,Mass Spectrum Analysis,Analyses, Mass Spectrum,Mass Spectrum Analyses,Spectrum Analyses, Mass
D013759 Dronabinol A psychoactive compound extracted from the resin of Cannabis sativa (marihuana, hashish). The isomer delta-9-tetrahydrocannabinol (THC) is considered the most active form, producing characteristic mood and perceptual changes associated with this compound. THC,Tetrahydrocannabinol,delta(9)-THC,9-ene-Tetrahydrocannabinol,Marinol,Tetrahydrocannabinol, (6a-trans)-Isomer,Tetrahydrocannabinol, (6aR-cis)-Isomer,Tetrahydrocannabinol, (6aS-cis)-Isomer,Tetrahydrocannabinol, Trans-(+-)-Isomer,Tetrahydrocannabinol, Trans-Isomer,delta(1)-THC,delta(1)-Tetrahydrocannabinol,delta(9)-Tetrahydrocannabinol,9 ene Tetrahydrocannabinol,Tetrahydrocannabinol, Trans Isomer
D015203 Reproducibility of Results The statistical reproducibility of measurements (often in a clinical context), including the testing of instrumentation or techniques to obtain reproducible results. The concept includes reproducibility of physiological measurements, which may be used to develop rules to assess probability or prognosis, or response to a stimulus; reproducibility of occurrence of a condition; and reproducibility of experimental results. Reliability and Validity,Reliability of Result,Reproducibility Of Result,Reproducibility of Finding,Validity of Result,Validity of Results,Face Validity,Reliability (Epidemiology),Reliability of Results,Reproducibility of Findings,Test-Retest Reliability,Validity (Epidemiology),Finding Reproducibilities,Finding Reproducibility,Of Result, Reproducibility,Of Results, Reproducibility,Reliabilities, Test-Retest,Reliability, Test-Retest,Result Reliabilities,Result Reliability,Result Validities,Result Validity,Result, Reproducibility Of,Results, Reproducibility Of,Test Retest Reliability,Validity and Reliability,Validity, Face
D057205 Automation, Laboratory Controlled operations of analytic or diagnostic processes, or systems by mechanical or electronic devices. Laboratory Automation

Related Publications

Min Yu, and Lindsay A L Bazydlo, and David E Bruns, and James H Harrison
January 2022, Journal of mass spectrometry and advances in the clinical lab,
Min Yu, and Lindsay A L Bazydlo, and David E Bruns, and James H Harrison
January 2018, Clinical proteomics,
Min Yu, and Lindsay A L Bazydlo, and David E Bruns, and James H Harrison
January 2013, Methods in molecular biology (Clifton, N.J.),
Min Yu, and Lindsay A L Bazydlo, and David E Bruns, and James H Harrison
January 2010, Methods in molecular biology (Clifton, N.J.),
Min Yu, and Lindsay A L Bazydlo, and David E Bruns, and James H Harrison
January 2016, Clinical chemistry,
Min Yu, and Lindsay A L Bazydlo, and David E Bruns, and James H Harrison
May 2020, Mass spectrometry reviews,
Min Yu, and Lindsay A L Bazydlo, and David E Bruns, and James H Harrison
April 2024, Clinical chemistry and laboratory medicine,
Min Yu, and Lindsay A L Bazydlo, and David E Bruns, and James H Harrison
April 2018, Bioanalysis,
Min Yu, and Lindsay A L Bazydlo, and David E Bruns, and James H Harrison
July 2022, Journal of proteome research,
Min Yu, and Lindsay A L Bazydlo, and David E Bruns, and James H Harrison
March 2020, JMIR medical informatics,
Copied contents to your clipboard!