Current controversies: Null hypothesis significance testing. 2022

Philip M Sedgwick, and Anne Hammer, and Ulrik Schiøler Kesmodel, and Lars Henning Pedersen
Institute for Medical and Biomedical Education, St George's, University of London, London, UK.

Traditional null hypothesis significance testing (NHST) incorporating the critical level of significance of 0.05 has become the cornerstone of decision-making in health care, and nowhere less so than in obstetric and gynecological research. However, such practice is controversial. In particular, it was never intended for clinical significance to be inferred from statistical significance. The inference of clinical importance based on statistical significance (p < 0.05), and lack of clinical significance otherwise (p ≥ 0.05) represents misunderstanding of the original purpose of NHST. Furthermore, the limitations of NHST-sensitivity to sample size, plus type I and II errors-are frequently ignored. Therefore, decision-making based on NHST has the potential for recurrent false claims about the effectiveness of interventions or importance of exposure to risk factors, or dismissal of important ones. This commentary presents the history behind NHST along with the limitations that modern-day NHST presents, and suggests that a statistics reform regarding NHST be considered.

UI MeSH Term Description Entries
D012107 Research Design A plan for collecting and utilizing data so that desired information can be obtained with sufficient precision or so that an hypothesis can be tested properly. Experimental Design,Data Adjustment,Data Reporting,Design, Experimental,Designs, Experimental,Error Sources,Experimental Designs,Matched Groups,Methodology, Research,Problem Formulation,Research Methodology,Research Proposal,Research Strategy,Research Technics,Research Techniques,Scoring Methods,Adjustment, Data,Adjustments, Data,Data Adjustments,Design, Research,Designs, Research,Error Source,Formulation, Problem,Formulations, Problem,Group, Matched,Groups, Matched,Matched Group,Method, Scoring,Methods, Scoring,Problem Formulations,Proposal, Research,Proposals, Research,Reporting, Data,Research Designs,Research Proposals,Research Strategies,Research Technic,Research Technique,Scoring Method,Source, Error,Sources, Error,Strategies, Research,Strategy, Research,Technic, Research,Technics, Research,Technique, Research,Techniques, Research
D006801 Humans Members of the species Homo sapiens. Homo sapiens,Man (Taxonomy),Human,Man, Modern,Modern Man
D018401 Sample Size The number of units (persons, animals, patients, specified circumstances, etc.) in a population to be studied. The sample size should be big enough to have a high likelihood of detecting a true difference between two groups. (From Wassertheil-Smoller, Biostatistics and Epidemiology, 1990, p95) Sample Sizes,Size, Sample,Sizes, Sample

Related Publications

Philip M Sedgwick, and Anne Hammer, and Ulrik Schiøler Kesmodel, and Lars Henning Pedersen
February 2001, Journal of the American Academy of Child and Adolescent Psychiatry,
Philip M Sedgwick, and Anne Hammer, and Ulrik Schiøler Kesmodel, and Lars Henning Pedersen
September 2016, East Asian archives of psychiatry : official journal of the Hong Kong College of Psychiatrists = Dong Ya jing shen ke xue zhi : Xianggang jing shen ke yi xue yuan qi kan,
Philip M Sedgwick, and Anne Hammer, and Ulrik Schiøler Kesmodel, and Lars Henning Pedersen
March 2021, Clinical spine surgery,
Philip M Sedgwick, and Anne Hammer, and Ulrik Schiøler Kesmodel, and Lars Henning Pedersen
January 2015, F1000Research,
Philip M Sedgwick, and Anne Hammer, and Ulrik Schiøler Kesmodel, and Lars Henning Pedersen
February 2004, Proceedings. Biological sciences,
Philip M Sedgwick, and Anne Hammer, and Ulrik Schiøler Kesmodel, and Lars Henning Pedersen
November 2020, Nurse researcher,
Philip M Sedgwick, and Anne Hammer, and Ulrik Schiøler Kesmodel, and Lars Henning Pedersen
October 2020, Sports biomechanics,
Philip M Sedgwick, and Anne Hammer, and Ulrik Schiøler Kesmodel, and Lars Henning Pedersen
August 2017, Educational and psychological measurement,
Philip M Sedgwick, and Anne Hammer, and Ulrik Schiøler Kesmodel, and Lars Henning Pedersen
March 2011, The American journal of bioethics : AJOB,
Philip M Sedgwick, and Anne Hammer, and Ulrik Schiøler Kesmodel, and Lars Henning Pedersen
September 2022, BMC medical research methodology,
Copied contents to your clipboard!