Statistics uses variables to describe a measurement. Such a variable is called statistically significant if, under a certain status quo assumption (the null hypothesis), the probability of obtaining its outcome (or a more extreme outcome) is less than a given value. Statistical significance is therefore a way of judging how unlikely an experimental result is when the status quo assumption is taken to be true.[1] Statistical hypothesis tests are used to check significance.
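As an illustration of this definition, the following minimal sketch computes the probability of an outcome "or a more extreme outcome" for a simple coin-flip experiment. The scenario (15 heads in 20 flips of a coin assumed to be fair), the function name `one_sided_binomial_p_value` and the cutoff `ALPHA` are assumptions chosen for the example; they do not come from the cited sources.

```python
from math import comb


def one_sided_binomial_p_value(successes: int, trials: int, null_prob: float = 0.5) -> float:
    """Probability of `successes` or more in `trials`, assuming the status quo
    (each trial succeeds with probability `null_prob`)."""
    return sum(
        comb(trials, k) * null_prob**k * (1 - null_prob) ** (trials - k)
        for k in range(successes, trials + 1)
    )


# Hypothetical cutoff for the example, fixed before looking at the data.
ALPHA = 0.05

# Hypothetical experiment: 15 heads in 20 flips of a coin assumed to be fair.
p_value = one_sided_binomial_p_value(successes=15, trials=20)
is_significant = p_value < ALPHA

print(f"p-value = {p_value:.4f}, significant at {ALPHA}: {is_significant}")
# prints "p-value = 0.0207, significant at 0.05: True"
```

Here the result would be called statistically significant at the 0.05 level, because a fair coin produces 15 or more heads in 20 flips only about 2% of the time. This is a one-sided test; a two-sided test, which also counts very low numbers of heads as extreme, would give a larger probability.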
The concept of statistical significance was introduced by Ronald Fisher in his 1925 publication Statistical Methods for Research Workers,[2][3][4] when he developed statistical hypothesis testing (which he described as "tests of significance"). Fisher suggested a probability of one in twenty (0.05 or 5%) as a convenient cutoff level for rejecting the null hypothesis.[5] In their 1933 paper, Jerzy Neyman and Egon Pearson recommended that the significance level (for example 0.05), which they called α, be set before any data collection.[5][6]
Despite his initial suggestion of 0.05 as a significance level, Fisher did not intend this cutoff value to be fixed. In his 1956 publication Statistical Methods and Scientific Inference, he recommended that significance levels be set according to specific circumstances.[5]
Cumming, Geoff (2012). Understanding the new statistics: effect sizes, confidence intervals, and meta-analysis. New York, USA: Routledge. pp. 27–28.
Poletiek, Fenna H. (2001). "Formal theories of testing". Hypothesis-testing behaviour. Essays in Cognitive Psychology. East Sussex, United Kingdom: Psychology Press. pp. 29–48. ISBN 1-841-69159-3.