F-test

An f-test pdf with d1 and d2 = 10, at a significance level of 0.05. (Red shaded region indicates the critical region)

An F-test is any statistical test used to compare the variances of two samples or the ratio of variances between multiple samples. The test statistic, random variable F, is used to determine if the tested data has an F-distribution under the true null hypothesis, and true customary assumptions about the error term (ε).[1] It is most often used when comparing statistical models that have been fitted to a data set, in order to identify the model that best fits the population from which the data were sampled. Exact "F-tests" mainly arise when the models have been fitted to the data using least squares. The name was coined by George W. Snedecor, in honour of Ronald Fisher. Fisher initially developed the statistic as the variance ratio in the 1920s.[2]

Common examples

Common examples of the use of F-tests include the study of the following cases

F-test of the equality of two variances

The F-test is sensitive to non-normality.[3][4] In the analysis of variance (ANOVA), alternative tests include Levene's test, Bartlett's test, and the Brown–Forsythe test. However, when any of these tests are conducted to test the underlying assumption of homoscedasticity (i.e. homogeneity of variance), as a preliminary step to testing for mean effects, there is an increase in the experiment-wise Type I error rate.[5]

Formula and calculation

Most F-tests arise by considering a decomposition of the variability in a collection of data in terms of sums of squares. The test statistic in an F-test is the ratio of two scaled sums of squares reflecting different sources of variability. These sums of squares are constructed so that the statistic tends to be greater when the null hypothesis is not true. In order for the statistic to follow the F-distribution under the null hypothesis, the sums of squares should be statistically independent, and each should follow a scaled χ²-distribution. The latter condition is guaranteed if the data values are independent and normally distributed with a common variance.

One-way analysis of variance

The formula for the one-way ANOVA F-test statistic is

or

The "explained variance", or "between-group variability" is

where denotes the sample mean in the i-th group, is the number of observations in the i-th group, denotes the overall mean of the data, and denotes the number of groups.

The "unexplained variance", or "within-group variability" is

where is the jth observation in the ith out of groups and is the overall sample size. This F-statistic follows the F-distribution with degrees of freedom and under the null hypothesis. The statistic will be large if the between-group variability is large relative to the within-group variability, which is unlikely to happen if the population means of the groups all have the same value.

F Table: Level 5% Critical values, containing degrees of freedoms for both denominator and numerator ranging from 1-20

The result of the F test can be determined by comparing calculated F value and critical F value with specific significance level (e.g. 5%). The F table serves as a reference guide containing critical F values for the distribution of the F-statistic under the assumption of a true null hypothesis. It is designed to help determine the threshold beyond which the F statistic is expected to exceed a controlled percentage of the time (e.g., 5%) when the null hypothesis is accurate. To locate the critical F value in the F table, one needs to utilize the respective degrees of freedom. This involves identifying the appropriate row and column in the F table that corresponds to the significance level being tested (e.g., 5%).[6]

How to use critical F values:

If the F statistic < the critical F value

  • Fail to reject null hypothesis
  • Reject alternative hypothesis
  • There is no significant differences among sample averages
  • The observed differences among sample averages could be reasonably caused by random chance itself
  • The result is not statistically significant

If the F statistic > the critical F value

  • Accept alternative hypothesis
  • Reject null hypothesis
  • There is significant differences among sample averages
  • The observed differences among sample averages could not be reasonably caused by random chance itself
  • The result is statistically significant

Note that when there are only two groups for the one-way ANOVA F-test, where t is the Student's statistic.

Advantages

  • Multi-group Comparison Efficiency: Facilitating simultaneous comparison of multiple groups, enhancing efficiency particularly in situations involving more than two groups.
  • Clarity in Variance Comparison: Offering a straightforward interpretation of variance differences among groups, contributing to a clear understanding of the observed data patterns.
  • Versatility Across Disciplines: Demonstrating broad applicability across diverse fields, including social sciences, natural sciences, and engineering.

Disadvantages

  • Sensitivity to Assumptions: The F-test is highly sensitive to certain assumptions, such as homogeneity of variance and normality which can affect the accuracy of test results.
  • Limited Scope to Group Comparisons: The F-test is tailored for comparing variances between groups, making it less suitable for analyses beyond this specific scope.
  • Interpretation Challenges: The F-test does not pinpoint specific group pairs with distinct variances. Careful interpretation is necessary, and additional post hoc tests are often essential for a more detailed understanding of group-wise differences.

Multiple-comparison ANOVA problems

The F-test in one-way analysis of variance (ANOVA) is used to assess whether the expected values of a quantitative variable within several pre-defined groups differ from each other. For example, suppose that a medical trial compares four treatments. The ANOVA F-test can be used to assess whether any of the treatments are on average superior, or inferior, to the others versus the null hypothesis that all four treatments yield the same mean response. This is an example of an "omnibus" test, meaning that a single test is performed to detect any of several possible differences. Alternatively, we could carry out pairwise tests among the treatments (for instance, in the medical trial example with four treatments we could carry out six tests among pairs of treatments). The advantage of the ANOVA F-test is that we do not need to pre-specify which treatments are to be compared, and we do not need to adjust for making multiple comparisons. The disadvantage of the ANOVA F-test is that if we reject the null hypothesis, we do not know which treatments can be said to be significantly different from the others, nor, if the F-test is performed at level α, can we state that the treatment pair with the greatest mean difference is significantly different at level α.

Regression problems

Consider two models, 1 and 2, where model 1 is 'nested' within model 2. Model 1 is the restricted model, and model 2 is the unrestricted one. That is, model 1 has p1 parameters, and model 2 has p2 parameters, where p1 < p2, and for any choice of parameters in model 1, the same regression curve can be achieved by some choice of the parameters of model 2.

One common context in this regard is that of deciding whether a model fits the data significantly better than does a naive model, in which the only explanatory term is the intercept term, so that all predicted values for the dependent variable are set equal to that variable's sample mean. The naive model is the restricted model, since the coefficients of all potential explanatory variables are restricted to equal zero.

Another common context is deciding whether there is a structural break in the data: here the restricted model uses all data in one regression, while the unrestricted model uses separate regressions for two different subsets of the data. This use of the F-test is known as the Chow test.

The model with more parameters will always be able to fit the data at least as well as the model with fewer parameters. Thus typically model 2 will give a better (i.e. lower error) fit to the data than model 1. But one often wants to determine whether model 2 gives a significantly better fit to the data. One approach to this problem is to use an F-test.

If there are n data points to estimate parameters of both models from, then one can calculate the F statistic, given by

where RSSi is the residual sum of squares of model i. If the regression model has been calculated with weights, then replace RSSi with χ2, the weighted sum of squared residuals. Under the null hypothesis that model 2 does not provide a significantly better fit than model 1, F will have an F distribution, with (p2p1np2) degrees of freedom. The null hypothesis is rejected if the F calculated from the data is greater than the critical value of the F-distribution for some desired false-rejection probability (e.g. 0.05). Since F is a monotone function of the likelihood ratio statistic, the F-test is a likelihood ratio test.

See also

References

  1. ^ a b Berger, Paul D.; Maurer, Robert E.; Celli, Giovana B. (2018). Experimental Design. Cham: Springer International Publishing. p. 108. doi:10.1007/978-3-319-64583-4. ISBN 978-3-319-64582-7.
  2. ^ Lomax, Richard G. (2007). Statistical Concepts: A Second Course. p. 10. ISBN 978-0-8058-5850-1.
  3. ^ Box, G. E. P. (1953). "Non-Normality and Tests on Variances". Biometrika. 40 (3/4): 318–335. doi:10.1093/biomet/40.3-4.318. JSTOR 2333350.
  4. ^ Markowski, Carol A; Markowski, Edward P. (1990). "Conditions for the Effectiveness of a Preliminary Test of Variance". The American Statistician. 44 (4): 322–326. doi:10.2307/2684360. JSTOR 2684360.
  5. ^ Sawilowsky, S. (2002). "Fermat, Schubert, Einstein, and Behrens–Fisher: The Probable Difference Between Two Means When σ12 ≠ σ22". Journal of Modern Applied Statistical Methods. 1 (2): 461–472. doi:10.22237/jmasm/1036109940. Archived from the original on 2015-04-03. Retrieved 2015-03-30.
  6. ^ Siegel, Andrew F. (2016-01-01), Siegel, Andrew F. (ed.), "Chapter 15 - ANOVA: Testing for Differences Among Many Samples and Much More", Practical Business Statistics (Seventh Edition), Academic Press, pp. 469–492, doi:10.1016/b978-0-12-804250-2.00015-8, ISBN 978-0-12-804250-2, retrieved 2023-12-10

Further reading

Read other articles:

Questa voce o sezione sull'argomento mezzi di trasporto non cita le fonti necessarie o quelle presenti sono insufficienti. Puoi migliorare questa voce aggiungendo citazioni da fonti attendibili secondo le linee guida sull'uso delle fonti. Linz: Bombardier Cityrunner n. 009 sulla linea 2 Ginevra: tram snodato Bombardier Cityrunner del TPG n. 873 sulla linea 16 Milano: Bombardier Eurotram Zagato dell'ATM n. 7006 sulla linea 15 Palermo: Flexity Outlook Cityrunner in corso di consegna Strasb...

Music Canada Music Canada (semula Canadian Recording Industry Association (CRIA)) adalah organisasi berbasis di Toronto yang tidak mencari keuntungan dan didirikan pada 9 April 1963 untuk bekerja pada bidang rekaman, artis, manufaktur, produksi, promosi dan distribusik musik di Kanada. Penghargaan Sertifikasi Penghargaan sertifikasi Gold dan Platinum Album Sertifikasi Untuk rilis sebelum 1 Mei, 2008[1] Untuk rilis setelah 1 Mei, 2008[1] Gold 50,000 40,000 Platinum 100,000 80,0...

Coordenadas: 45° 19' N 8° 45' E Cilavegna    Comuna   Localização CilavegnaLocalização de Cilavegna na Itália Coordenadas 45° 19' N 8° 45' E Região Lombardia Província Pavia Características geográficas Área total 17 km² População total 4 927 hab. Densidade 290 hab./km² Altitude 115 m Outros dados Comunas limítrofes Albonese, Borgolavezzaro (NO), Gravellona Lomellina, Parona, Tornaco (NO), Vigevano Código ISTAT 018050 ...

Decauville railway Vigía Chico-Santa CruzMapTechnicalLine length56.75 km (35.26 mi)Track gauge600 mm (1 ft 11+5⁄8 in) Route map Legend 0 km (0 mi) Vigía Chico now in Sian Ka'an 57 km (35 mi) Santa Cruz now F. Carrillo Puerto The Decauville railway Vigía Chico-Santa Cruz (Spanish Ferroaril Decauville de Vigía Chico) was a nearly 57 km (35 mi) long 600 mm (1 ft 11+5⁄8 in) gauge railway line, which was bu...

Organization created under Government of India to regulate the Dental colleges Dental Council of IndiaAbbreviationDCIFormation1948TypeGovernmentPurposeTo regulate dental education in India and to grant Colleges, Universities, and also for registration of dental degree holders and monitoring dental practice.HeadquartersNew DelhiLocationAiwan-E-Galib Marg Kotla Road, Temple Lane New Delhi-110002Official language English and HindiPresidentDr. Dibyendu MazumdarMain organCouncilAffiliationsMinistr...

Computer program that provides a user interface to work with file systems File Manager redirects here. Not to be confused with Windows File Manager. file browser redirects here. Not to be confused with file viewer. This article needs additional citations for verification. Please help improve this article by adding citations to reliable sources. Unsourced material may be challenged and removed.Find sources: File manager – news · newspapers · books · scholar&#...

У Вікіпедії є статті про інші значення цього терміна: Кремінь (значення). «Кремінь-Арена» імені Олега Бабаєва Повна назва Стадіон «Кремінь» Країна  Україна Розташування  Україна, Кременчук, вул. Сержанта Мельничука, 6А Координати 49°04′38″ пн. ш. 33°25′43″ сх. д...

LekkoatletykaBieg na 400 metrów mężczyzn Letnie Igrzyska Olimpijskie 2020 2016 2024 Miejsce  JaponiaTokio Termin 1 sierpnia–5 sierpnia 2021 Liczba ekip 33 Liczba sportowców 48 Obiekt rozgrywek Stadion Narodowy Złoty medal Steven Gardiner Srebrny medal Anthony Jose Zambrano Brązowy medal Kirani James Bieg na 400 metrów mężczyzn był jedną z konkurencji lekkoatletycznych rozgrywanych podczas igrzysk olimpijskich w Tokio. Wystartowało 48 zawodników z 33 krajów. Terminarz Godz...

Genus of palms Andean wax palms Ceroxylon quindiuense Scientific classification Kingdom: Plantae Clade: Tracheophytes Clade: Angiosperms Clade: Monocots Clade: Commelinids Order: Arecales Family: Arecaceae Subfamily: Ceroxyloideae Tribe: Ceroxyleae Genus: CeroxylonBonpl. ex DC. Type species Ceroxylon alpinum Synonyms[1] Klopstockia H.Karst. Beethovenia Engel Ceroxylon is a genus of flowering plants in the family Arecaceae, native to the Andes in Venezuela, Colombia, Ecuador, Peru, and...

2018 Indian filmJohny Johny Yes AppaDirected byG. MarthandanWritten byJoji ThomasProduced byVaishak RajanStarring Kunchacko Boban Sanoop Santhosh Anu Sithara Mamta Mohandas CinematographyVinod IllampallyEdited byLijo PaulMusic byShaan RahmanProductioncompanyVaishaka CynymaDistributed byVaishaka CynymaRelease date 26 October 2018 (2018-10-26) CountryIndiaLanguageMalayalam Johny Johny Yes Appa is a 2018 Indian Malayalam-language family comedy film directed by G. Marthandan, produ...

Upcoming RapidX's Delhi—Meerut RRTS station Anand Vihar RapidX stationGeneral informationLocationAnand Vihar, East Delhi, Delhi IndiaCoordinates28°38′52″N 77°19′04″E / 28.6477198°N 77.3177137°E / 28.6477198; 77.3177137Owned byNCRTCOperated byNCRTCLine(s)Delhi–Meerut RRTSPlatforms(TBC)Tracks(TBC)Connections Anand ViharBlue Line Pink Line Anand Vihar Terminal Rlwy Stn Anand Vihar ISBTConstructionStructure typeUndergroundParking(TBC)Other informationS...

Concept to quantify greenhouse gas emissions from activities or products The carbon footprint can be used to compare the climate change impact of many things. The example given here is the carbon footprint (greenhouse gas emissions) of food across the supply chain caused by land use change, farm, animal feed, processing, transport, retail, packing, losses.[1] A carbon footprint (or greenhouse gas footprint) is a measurement of emissions of carbon dioxide or CO2-equivalent amounts of o...

Artikel atau sebagian dari artikel ini mungkin diterjemahkan dari List of accolades received by Forrest Gump di en.wikipedia.org. Isinya masih belum akurat, karena bagian yang diterjemahkan masih perlu diperhalus dan disempurnakan. Jika Anda menguasai bahasa aslinya, harap pertimbangkan untuk menelusuri referensinya dan menyempurnakan terjemahan ini. Anda juga dapat ikut bergotong royong pada ProyekWiki Perbaikan Terjemahan. (Pesan ini dapat dihapus jika terjemahan dirasa sudah cukup tepat. L...

У этого термина существуют и другие значения, см. The Great Escape. The Great Escape Разработчик Denton Designs Издатель Ocean Software Ltd Дата выпуска 1986 Жанр Аркадная адвенчура Технические данные Платформы ZX SpectrumCommodore 64Amstrad CPCDOS Режим игры однопользовательский Язык английский[1] Носитель Кассет...

2023 soundtrack album by Marcelo Zarvos and Oak FelderWhite Men Can't Jump (Original Soundtrack)Soundtrack album by Marcelo Zarvos and Oak FelderReleasedMay 19, 2023GenreFilm scorefilm soundtrackLength37:56LabelHollywoodProducerMarcelo ZarvosOak FelderMarcelo Zarvos chronology Big George Foreman(2023) White Men Can't Jump(2023) Flamin' Hot(2023) Oak Felder chronology House Party(2023) White Men Can't Jump(2023) White Men Can't Jump (Original Soundtrack) is the soundtrack to the 2023 f...

Pour les articles homonymes, voir Le Voyeur et Peeping Tom. Cet article est une ébauche concernant un film britannique. Vous pouvez partager vos connaissances en l’améliorant (comment ?) selon les conventions filmographiques. Le Voyeur Données clés Titre original Peeping Tom Réalisation Michael Powell Scénario Leo Marks Acteurs principaux Karlheinz BöhmAnna MasseyMaxine AudleyMoira Shearer Sociétés de production Michael Powell (Theatre) Ltd Pays de production Royaume-Uni Genre...

Opioid analgesic FuranylfentanylClinical dataATC codenoneLegal statusLegal status BR: Class F1 (Prohibited narcotics) CA: Schedule I DE: Anlage II (Authorized trade only, not prescriptible) UK: Under Psychoactive Substances Act US: Schedule I UN: Narcotic Schedule I Illegal in Sweden Identifiers IUPAC name N-Phenyl-N-[1-(2-phenylethyl)piperidin-4-yl]furan-2-carboxamide CAS Number101345-66-8PubChem CID13653606ChemSpider14921702UNII3F7C9J1LS7KEGGC22761Chemical...

Railway station in Miaoli, Taiwan This article is about the THSR station. For the transferrable TRA station, see Fengfu railway station. For the TRA station with the same name, see Miaoli railway station. Miaoli苗栗THSR railway stationStation exteriorChinese nameTraditional Chinese苗栗TranscriptionsStandard MandarinHanyu PinyinMiáolìBopomofoㄇㄧㄠˊ ㄌㄧˋHakkaRomanization Měu-lid (Sixian dialect) Miau-lìd (Hailu dialect) Southern MinTâi-lô Biâu-li̍k Miâu-li̍k General ...

American musician Laura Jane GraceGrace performing in 2017BornThomas James Gabel (1980-11-08) November 8, 1980 (age 43)Fort Benning, Georgia, U.S.Spouses Tiffany Danielle Kay ​ ​(m. 2000; div. 2004)​ Heather Hannoura ​ ​(m. 2007; div. 2013)​ Children1Musical careerOriginGainesville, Florida, U.S.GenresPunk rockOccupation(s)SingersongwriterguitaristInstrumentsVocalsguitarbassharmonicaYears acti...

1944 film by Elmer Clifton Gangsters of the FrontierOriginal film posterDirected byElmer CliftonWritten byElmer CliftonProduced byArthur AlexanderAlfred SternStarringSee belowCinematographyRobert E. ClineEdited byCharles Henkel Jr.Distributed byProducers Releasing CorporationRelease date 22 September 1944 (1944-09-22) Running time56 minutesCountryUnited StatesLanguageEnglish Gangsters of the Frontier (also known as Raiders of the Frontier in the United Kingdom) is a 1944 Americ...