Statistical method for handling multiple comparisons
In statistics, the false discovery rate (FDR) is a method of conceptualizing the rate of type I errors in null hypothesis testing when conducting multiple comparisons. FDR-controlling procedures are designed to control the FDR, which is the expected proportion of "discoveries" (rejected null hypotheses) that are false (incorrect rejections of the null).[1] Equivalently, the FDR is the expected ratio of the number of false positive classifications (false discoveries) to the total number of positive classifications (rejections of the null). The total number of rejections of the null include both the number of false positives (FP) and true positives (TP). Simply put, FDR = FP / (FP + TP). FDR-controlling procedures provide less stringent control of Type I errors compared to family-wise error rate (FWER) controlling procedures (such as the Bonferroni correction), which control the probability of at least one Type I error. Thus, FDR-controlling procedures have greater power, at the cost of increased numbers of Type I errors.[2]
History
Technological motivations
The modern widespread use of the FDR is believed to stem from, and be motivated by, the development in technologies that allowed the collection and analysis of a large number of distinct variables in several individuals (e.g., the expression level of each of 10,000 different genes in 100 different persons).[3] By the late 1980s and 1990s, the development of "high-throughput" sciences, such as genomics, allowed for rapid data acquisition. This, coupled with the growth in computing power, made it possible to seamlessly perform a very high number of statistical tests on a given data set. The technology of microarrays was a prototypical example, as it enabled thousands of genes to be tested simultaneously for differential expression between two biological conditions.[4]
As high-throughput technologies became common, technological and/or financial constraints led researchers to collect datasets with relatively small sample sizes (e.g. few individuals being tested) and large numbers of variables being measured per sample (e.g. thousands of gene expression levels). In these datasets, too few of the measured variables showed statistical significance after classic correction for multiple tests with standard multiple comparison procedures. This created a need within many scientific communities to abandon FWER and unadjusted multiple hypothesis testing for other ways to highlight and rank in publications those variables showing marked effects across individuals or treatments that would otherwise be dismissed as non-significant after standard correction for multiple tests. In response to this, a variety of error rates have been proposed—and become commonly used in publications—that are less conservative than FWER in flagging possibly noteworthy observations. The FDR is useful when researchers are looking for "discoveries" that will give them followup work (E.g.: detecting promising genes for followup studies), and are interested in controlling the proportion of "false leads" they are willing to accept.
Literature
The FDR concept was formally described by Yoav Benjamini and Yosef Hochberg in 1995[1] (BH procedure) as a less conservative and arguably more appropriate approach for identifying the important few from the trivial many effects tested. The FDR has been particularly influential, as it was the first alternative to the FWER to gain broad acceptance in many scientific fields (especially in the life sciences, from genetics to biochemistry, oncology and plant sciences).[3] In 2005, the Benjamini and Hochberg paper from 1995 was identified as one of the 25 most-cited statistical papers.[5]
Prior to the 1995 introduction of the FDR concept, various precursor ideas had been considered in the statistics literature. In 1979, Holm proposed the Holm procedure,[6] a stepwise algorithm for controlling the FWER that is at least as powerful as the well-known Bonferroni adjustment. This stepwise algorithm sorts the p-values and sequentially rejects the hypotheses starting from the smallest p-values.
Benjamini (2010) said that the false discovery rate,[3] and the paper Benjamini and Hochberg (1995), had its origins in two papers concerned with multiple testing:
The first paper is by Schweder and Spjotvoll (1982) who suggested plotting the ranked p-values and assessing the number of true null hypotheses () via an eye-fitted line starting from the largest p-values.[7] The p-values that deviate from this straight line then should correspond to the false null hypotheses. This idea was later developed into an algorithm and incorporated the estimation of into procedures such as Bonferroni, Holm or Hochberg.[8] This idea is closely related to the graphical interpretation of the BH procedure.
The second paper is by Branko Soric (1989) which introduced the terminology of "discovery" in the multiple hypothesis testing context.[9] Soric used the expected number of false discoveries divided by the number of discoveries as a warning that "a large part of statistical discoveries may be wrong". This led Benjamini and Hochberg to the idea that a similar error rate, rather than being merely a warning, can serve as a worthy goal to control.
The BH procedure was proven to control the FDR for independent tests in 1995 by Benjamini and Hochberg.[1] In 1986, R. J. Simes offered the same procedure as the "Simes procedure", in order to control the FWER in the weak sense (under the intersection null hypothesis) when the statistics are independent.[10]
Definitions
Based on definitions below we can define Q as the proportion of false discoveries among the discoveries (rejections of the null hypothesis):
where is the number of false discoveries and is the number of true discoveries.
The false discovery rate (FDR) is then simply the following:[1]
where is the expected value of . The goal is to keep FDR below a given threshold q. To avoid division by zero, is defined to be 0 when . Formally, .[1]
The following table defines the possible outcomes when testing multiple null hypotheses.
Suppose we have a number m of null hypotheses, denoted by: H1, H2, ..., Hm.
Using a statistical test, we reject the null hypothesis if the test is declared significant. We do not reject the null hypothesis if the test is non-significant.
Summing each type of outcome over all Hi yields the following random variables:
The settings for many procedures is such that we have null hypotheses tested and their corresponding p-values. We list these p-values in ascending order and denote them by . A procedure that goes from a small test-statistic to a large one will be called a step-up procedure. In a similar way, in a "step-down" procedure we move from a large corresponding test statistic to a smaller one.
Benjamini–Hochberg procedure
The Benjamini–Hochberg procedure (BH step-up procedure) controls the FDR at level .[1] It works as follows:
For a given , find the largest k such that
Reject the null hypothesis (i.e., declare discoveries) for all for
Geometrically, this corresponds to plotting vs. k (on the y and x axes respectively), drawing the line through the origin with slope , and declaring discoveries for all points on the left, up to, and including the last point that is not above the line.
The BH procedure is valid when the m tests are independent, and also in various scenarios of dependence, but is not universally valid.[11] It also satisfies the inequality:
If an estimator of is inserted into the BH procedure, it is no longer guaranteed to achieve FDR control at the desired level.[3] Adjustments may be needed in the estimator and several modifications have been proposed.[12][13][14][15]
Note that the mean for these m tests is , the Mean(FDR ) or MFDR, adjusted for m independent or positively correlated tests (see AFDR below). The MFDR expression here is for a single recomputed value of and is not part of the Benjamini and Hochberg method.
Benjamini–Yekutieli procedure
The Benjamini–Yekutieli procedure controls the false discovery rate under arbitrary dependence assumptions.[11] This refinement modifies the threshold and finds the largest k such that:
If the tests are independent or positively correlated (as in Benjamini–Hochberg procedure):
Using MFDR and formulas above, an adjusted MFDR (or AFDR) is the minimum of the mean for m dependent tests, i.e., .
Another way to address dependence is by bootstrapping and rerandomization.[4][16][17]
Storey-Tibshirani procedure
In the Storey-Tibshirani procedure, q-values are used for controlling the FDR.
Properties
Adaptive and scalable
Using a multiplicity procedure that controls the FDR criterion is adaptive and scalable. Meaning that controlling the FDR can be very permissive (if the data justify it), or conservative (acting close to control of FWER for sparse problem) - all depending on the number of hypotheses tested and the level of significance.[3]
The FDR criterion adapts so that the same number of false discoveries (V) will have different implications, depending on the total number of discoveries (R). This contrasts with the family-wise error rate criterion. For example, if inspecting 100 hypotheses (say, 100 genetic mutations or SNPs for association with some phenotype in some population):
If we make 4 discoveries (R), having 2 of them be false discoveries (V) is often very costly. Whereas,
If we make 50 discoveries (R), having 2 of them be false discoveries (V) is often not very costly.
The FDR criterion is scalable in that the same proportion of false discoveries out of the total number of discoveries (Q), remains sensible for different number of total discoveries (R). For example:
If we make 100 discoveries (R), having 5 of them be false discoveries () may not be very costly.
Similarly, if we make 1000 discoveries (R), having 50 of them be false discoveries (as before, ) may still not be very costly.
Dependency among the test statistics
Controlling the FDR using the linear step-up BH procedure, at level q, has several properties related to the dependency structure between the test statistics of the m null hypotheses that are being corrected for. If the test statistics are:
If all of the null hypotheses are true (), then controlling the FDR at level q guarantees control over the FWER (this is also called "weak control of the FWER"): , simply because the event of rejecting at least one true null hypothesis is exactly the event , and the event is exactly the event (when , by definition).[1] But if there are some true discoveries to be made () then FWER ≥ FDR. In that case there will be room for improving detection power. It also means that any procedure that controls the FWER will also control the FDR.
Average power
The average power of the Benjamini-Hochberg procedure can be computed analytically[18]
Related concepts
The discovery of the FDR was preceded and followed by many other types of error rates. These include:
PCER (per-comparison error rate) is defined as: . Testing individually each hypothesis at level α guarantees that (this is testing without any correction for multiplicity)
(The tail probability of the False Discovery Proportion), suggested by Lehmann and Romano, van der Laan at al, [citation needed] is defined as: .
(also called the generalized FDR by Sarkar in 2007[19][20]) is defined as: .
is the proportion of false discoveries among the discoveries", suggested by Soric in 1989,[9] and is defined as: . This is a mixture of expectations and realizations, and has the problem of control for .[1]
(or Fdr) was used by Benjamini and Hochberg,[3] and later called "Fdr" by Efron (2008) and earlier.[21] It is defined as: . This error rate cannot be strictly controlled because it is 1 when .
was used by Benjamini and Hochberg,[3] and later called "pFDR" by Storey (2002).[22] It is defined as: . This error rate cannot be strictly controlled because it is 1 when . JD Storey promoted the use of the pFDR (a close relative of the FDR), and the q-value, which can be viewed as the proportion of false discoveries that we expect in an ordered table of results, up to the current line.[citation needed] Storey also promoted the idea (also mentioned by BH) that the actual number of null hypotheses, , can be estimated from the shape of the probability distribution curve. For example, in a set of data where all null hypotheses are true, 50% of results will yield probabilities between 0.5 and 1.0 (and the other 50% will yield probabilities between 0.0 and 0.5). We can therefore estimate by finding the number of results with and doubling it, and this permits refinement of our calculation of the pFDR at any particular cut-off in the data-set.[22]
False exceedance rate (the tail probability of FDP), defined as:[23]
(Weighted FDR). Associated with each hypothesis i is a weight , the weights capture importance/price. The W-FDR is defined as: .
FDCR (False Discovery Cost Rate). Stemming from statistical process control: associated with each hypothesis i is a cost and with the intersection hypothesis a cost . The motivation is that stopping a production process may incur a fixed cost. It is defined as:
PFER (per-family error rate) is defined as: .
FNR (False non-discovery rates) by Sarkar; Genovese and Wasserman [citation needed] is defined as:
The false coverage rate (FCR) is, in a sense, the FDR analog to the confidence interval. FCR indicates the average rate of false coverage, namely, not covering the true parameters, among the selected intervals. The FCR gives a simultaneous coverage at a level for all of the parameters considered in the problem. Intervals with simultaneous coverage probability 1−q can control the FCR to be bounded by q. There are many FCR procedures such as: Bonferroni-Selected–Bonferroni-Adjusted,[citation needed] Adjusted BH-Selected CIs (Benjamini and Yekutieli (2005)),[24] Bayes FCR (Yekutieli (2008)),[citation needed] and other Bayes methods.[25]
Bayesian approaches
Connections have been made between the FDR and Bayesian approaches (including empirical Bayes methods),[21][26][27] thresholding wavelets coefficients and model selection,[28][29][30][31][32] and generalizing the confidence interval into the false coverage statement rate (FCR).[24]
^Holm S (1979). "A simple sequentially rejective multiple test procedure". Scandinavian Journal of Statistics. 6 (2): 65–70. JSTOR4615733. MR0538597.
^Schweder T, Spjøtvoll E (1982). "Plots of P-values to evaluate many tests simultaneously". Biometrika. 69 (3): 493–502. doi:10.1093/biomet/69.3.493.
^Hochberg Y, Benjamini Y (July 1990). "More powerful procedures for multiple significance testing". Statistics in Medicine. 9 (7): 811–8. doi:10.1002/sim.4780090710. PMID2218183.
^ abSoric B (June 1989). "Statistical "Discoveries" and Effect-Size Estimation". Journal of the American Statistical Association. 84 (406): 608–610. doi:10.1080/01621459.1989.10478811. JSTOR2289950.
^Simes RJ (1986). "An improved Bonferroni procedure for multiple tests of significance". Biometrika. 73 (3): 751–754. doi:10.1093/biomet/73.3.751.
^Benjamini Y, Krieger AM, Yekutieli D (2006). "Adaptive linear step-up procedures that control the false discovery rate". Biometrika. 93 (3): 491–507. doi:10.1093/biomet/93.3.491.
^Gavrilov Y, Benjamini Y, Sarkar SK (2009). "An adaptive step-down procedure with proven FDR control under independence". The Annals of Statistics. 37 (2): 619. arXiv:0903.5373. doi:10.1214/07-AOS586. S2CID16913244.
^Yekutieli D, Benjamini Y (1999). "Resampling based False Discovery Rate controlling procedure for dependent test statistics". J. Statist. Planng Inf. 82 (1–2): 171–196. doi:10.1016/S0378-3758(99)00041-5.
^van der Laan MJ, Dudoit S (2007). Multiple Testing Procedures with Applications to Genomics. New York: Springer.
^Benjamini Y (December 2010). "Simultaneous and selective inference: Current successes and future challenges". Biometrical Journal. Biometrische Zeitschrift. 52 (6): 708–21. doi:10.1002/bimj.200900299. PMID21154895. S2CID8806192.
^ abBenjamini Y, Yekutieli Y (2005). "False discovery rate controlling confidence intervals for selected parameters". Journal of the American Statistical Association. 100 (469): 71–80. doi:10.1198/016214504000001907. S2CID23202143.
^Stoica P, Babu P (2022). "False discovery rate (FDR) and familywise error rate (FER) rules for model selection in signal processing applications". IEEE Open Journal of Signal Processing. 3 (1): 403–416. doi:10.1109/OJSP.2022.3213128.
Rumah Pengasingan Bung Karno merupakan tempat Soekarno menjalani hukuman pengasingan sebagai tahanan politik.[1] Soekarno diasingkan ke Ende, Flores pada 14 Januari 1934. Ia diasingkan di sana selama empat tahun (1934-1938).[1] Setelah itu, ia diasingkan ke Bengkulu.[1] Rumah pengasingan di Bengkulu Rumah Bekas Kediaman Bung Karno Di BengkuluNama sebagaimana tercantum dalamSistem Registrasi Nasional Cagar BudayaBerkas:Rumah pengasingan bung karno.jpgRumah Pengasingan B...
American fine art and collectibles auction house Heritage AuctionsCompany typePrivateFounded1976; 48 years ago (1976) in Dallas, Texas, U.S.FounderSteve Ivy (founder and CEO)Jim Halperin(co-founder)HeadquartersDallas, Texas, U.S.ProductsAntiques and collectiblesServicesAuctioneerWebsiteha.com Heritage Auctions is an American multi-national auction house based in Dallas, Texas. Founded in 1976, Heritage is an auctioneer of numismatic collections, comics, fine art, books, luxu...
Pour les articles homonymes, voir Gainsbourg et Ginsburg. Cet article possède un paronyme, voir Gainsborough. Serge GainsbourgSerge Gainsbourg en 1981.BiographieNaissance 2 avril 19284e arrondissement de Paris (France)Décès 2 mars 1991 (à 62 ans)7e arrondissement de Paris (France)Sépulture Cimetière du MontparnasseNom de naissance Lucien GinsburgSurnoms Gainsbarre, Julien Gris, Julien Grix, L'homme à tête de chouNationalité françaiseFormation Lycée CondorcetÉcole normale de ...
Los Angeles LakersPallacanestro Segni distintivi Uniformi di gara Casa Trasferta Terza divisa Colori sociali Oro, Viola, Nero[1][2] Dati societari Città Los Angeles (CA) Nazione Stati Uniti Campionato NBA Conference Western Conference Division Pacific Division Fondazione 1946 e 1947 Denominazione Detroit Gems (NBL)1946-1947Minneapolis Lakers (NBL)1947-1948Minneapolis Lakers (BAA)1948-1949Minneapolis Lakers (NBA)1948-196...
Legal doctrine in some common-law jurisdictions Felony murder redirects here. For the general felony of murder in some jurisdictions, see Murder. Not to be confused with Malice murder. Part of a series onHomicide Murder Note: Varies by jurisdiction Assassination Child murder Consensual homicide Contract killing Crime of passion Depraved-heart murder Felony murder rule Foeticide Honor killing Human cannibalism Child cannibalism Human sacrifice Child sacrifice Internet homicide Lonely hearts ki...
2019 soundtrack by Hans Zimmer The Lion King: Original Motion Picture SoundtrackSoundtrack album by Hans Zimmer and various artistsReleasedJuly 11, 2019 (2019-07-11)Length77:32LabelWalt DisneyProducer Pharrell Williams Hans Zimmer (exec.) Elton John Greg Kurstin Elton John chronology Rocketman: Music from the Motion Picture(2019) The Lion King: Original Motion Picture Soundtrack(2019) Mufasa: The Lion King (Original Motion Picture Soundtrack)(2024) Singles from The Lion Kin...
Chinese actress The native form of this personal name is Chen Fala. This article uses Western name order when mentioning individuals. Fala ChenChen in 2010 at a promotional event for Ghost WriterBorn (1982-02-24) 24 February 1982 (age 42)Chengdu, Sichuan, ChinaEducationEmory University (BA)Juilliard School (MFA)Occupation(s)Actress,[1] Singer[1]Years active2005–presentSpouses Daniel Sit (m. 2008; div. 2013) Em...
1950 Japanese House of Councillors election ← 1947 4 June 1950 1953 → 132 of the 250 seats in the House of Councillors126 seats needed for a majority First party Second party Third party Leader Shigeru Yoshida Tetsu Katayama Party Liberal Socialist Ryokufūkai Seats after 76 61 50 Popular vote 8,313,756 4,854,629 3,660,391 Percentage 29.70% 17.34% 13.08% Fourth party Fifth party Sixth party Leader Tomabechi Gizō Hisao Kuroda Kyuichi...
Si ce bandeau n'est plus pertinent, retirez-le. Cliquez ici pour en savoir plus. Cet article ne cite pas suffisamment ses sources (novembre 2012). Si vous disposez d'ouvrages ou d'articles de référence ou si vous connaissez des sites web de qualité traitant du thème abordé ici, merci de compléter l'article en donnant les références utiles à sa vérifiabilité et en les liant à la section « Notes et références ». En pratique : Quelles sources sont attendues ? ...
Zoo in Portland, Oregon, United States Oregon ZooMain entrance in January 2024.45°30′30″N 122°42′53″W / 45.50833°N 122.71472°W / 45.50833; -122.71472Date opened1888; 136 years ago (1888)LocationWashington Park, Portland, Oregon, United StatesLand area64 acres (26 ha)[1]No. of animals1,800[1]No. of species232[1]Annual visitors1.7 million[2]MembershipsAZA[3] WAZA[4]Major exhibitsThe Great ...
Guy BerrymanInformasi latar belakangNama lahirGuy Rupert BerrymanLahir12 April 1978 (umur 46)GenreRock AlternatifPekerjaanBassist, penulis lagu, produser rekaman, Fotografer, Penjual barang antikInstrumenbass guitar, double bass, cello, acoustic guitar, electric guitar, vocals, mandolin, piano, keyboards, drums, trumpet, french horn, santoor, percussionTahun aktif1995–sekarangArtis terkaitColdplay, Apparatjik Guy Rupert Berryman (Lahir 12 April 1978, di Kirkcaldy, Skotlandia) adalah se...
Women's high jump at the 2023 World ChampionshipsVenueNational Athletics CentreDates25 August (qualification)27 August (final)Winning height2.01Medalists Yaroslava Mahuchikh Ukraine Eleanor Patterson Australia Nicola Olyslagers Australia← 20222025 → Events at the2023 World ChampionshipsTrack events100 mmenwomen200 mmenwomen400 mmenwomen800 mmenwomen1500 mmenwomen5000 mmenwomen10,000 mmenwomen100 m hurdleswomen110 ...
El funcionalismo estructuralista es una construcción teórica que ve a la sociedad como un sistema complejo, cuyas partes trabajan juntas para promover la armonía social. Se entiende como el estudio de una sociedad conocida como estructura o sistema social.[1] Este enfoque ve a la sociedad desde una orientación de nivel macro, que es un enfoque amplio en las estructuras sociales que conforman la sociedad en su conjunto y considera que la sociedad evoluciona al igual que los organism...
Portable apparatus to recycle breathing gas For rebreathers used in underwater diving, see Diving rebreather. For diving with a rebreather, see Rebreather diving. For breathing gas recycling in a habitat, see Life-support system. For surface recycling of breathing gas recovered from a diver, see Gas reclaim system. RebreatherA fully closed circuit electronic rebreather (AP Diving Inspiration)AcronymCCUBA (closed circuit underwater breathing apparatus); CCR (closed circuit rebreather), SCR (se...
Emplacement de l'amphithéâtre morainique d’Ivrée Photo satellite, le débouché de la vallée d'Aoste sur la plaine du Pô L'amphithéâtre morainique d’Ivrée (AMI) est un relief de moraine d’origine glaciaire situé dans le Canavais, intéressant administrativement la province de Turin et plus marginalement, celles de Biella et de Verceil, région du Piémont en Italie. L'AMI remonte à la période du quaternaire et fut créé par le transport de sédiments vers la plaine du Pô au...
Chain of icecream shops This article needs additional citations for verification. Please help improve this article by adding citations to reliable sources. Unsourced material may be challenged and removed.Find sources: MaggieMoo's Ice Cream and Treatery – news · newspapers · books · scholar · JSTOR (January 2021) (Learn how and when to remove this message) MaggieMoo's Ice Cream and TreateryCompany typeSubsidiaryIndustryRestaurantsGenreIce cream shopFou...
Жан Виктор Дюрюифр. Victor Duruy министр по делам образования и религии Франции[вд] 1863 — 17 июля 1869 Предшественник Gustave Rouland[вд] Преемник Louis Olivier Bourbeau[вд] seat 20 of the Académie française[вд] 4 декабря 1884 — 25 ноября 1894 Предшественник Франсуа Минье Преемник Леметр, Франсуа Эли Жюль Рожде...