Spurious relationship

Graphical model: Whereas a mediator is a factor in the causal chain (top), a confounder is a spurious factor incorrectly implying causation (bottom)

In statistics, a spurious relationship or spurious correlation[1][2] is a mathematical relationship in which two or more events or variables are associated but not causally related, due to either coincidence or the presence of a certain third, unseen factor (referred to as a "common response variable", "confounding factor", or "lurking variable").

Examples

An example of a spurious relationship can be found in the time-series literature, where a spurious regression is one that provides misleading statistical evidence of a linear relationship between independent non-stationary variables. In fact, the non-stationarity may be due to the presence of a unit root in both variables.[3][4] In particular, any two nominal economic variables are likely to be correlated with each other, even when neither has a causal effect on the other, because each equals a real variable times the price level, and the common presence of the price level in the two data series imparts correlation to them. (See also spurious correlation of ratios.)

Another example of a spurious relationship can be seen by examining a city's ice cream sales. The sales might be highest when the rate of drownings in city swimming pools is highest. To allege that ice cream sales cause drowning, or vice versa, would be to imply a spurious relationship between the two. In reality, a heat wave may have caused both. The heat wave is an example of a hidden or unseen variable, also known as a confounding variable.

Another commonly noted example is a series of Dutch statistics showing a positive correlation between the number of storks nesting in a series of springs and the number of human babies born at that time. Of course there was no causal connection; they were correlated with each other only because of two independent coincidences. During the Pagan era, which can be traced back at least to medieval times more than 600 years ago, it was common for couples to wed during the annual summer solstice, because summer was associated with fertility. At the same time, storks would commence their annual migration, flying all the way from Europe to Africa. The birds would then return the following spring — exactly nine months later.[5]

In rare cases, a spurious relationship can occur between two completely unrelated variables without any confounding variable, as was the case between the success of the Washington Commanders professional football team in a specific game before each presidential election and the success of the incumbent President's political party in said election. For 16 consecutive elections between 1940 and 2000, the Redskins Rule correctly matched whether the incumbent President's political party would retain or lose the Presidency. The rule eventually failed shortly after Elias Sports Bureau discovered the correlation in 2000; in 2004, 2012 and 2016, the results of the Commanders' game and the election did not match.[6][7][8] In a similar spurious relationship involving the National Football League, in the 1970s, Leonard Koppett noted a correlation between the direction of the stock market and the winning conference of that year's Super Bowl, the Super Bowl indicator; the relationship maintained itself for most of the 20th century before reverting to more random behavior in the 21st.[9]

Hypothesis testing

Often one tests a null hypothesis of no correlation between two variables, and chooses in advance to reject the hypothesis if the correlation computed from a data sample would have occurred in less than (say) 5% of data samples if the null hypothesis were true. While a true null hypothesis will be accepted 95% of the time, the other 5% of the times having a true null of no correlation a zero correlation will be wrongly rejected, causing acceptance of a correlation which is spurious (an event known as Type I error). Here the spurious correlation in the sample resulted from random selection of a sample that did not reflect the true properties of the underlying population.

Detecting spurious relationships

The term "spurious relationship" is commonly used in statistics and in particular in experimental research techniques, both of which attempt to understand and predict direct causal relationships (X → Y). A non-causal correlation can be spuriously created by an antecedent which causes both (W → X and W → Y). Mediating variables, (X → M → Y), if undetected, estimate a total effect rather than direct effect without adjustment for the mediating variable M. Because of this, experimentally identified correlations do not represent causal relationships unless spurious relationships can be ruled out.

Experiments

In experiments, spurious relationships can often be identified by controlling for other factors, including those that have been theoretically identified as possible confounding factors. For example, consider a researcher trying to determine whether a new drug kills bacteria; when the researcher applies the drug to a bacterial culture, the bacteria die. But to help in ruling out the presence of a confounding variable, another culture is subjected to conditions that are as nearly identical as possible to those facing the first-mentioned culture, but the second culture is not subjected to the drug. If there is an unseen confounding factor in those conditions, this control culture will die as well, so that no conclusion of efficacy of the drug can be drawn from the results of the first culture. On the other hand, if the control culture does not die, then the researcher cannot reject the hypothesis that the drug is efficacious.

Non-experimental statistical analyses

Disciplines whose data are mostly non-experimental, such as economics, usually employ observational data to establish causal relationships. The body of statistical techniques used in economics is called econometrics. The main statistical method in econometrics is multivariable regression analysis. Typically a linear relationship such as

is hypothesized, in which is the dependent variable (hypothesized to be the caused variable), for j = 1, ..., k is the jth independent variable (hypothesized to be a causative variable), and is the error term (containing the combined effects of all other causative variables, which must be uncorrelated with the included independent variables). If there is reason to believe that none of the s is caused by y, then estimates of the coefficients are obtained. If the null hypothesis that is rejected, then the alternative hypothesis that and equivalently that causes y cannot be rejected. On the other hand, if the null hypothesis that cannot be rejected, then equivalently the hypothesis of no causal effect of on y cannot be rejected. Here the notion of causality is one of contributory causality: If the true value , then a change in will result in a change in y unless some other causative variable(s), either included in the regression or implicit in the error term, change in such a way as to exactly offset its effect; thus a change in is not sufficient to change y. Likewise, a change in is not necessary to change y, because a change in y could be caused by something implicit in the error term (or by some other causative explanatory variable included in the model).

Regression analysis controls for other relevant variables by including them as regressors (explanatory variables). This helps to avoid mistaken inference of causality due to the presence of a third, underlying, variable that influences both the potentially causative variable and the potentially caused variable: its effect on the potentially caused variable is captured by directly including it in the regression, so that effect will not be picked up as a spurious effect of the potentially causative variable of interest. In addition, the use of multivariate regression helps to avoid wrongly inferring that an indirect effect of, say x1 (e.g., x1x2y) is a direct effect (x1y).

Just as an experimenter must be careful to employ an experimental design that controls for every confounding factor, so also must the user of multiple regression be careful to control for all confounding factors by including them among the regressors. If a confounding factor is omitted from the regression, its effect is captured in the error term by default, and if the resulting error term is correlated with one (or more) of the included regressors, then the estimated regression may be biased or inconsistent (see omitted variable bias).

In addition to regression analysis, the data can be examined to determine if Granger causality exists. The presence of Granger causality indicates both that x precedes y, and that x contains unique information about y.

Other relationships

There are several other relationships defined in statistical analysis as follows.

See also

Literature

  • David A. Freedman (1983) A Note on Screening Regression Equations, The American Statistician, 37:2, 152-155, DOI: 10.1080/00031305.1983.10482729

Footnotes

  1. ^ Burns, William C., "Spurious Correlations", 1997.
  2. ^ Pearl, Judea. "UCLA 81st Faculty Research Lecture Series". singapore.cs.ucla.edu. Retrieved 2019-11-10.
  3. ^ Yule, G. Udny (1926-01-01). "Why do we Sometimes get Nonsense-Correlations between Time-Series? A Study in Sampling and the Nature of Time-Series". Journal of the Royal Statistical Society. 89 (1): 1–63. doi:10.2307/2341482. JSTOR 2341482. S2CID 126346450.
  4. ^ Granger, Clive W. J.; Ghysels, Eric; Swanson, Norman R.; Watson, Mark W. (2001). Essays in Econometrics: Collected Papers of Clive W. J. Granger. Cambridge University Press. ISBN 978-0521796491.
  5. ^ Sapsford, Roger; Jupp, Victor, eds. (2006). Data Collection and Analysis. Sage. ISBN 0-7619-4362-5.
  6. ^ Hofheimer, Bill (October 30, 2012). "'Redskins Rule': MNF's Hirdt on intersection of football & politics". ESPN. Retrieved October 16, 2016.
  7. ^ Manker, Rob (November 7, 2012). "Redskins Rule: Barack Obama's victory over Mitt Romney tackles presidential predictor for its first loss". Chicago Tribune. Retrieved November 8, 2012.
  8. ^ Pohl, Robert S. (2013). Urban Legends & Historic Lore of Washington. The History Press. pp. 78–80. ISBN 978-1625846648.[permanent dead link]
  9. ^ Don Peppers. "Big Data. Super Bowl. Small Minds". Retrieved December 31, 2015.

References

Read other articles:

Dangerous World TourMichael Jackson sul palco all'inizio dello show nel 1993Tour di Michael JacksonAlbumDangerous Inizio Monaco di Baviera 27 giugno 1992 Fine Città del Messico 11 novembre 1993 Tappe3 Spettacoli40 in Europa16 in Asia6 in Sud America5 in Nord America2 in Eurasia69 totali Cronologia dei tour di Michael Jackson Bad World Tour(1987-1989) HIStory World Tour(1996-1997) Il Dangerous World Tour è stato il secondo tour mondiale da solista del cantautore e ballerino statunitense Mich...

 

 

Hari Tanpa Kekerasan InternasionalDirayakan olehSemua Negara Anggota PBBTanggal2 OktoberFrekuensitahunan Hari Tanpa Kekerasan Internasional diperingati setiap tanggal 2 Oktober, pada hari ulang tahun Mahatma Gandhi. Hari raya ini disebut Gandhi Jayanti di India. Pada bulan Januari 2004 penerima Nobel dari Iran, Shirin Ebadi, mengajukan proposal tentang peringatan Hari Tanpa Kekerasan Internasional dari seorang guru Hindi di Paris yang mengajar siswa internasional untuk Forum Sosial Dunia di B...

 

 

Artikel atau sebagian dari artikel ini mungkin diterjemahkan dari Raster graphics di en.wikipedia.org. Isinya masih belum akurat, karena bagian yang diterjemahkan masih perlu diperhalus dan disempurnakan. Jika Anda menguasai bahasa aslinya, harap pertimbangkan untuk menelusuri referensinya dan menyempurnakan terjemahan ini. Anda juga dapat ikut bergotong royong pada ProyekWiki Perbaikan Terjemahan. (Pesan ini dapat dihapus jika terjemahan dirasa sudah cukup tepat. Lihat pula: panduan penerjem...

This article is about the LNER service. For the LMS service, see Coronation Scot. The Coronation was a streamlined express passenger train run by the London and North Eastern Railway between London King's Cross and Edinburgh Waverley. Named to mark the coronation of King George VI and Queen Elizabeth, it was inaugurated on 5 July 1937. The down train (northbound) left London at 4 pm and arrived in Edinburgh at 10 pm; the up train (southbound) ran half an hour later.[1] Design The des...

 

 

Road in Iran Road 71جاده 71Route informationPart of AH2 AH70 Length1,244 km (773 mi)Major junctionsFromTehran, Tehran Azadegan ExpresswayMajor intersections Shahre Rey Rigway Northern Behesht-e Zahra Expressway Freeway 7 Road 56 Road 58 Road 62 Road 68 Road 78 Road 84 Road 86 Road 88 Road 92 Road 94 Road 96ToBandarabbas, Hormozgan Road 94 LocationCountryIranProvincesTehran, Qom, Isfahan, Yazd, Kerman, HormozganMajor citiesQom, Qom Kashan, Isfahan Yazd, YazdSirjan, Kerman Hi...

 

 

Chinese time signal organisation National Time Service CenterNTSCAgency overviewFormed1966 (as Shaanxi Astronomy Observatory)JurisdictionChinese Academy of SciencesHeadquartersXi'an, Shaanxi, ChinaEmployees450[1]Agency executiveZHANG Shougang, DirectorWebsitentsc.cas.cn The National Time Service Center (NTSC; Chinese: 国家授时中心) has the task of generating, maintaining and broadcasting standard time in China. It is located in Xi'an, Shaanxi, China. Its predecessor was the ...

Raden Tumenggung Adipati AryaDanoeningrat I ꦢꦤꦸꦤꦶꦤꦔꦿꦠ꧀​꧇꧑Eyang Sedo Ing PrangLukisan RAA. Danoeningrat I dan isteri Bupati Magelang Ke-1Masa jabatan30 November 1813 – 30 September 1825 PendahuluTidak ada, jabatan baruPenggantiDanuningrat II Informasi pribadiLahirSayyid Raden Alwi Basyeiban Yogyakarta, Hindia BelandaMeninggal30 September 1825 Magelang, Hindia BelandaKebangsaan Ngayogyakarta Hadiningrat (vassal Hindia Belanda), kemudian Hindia BelandaSua...

 

 

This article includes a list of references, related reading, or external links, but its sources remain unclear because it lacks inline citations. Please help improve this article by introducing more precise citations. (October 2023) (Learn how and when to remove this template message) Ausbund title page Part of a series onAnabaptismA 1685 illustration by Jan Luyken, published in Martyrs Mirror, of Dirk Willems saving his pursuer, an act of mercy that led to his recapture, after which he was b...

 

 

Oliver SmithiesSmithies at the Universidade de São Paulo in 2008Lahir23 Juni 1925 (umur 98)Halifax, West Yorkshire, InggrisKebangsaanBritania Raya, Amerika SerikatAlmamaterBalliol College, Universitas OxfordDikenal atasGel electrophoresis, gene targetingPenghargaanAlbert Lasker Award for Basic Medical Research (2001)Penghargaan Wolf dalam kedokteran]] (2002) Penghargaan Nobel dalam Fisiologi atau Kedokteran (2007)Karier ilmiahBidangBiokimia, genetikaInstitusiUniversitas TorontoUniversi...

Science museum in Richmond, VirginiaScience Museum of VirginiaEstablished1970Location2500 West Broad Street, Richmond, VirginiaCoordinates37°33′40″N 77°27′57″W / 37.56111°N 77.46583°W / 37.56111; -77.46583TypeScience museumWebsitehttp://www.smv.org/ The Science Museum of Virginia is a science museum located in Richmond, Virginia. Established in 1970, it is an agency of the Commonwealth of Virginia. It is housed in the former Broad Street Station, built in ...

 

 

artikel ini perlu dirapikan agar memenuhi standar Wikipedia. Tidak ada alasan yang diberikan. Silakan kembangkan artikel ini semampu Anda. Merapikan artikel dapat dilakukan dengan wikifikasi atau membagi artikel ke paragraf-paragraf. Jika sudah dirapikan, silakan hapus templat ini. (Pelajari cara dan kapan saatnya untuk menghapus pesan templat ini) Didirikan22 Januari 2001RektorDr. Faisal Alhabib, S.Pd,.M.PdLokasiBengkulu UtaraNama julukanUNRASSitus webunras-bkl.ac.id// Universitas Ratu Samba...

 

 

37th Infantry Division Modena37th Infantry Division Modena insigniaActive1939–1943Country Kingdom of ItalyBranch Royal Italian ArmyTypeInfantrySizeDivisionGarrison/HQSavonaEngagementsWorld War IIItalian invasion of FranceGreco-Italian WarCommandersNotablecommandersGeneral Alessandro GloriaInsigniaIdentificationsymbol Modena Division gorget patchesMilitary unit The 37th Infantry Division Modena (Italian: 37ª Divisione di fanteria Modena) was a infantry division of the Royal Itali...

TodayPoster filmSutradaraReza MirkarimiProduserReza MirkarimiDitulis olehReza MirkarimiShadmehr RastinPemeranParviz ParastuiSoheila GolestaniPenata musikAmin HonarmandSinematograferHooman BehmaneshPenyuntingReza MirkarimiDistributoriFILM ProductionTanggal rilis 7 Februari 2014 (2014-02-07) Durasi90 menitNegaraIranBahasaPersia Today adalah sebuah film drama Iran 2014 yang disutradarai oleh Reza Mirkarimi. Film tersebut ditayangkan dalam sesi Sinema Dunia Kontemporer di Festival Film...

 

 

American college football season 1906 Penn Quakers footballConferenceIndependentRecord7–2–3Head coachCarl S. Williams (5th season)CaptainEdward L. GreeneHome stadiumFranklin FieldSeasons← 19051907 → 1906 Eastern college football independents records vte Conf Overall Team W   L   T W   L   T Princeton   –   9 – 0 – 1 Yale   –   9 – 0 – 1 Haverford   –   7 – 0 &#...

 

 

ذي حجور (محلة) تقسيم إداري البلد  اليمن المحافظة محافظة إب المديرية مديرية السبرة العزلة عزلة عينان القرية قرية عدن محلاق السكان التعداد السكاني 2004 السكان 71   • الذكور 38   • الإناث 33   • عدد الأسر 9   • عدد المساكن 9 معلومات أخرى التوقيت توقيت اليمن (+3 غرينيتش) تع�...

Hydrangea macrophylla Jalan setapak di antara beberapa tanaman bambu. Flora Tiongkok sangat beragam. Lebih dari 30.000 spesies tanaman adalah tumbuhan asli dari Cina, mewakili hampir seperdelapan dari total spesies tanaman dunia, termasuk ribuan spesies yang tidak ditemukan di tempat lain di Bumi. Di Cina terdapat berbagai jenis hutan. Baik di sisi timur laut dan barat laut terdapat gunung dan hutan konifer dingin, spesies hewan yang bisa ditemui meliputi rusa dan beruang hitam Asia, dan terd...

 

 

この記事の出典や参考文献は、一次資料や記事主題の関係者による情報源に頼っています。 信頼できる第三者情報源とされる出典の追加が求められています。出典検索?: GnuCash – ニュース · 書籍 · スカラー · CiNii · J-STAGE · NDL · dlib.jp · ジャパンサーチ · TWL(2019年12月) GnuCash GnuCash 2.2.3 のスクリーンショット開発元 The GnuCash ...

 

 

Basilika Maria MahakudusBasilika Minor Maria Mahakudusbahasa Italia: Basilica pontificia minore di Maria Santissima di CannetoBasilika Maria MahakudusWikipedia | Kode sumber | Tata penggunaan Koordinat: 41°40′41.9″N 13°54′29.9″E / 41.678306°N 13.908306°E / 41.678306; 13.908306LokasiSettefratiNegara ItaliaDenominasiGereja Katolik RomaArsitekturStatusBasilika minorStatus fungsionalAktif Basilika Maria Mahakudus (bahasa Italia: Basilica pontificia...

Early episode in the life of Jesus This article is about the biblical episode. For the liturgical feast, see Candlemas. Presentation at the Temple redirects here. For the apocryphal event in the life of Mary, see Presentation of Mary. For other uses, see Presentation at the Temple (disambiguation). Presentation at the Temple by Ambrogio Lorenzetti, 1342 (Galleria degli Uffizi, Florence) Events in theLife of Jesusaccording to the canonical gospels Early life Annunciation Visitation Joseph's dr...

 

 

イオングループ > イオングループの商業ブランド > ダイエー > ビッグ・エー 株式会社ビッグ・エーBig-a Company inc.種類 株式会社本社所在地 日本〒174-0046東京都板橋区蓮根3-22-25SOSiLA板橋設立 1979年8月20日業種 小売業法人番号 2011401010881 事業内容 生鮮食品・加工食品を中心としたハードディスカウントチェーンストアの運営代表者 代表取締役社長 打海直�...