Homogeneity and heterogeneity (statistics)

In statistics, homogeneity and its opposite, heterogeneity, arise in describing the properties of a dataset, or several datasets. They relate to the validity of the often convenient assumption that the statistical properties of any one part of an overall dataset are the same as any other part. In meta-analysis, which combines the data from several studies, homogeneity measures the differences or similarities between the several studies (see also Study heterogeneity).

Homogeneity can be studied to several degrees of complexity. For example, considerations of homoscedasticity examine how much the variability of data-values changes throughout a dataset. However, questions of homogeneity apply to all aspects of the statistical distributions, including the location parameter. Thus, a more detailed study would examine changes to the whole of the marginal distribution. An intermediate-level study might move from looking at the variability to studying changes in the skewness. In addition to these, questions of homogeneity apply also to the joint distributions.

The concept of homogeneity can be applied in many different ways and, for certain types of statistical analysis, it is used to look for further properties that might need to be treated as varying within a dataset once some initial types of non-homogeneity have been dealt with.

Of variance

Plot with random data showing homoscedasticity: at each value of x, the y-value of the dots has about the same variance.
Plot with random data showing heteroscedasticity: The variance of the y-values of the dots increases with increasing values of x.

In statistics, a sequence of random variables is homoscedastic (/ˌhmskəˈdæstɪk/) if all its random variables have the same finite variance; this is also known as homogeneity of variance. The complementary notion is called heteroscedasticity, also known as heterogeneity of variance. The spellings homoskedasticity and heteroskedasticity are also frequently used. “Skedasticity” comes from the Ancient Greek word “skedánnymi”, meaning “to scatter”.[1][2][3] Assuming a variable is homoscedastic when in reality it is heteroscedastic (/ˌhɛtərskəˈdæstɪk/) results in unbiased but inefficient point estimates and in biased estimates of standard errors, and may result in overestimating the goodness of fit as measured by the Pearson coefficient.

The existence of heteroscedasticity is a major concern in regression analysis and the analysis of variance, as it invalidates statistical tests of significance that assume that the modelling errors all have the same variance. While the ordinary least squares estimator is still unbiased in the presence of heteroscedasticity, it is inefficient and inference based on the assumption of homoskedasticity is misleading. In that case, generalized least squares (GLS) was frequently used in the past.[4][5] Nowadays, standard practice in econometrics is to include Heteroskedasticity-consistent standard errors instead of using GLS, as GLS can exhibit strong bias in small samples if the actual skedastic function is unknown.[6]

Because heteroscedasticity concerns expectations of the second moment of the errors, its presence is referred to as misspecification of the second order.[7]

The econometrician Robert Engle was awarded the 2003 Nobel Memorial Prize for Economics for his studies on regression analysis in the presence of heteroscedasticity, which led to his formulation of the autoregressive conditional heteroscedasticity (ARCH) modeling technique.[8]

Examples

Regression

Differences in the typical values across the dataset might initially be dealt with by constructing a regression model using certain explanatory variables to relate variations in the typical value to known quantities. There should then be a later stage of analysis to examine whether the errors in the predictions from the regression behave in the same way across the dataset. Thus the question becomes one of the homogeneity of the distribution of the residuals, as the explanatory variables change. See regression analysis.

Time series

The initial stages in the analysis of a time series may involve plotting values against time to examine homogeneity of the series in various ways: stability across time as opposed to a trend; stability of local fluctuations over time.

Combining information across sites

In hydrology, data-series across a number of sites composed of annual values of the within-year annual maximum river-flow are analysed. A common model is that the distributions of these values are the same for all sites apart from a simple scaling factor, so that the location and scale are linked in a simple way. There can then be questions of examining the homogeneity across sites of the distribution of the scaled values.

Combining information sources

In meteorology, weather datasets are acquired over many years of record and, as part of this, measurements at certain stations may cease occasionally while, at around the same time, measurements may start at nearby locations. There are then questions as to whether, if the records are combined to form a single longer set of records, those records can be considered homogeneous over time. An example of homogeneity testing of wind speed and direction data can be found in Romanić et al., 2015.[9]

Homogeneity within populations

Simple populations surveys may start from the idea that responses will be homogeneous across the whole of a population. Assessing the homogeneity of the population would involve looking to see whether the responses of certain identifiable subpopulations differ from those of others. For example, car-owners may differ from non-car-owners, or there may be differences between different age-groups.

Tests

A test for homogeneity, in the sense of exact equivalence of statistical distributions, can be based on an E-statistic. A location test tests the simpler hypothesis that distributions have the same location parameter.

See also

References

  1. ^ For the Greek etymology of the term, see McCulloch, J. Huston (1985). "On Heteros*edasticity". Econometrica. 53 (2): 483. JSTOR 1911250.
  2. ^ White, Halbert (1980). "A heteroskedasticity-consistent covariance matrix estimator and a direct test for heteroskedasticity". Econometrica. 48 (4): 817–838. CiteSeerX 10.1.1.11.7646. doi:10.2307/1912934. JSTOR 1912934.
  3. ^ Gujarati, D. N.; Porter, D. C. (2009). Basic Econometrics (Fifth ed.). Boston: McGraw-Hill Irwin. p. 400. ISBN 9780073375779.
  4. ^ Goldberger, Arthur S. (1964). Econometric Theory. New York: John Wiley & Sons. pp. 238–243. ISBN 9780471311010.
  5. ^ Johnston, J. (1972). Econometric Methods. New York: McGraw-Hill. pp. 214–221.
  6. ^ Angrist, Joshua D.; Pischke, Jörn-Steffen (2009-12-31). Mostly Harmless Econometrics: An Empiricist's Companion. Princeton University Press. doi:10.1515/9781400829828. ISBN 978-1-4008-2982-8.
  7. ^ Long, J. Scott; Trivedi, Pravin K. (1993). "Some Specification Tests for the Linear Regression Model". In Bollen, Kenneth A.; Long, J. Scott (eds.). Testing Structural Equation Models. London: Sage. pp. 66–110. ISBN 978-0-8039-4506-7.
  8. ^ Engle, Robert F. (July 1982). "Autoregressive Conditional Heteroscedasticity with Estimates of the Variance of United Kingdom Inflation". Econometrica. 50 (4): 987–1007. doi:10.2307/1912773. ISSN 0012-9682. JSTOR 1912773.
  9. ^ Romanić D. Ćurić M- Jovičić I. Lompar M. 2015. Long-term trends of the ‘Koshava’ wind during the period 1949–2010. International Journal of Climatology 35(2):288-302. DOI:10.1002/joc.3981.

Further reading

  • Hall, M.J. (2003) The interpretation of non-homogeneous hydrometeorological time series a case study. Meteorological Applications, 10, 61–67. doi:10.1017/S1350482703005061
  • Krus, D.J., & Blackman, H.S. (1988).Test reliability and homogeneity from perspective of the ordinal test theory. Applied Measurement in Education, 1, 79–88 (Request reprint).[usurped]
  • Loevinger, J. (1948). The technic of homogeneous tests compared with some aspects of scale analysis and factor analysis. Psychological Bulletin, 45, 507–529.

Read other articles:

TrappesNegaraPrancisArondisemenVersaillesAntarkomuneCommunautéd'agglomérationde Saint-Quentin-en-Yvelines Trappes merupakan sebuah komune di pinggiran barat Paris, Prancis. Terletak 26.7 km (16.6 mil) dari pusat kota Paris, di kota baru Saint-Quentin-en-Yvelines. Angkutan Trappes dilayani oleh stasiun Trappes pada jalur rel pinggiran Transilien La Défense dan Transilien Paris – Montparnasse. Demografi Perubahan jumlah penduduk Orang terkenal yang lahir di Trappes Nicolas Anelka, pem...

 

 

Untuk kegunaan lain, lihat Bromo (disambiguasi). BromoGenre Drama Laga Fantasi PembuatMD EntertainmentDitulis olehArya DewataSutradara Anto Agam Manish Sharma Pemeran Randy Pangalila Anggika Bolsterli Armando Jordy Anjani Dina Elscant Wifesa Donny Michael Yati Surachman Dicky Wahyudi Lulu Kurnia Sulton Klein Ali Fikry Setiawan Min Penggubah lagu temaRichard GondoLagu pembukaNirwana Cinta oleh Richard GondoLagu penutupNirwana Cinta oleh Richard GondoNegara asalIndonesiaBahasa asliBahasa Indone...

 

 

Seorang dewata didampingi dua apsara yang tengah memegang kamandalu (kendi) dan bunga utpala (teratai biru), candi Wishnu Prambanan Para Dewata Lokapala penjaga penjuru mata angin di dinding candi Shiwa Prambanan Dewa adalah istilah dalam ajaran Hindu dan Buddha untuk keberadaan spiritual yang memiliki kekuatan supranatural; sedangkan Dewata (Dewanagari: देवता; ,IAST: Devatā, देवता), atau disebut Tevoda (ទេវតា dalam bahasa Khmer), adalah dewa dengan ke...

Guamaggiore GomajòriKomuneComune di GuamaggioreLokasi Guamaggiore di Provinsi Sardinia SelatanNegara ItaliaWilayah SardiniaProvinsiSardinia Selatan (SU)Pemerintahan • Wali kotaAntonio CappaiLuas • Total16,80 km2 (6,49 sq mi)Ketinggian199 m (653 ft)Populasi (2016) • Total984[1]Zona waktuUTC+1 (CET) • Musim panas (DST)UTC+2 (CEST)Kode pos09040Kode area telepon070Situs webhttp://www.comune.guamaggiore.ca....

 

 

المسيحية في هونغ كونغكاتدرائية الحبل بلا دنس الكاثوليكيَّة في هونغ كونغمناطق الوجود المميزةالبلد الصين  هونغ كونغ ‏ 833,000 (تقديرات 2010)اللغات اللغة الصينيةالدين الغالبية تنتمي إلى الكنائس البروتستانتية والكنيسة الرومانية الكاثوليكيةالمجموعات العرقية المرتبطةفرع م�...

 

 

1994 video gameAggressors of Dark KombatDeveloper(s)ADK[a]Publisher(s) ADK Neo Geo CDJP: ADKNA: SNK Director(s)Takashi HatonoProducer(s)Kazuo AraiDesigner(s)Tsutomu MaruyamaProgrammer(s)Hideo KamodaM. AndoSatoru KubotaArtist(s)Atsushi KobayashiF. KuriharaHideyuki KusanoComposer(s)Hideki YamamotoHiroaki KujiraiHiroaki ShimizuPlatform(s) Arcade Neo Geo AES, Neo Geo CD Release 1994 ArcadeWW: 26 July 1994 Neo Geo AESJP: 26 August 1994NA: 26 August 1994EU: 26 August 1994 Neo Geo CDJP: 13 J...

Scholasticism revival Part of a series onThomas Aquinas Thomism Scholasticism Apophatic theology Divine simplicity Quinque viae Beatific vision Actus purus Actus essendi Primum Movens Sacraments Correspondence theory Hylomorphism Substance theory (ousia) Substantial form Quiddity (essence / accident nature) Peripatetic axiom Principle of double effect Aristotelian ethics Cardinal / Theological Natural law Determinatio Just war Just price Concupiscence Analytical Th...

 

 

Maimonides, seorang filsuf Yahudi terkenal, adalah seorang dzimmi di Al-Andalus, Maroko dan akhirnya Mesir. Secara istilah, dzimmi (Arab: ذميcode: ar is deprecated , majemuk: أهل الذمة, ahlul dzimmah, orang-orang dzimmah) adalah orang non-Muslim merdeka yang hidup dalam negara Islam, yang sebagai balasan karena membayar pajak perorangan, menerima perlindungan dan keamanan.[1] Hukum mengenai dzimmi berlaku di sebuah negara yang menjalankan Syariah Islam. Kata dzimmi sendiri ...

 

 

Mama Flora's Family AuthorAlex HaleyPublisherIthaca ILR PressPublication date1997 1998 American filmMama Flora's FamilyDirected byPeter WernerWritten byAlex HaleyDavid Stevens(novel)David StevensCarol Schreder(teleplay)Produced byLisa LindstromCarol SchrederStarringCicely TysonBlair UnderwoodQueen LatifahMario Van PeeblesErika AlexanderTanya WrightDavid RamseyShemar MooreNovella NelsonLou MyersCinematographyNeil RoachEdited byTod FeuermanMusic byJ. A. C. RedfordDistributed byCBSRelease dateNo...

Town in Estonia Place in Viljandi County, EstoniaAbja-PaluojaMain street in Abja-PaluojaAbja-PaluojaShow map of EstoniaAbja-PaluojaShow map of EuropeCoordinates: 58°07′33″N 25°21′31″E / 58.12583°N 25.35861°E / 58.12583; 25.35861CountryEstoniaCountyViljandi CountyMunicipalityMulgi ParishTown status1993Population (2018)[1] • Total1,057 • Rank44thTime zoneUTC+2 (EET) Abja-Paluoja is a town in Mulgi Parish, Viljandi County, ...

 

 

2009 single by Taylor Swift This article is about the Taylor Swift song. For other uses, see 15 (disambiguation) § Songs. FifteenSingle by Taylor Swiftfrom the album Fearless ReleasedAugust 31, 2009 (2009-08-31)Recorded2008StudioBlackbird (Nashville)Genre Country pop guitar pop Length4:55LabelBig MachineSongwriter(s)Taylor SwiftProducer(s) Taylor Swift Nathan Chapman Taylor Swift singles chronology You Belong with Me (2009) Fifteen (2009) Two Is Better Than One (2009) Mus...

 

 

土库曼斯坦总统土库曼斯坦国徽土库曼斯坦总统旗現任谢尔达尔·别尔德穆哈梅多夫自2022年3月19日官邸阿什哈巴德总统府(Oguzkhan Presidential Palace)機關所在地阿什哈巴德任命者直接选举任期7年,可连选连任首任萨帕尔穆拉特·尼亚佐夫设立1991年10月27日 土库曼斯坦土库曼斯坦政府与政治 国家政府 土库曼斯坦宪法 国旗 国徽 国歌 立法機關(英语:National Council of Turkmenistan) ...

هذه المقالة يتيمة إذ تصل إليها مقالات أخرى قليلة جدًا. فضلًا، ساعد بإضافة وصلة إليها في مقالات متعلقة بها. (سبتمبر 2017) يفتقر محتوى هذه المقالة إلى الاستشهاد بمصادر. فضلاً، ساهم في تطوير هذه المقالة من خلال إضافة مصادر موثوق بها. أي معلومات غير موثقة يمكن التشكيك بها وإزالتها...

 

 

Prinsip pengukuran efek fotolistrik.Emisi elektron dari pelat logam disebabkan oleh kuanta-foton cahaya. Efek fotolistrik adalah pengeluaran elektron dari suatu permukaan (biasanya logam) ketika permukaan itu dikenai dan menyerap radiasi elektromagnetik (seperti cahaya tampak dan radiasi ultraungu) yang berada di atas frekuensi ambang tergantung pada jenis permukaan.[1] Elektron yang dipancarkan dengan cara ini disebut fotoelektron. Fenomena ini dipelajari dalam fisika benda terkonden...

 

 

Университет Пикардии имени Жюля ВернаL'université de Picardie Jules-Verne Год основания 1970 Тип государственный университет Президент Georges Fauré Студенты 23 000 Расположение Амьен Сайт u-picardie.fr  Медиафайлы на Викискладе Университет Пикардии имени Жюля Верна — французский университет, ...

1970 Chilean presidential election ← 1964 4 September 1970 1989 →   Nominee Salvador Allende Jorge Alessandri Radomiro Tomic Party Socialist Independent PDC Alliance Popular Unity PN–DR Popular vote 1,070,334 1,031,159 821,801 Percentage 36.61% 35.27% 28.11% Congress vote 153 35 Results by provinceResults by commune President before election Eduardo Frei Montalva PDC Elected President Salvador Allende Socialist Politics of Chile Executive President (lis...

 

 

Volume of air displaced between normal inhalation and exhalation This article needs additional citations for verification. Please help improve this article by adding citations to reliable sources. Unsourced material may be challenged and removed.Find sources: Tidal volume – news · newspapers · books · scholar · JSTOR (October 2009) (Learn how and when to remove this message) TLCTotal lung capacity: the volume in the lungs at maximal inflation, the sum ...

 

 

يفتقر محتوى هذه المقالة إلى الاستشهاد بمصادر. فضلاً، ساهم في تطوير هذه المقالة من خلال إضافة مصادر موثوق بها. أي معلومات غير موثقة يمكن التشكيك بها وإزالتها. (ديسمبر 2018) أنشأت بلغاريا تقاليد في مجموعة كبيرة ومتنوعة من الألعاب الرياضية. بلغاريا في الألعاب الأولمبية المقالة �...

This article needs additional citations for verification. Please help improve this article by adding citations to reliable sources. Unsourced material may be challenged and removed.Find sources: Palestine Railways – news · newspapers · books · scholar · JSTOR (January 2024) (Learn how and when to remove this message)State-owned railway company, 1920–1948 Palestine RailwaySamakh station, shortly after being captured by Australian light horsemen on 25 ...

 

 

.ke

Internet country-code top level domain for Kenya This article relies largely or entirely on a single source. Relevant discussion may be found on the talk page. Please help improve this article by introducing citations to additional sources.Find sources: .ke – news · newspapers · books · scholar · JSTOR (December 2021) .keIntroduced29 April 1993; 31 years ago (1993-04-29)TLD typeCountry code top-level domainStatusActiveRegistryKeNICSpo...