Cronbach's alpha (Cronbach's ), also known as tau-equivalent reliability () or coefficient alpha (coefficient ), is a reliability coefficient and a measure of the internal consistency of tests and measures.[1][2][3] It was named after the American psychologist Lee Cronbach.
In his initial 1951 publication, Lee Cronbach described the coefficient as Coefficientalpha[1] and included an additional derivation.[10]Coefficient alpha had been used implicitly in previous studies,[11][12][13][14] but his interpretation was thought to be more intuitively attractive relative to previous studies and it became quite popular.[15]
In 1967, Melvin Novick and Charles Lewis proved that it was equal to reliability if the true scores[i] of the compared tests or measures vary by a constant, which is independent of the people measured. In this case, the tests or measurements were said to be "essentially tau-equivalent."[16]
In 1978, Cronbach asserted that the reason the initial 1951 publication was widely cited was "mostly because [he] put a brand name on a common-place coefficient."[2]: 263 [3] He explained that he had originally planned to name other types of reliability coefficients, such as those used in inter-rater reliability and test-retest reliability, after consecutive Greek letters (i.e., , , etc.), but later changed his mind.
Later, in 2004, Cronbach and Richard Shavelson encouraged readers to use generalizability theory rather than . Cronbach opposed the use of the name "Cronbach's alpha" and explicitly denied the existence of studies that had published the general formula of KR-20 before Cronbach's 1951 publication of the same name.[9]
Prerequisites for using Cronbach's alpha
To use Cronbach's alpha as a reliability coefficient, the following conditions must be met:[17][18]
Cronbach's alpha is calculated by taking a score from each scale item and correlating it with the total score for each observation. The resulting correlations are then compared with the variance for all individual item scores. Cronbach's alpha is best understood as a function of the number of questions or items in a measure, the average covariance between pairs of items, and the overall variance of the total measured score.[19][8]where:
represents the number of items in the measure
the variance associated with each item i
the variance associated with the total scores
Alternatively, it can be calculated through the following formula:[20]
The value of Cronbach's alpha ranges between zero and one
By definition, reliability cannot be less than zero and cannot be greater than one. Many textbooks mistakenly equate with reliability and give an inaccurate explanation of its range. can be less than reliability when applied to data that are not essentially tau-equivalent. Suppose that copied the value of as it is, and copied by multiplying the value of by -1.
The covariance matrix between items is as follows, .
Observed covariance matrix
Negative can occur for reasons such as negative discrimination or mistakes in processing reversely scored items.
Unlike , SEM-based reliability coefficients (e.g., ) are always greater than or equal to zero.
This anomaly was first pointed out by Cronbach (1943)[21] to criticize , but Cronbach (1951)[10] did not comment on this problem in his article that otherwise discussed potentially problematic issues related .[9]: 396 [22]
If there is no measurement error, the value of Cronbach's alpha is one.
This anomaly also originates from the fact that underestimates reliability.
Suppose that copied the value of as it is, and copied by multiplying the value of by two.
The covariance matrix between items is as follows, .
Observed covariance matrix
For the above data, both and have a value of one.
The above example is presented by Cho and Kim (2015).[7]
A high value of Cronbach's alpha indicates homogeneity between the items
Many textbooks refer to as an indicator of homogeneity[23] between items. This misconception stems from the inaccurate explanation of Cronbach (1951)[10] that high values show homogeneity between the items. Homogeneity is a term that is rarely used in modern literature, and related studies interpret the term as referring to uni-dimensionality. Several studies have provided proofs or counterexamples that high values do not indicate uni-dimensionality.[24][7][25][26][27][28] See counterexamples below.
Uni-dimensional data
in the uni-dimensional data above.
Multidimensional data
in the multidimensional data above.
Multidimensional data with extremely high reliability
The above data have , but are multidimensional.
Uni-dimensional data with unacceptably low reliability
The above data have , but are uni-dimensional.
Uni-dimensionality is a prerequisite for . One should check uni-dimensionality before calculating rather than calculating to check uni-dimensionality.[3]
A high value of Cronbach's alpha indicates internal consistency
The term "internal consistency" is commonly used in the reliability literature, but its meaning is not clearly defined. The term is sometimes used to refer to a certain kind of reliability (e.g., internal consistency reliability), but it is unclear exactly which reliability coefficients are included here, in addition to . Cronbach (1951)[10] used the term in several senses without an explicit definition. Cho and Kim (2015)[7] showed that is not an indicator of any of these.
Removing items using "alpha if item deleted" always increases reliability
Removing an item using "alpha if item deleted"[clarification needed] may result in 'alpha inflation,' where sample-level reliability is reported to be higher than population-level reliability.[29] It may also reduce population-level reliability.[30] The elimination of less-reliable items should be based not only on a statistical basis but also on a theoretical and logical basis. It is also recommended that the whole sample be divided into two and cross-validated.[29]
Ideal reliability level and how to increase reliability
Nunnally's recommendations for the level of reliability
Nunnally's book[31][32] is often mentioned as the primary source for determining the appropriate level of dependability coefficients. However, his proposals contradict his aims as he suggests that different criteria should be used depending on the goal or stage of the investigation. Regardless of the type of study, whether it is exploratory research, applied research, or scale development research, a criterion of 0.7 is universally employed.[33] He advocated 0.7 as a criterion for the early stages of a study, most studies published in the journal do not fall under that category. Rather than 0.7, Nunnally's applied research criterion of 0.8 is more suited for most empirical studies.[33]
Nunnally's recommendations on the level of reliability
His recommendation level did not imply a cutoff point. If a criterion means a cutoff point, it is important whether or not it is met, but it is unimportant how much it is over or under. He did not mean that it should be strictly 0.8 when referring to the criteria of 0.8. If the reliability has a value near 0.8 (e.g., 0.78), it can be considered that his recommendation has been met.[34]
Cost to obtain a high level of reliability
Nunnally's idea was that there is a cost to increasing reliability, so there is no need to try to obtain maximum reliability in every situation.
Trade-off with validity
Measurements with perfect reliability lack validity.[7] For example, a person who takes the test with a reliability of one will either receive a perfect score or a zero score, because if they answer one item correctly or incorrectly, they will answer all other items in the same manner. The phenomenon where validity is sacrificed to increase reliability is known as the attenuation paradox.[35][36]
A high value of reliability can conflict with content validity. To achieve high content validity, each item should comprehensively represent the content to be measured. However, a strategy of repeatedly measuring essentially the same question in different ways is often used solely to increase reliability.[37][38]
Trade-off with efficiency
When the other conditions are equal, reliability increases as the number of items increases. However, the increase in the number of items hinders the efficiency of measurements.
Methods to increase reliability
Despite the costs associated with increasing reliability discussed above, a high level of reliability may be required. The following methods can be considered to increase reliability.
Do not measure what the respondents do not know.[39]
Increase the number of items. However, care should be taken not to excessively inhibit the efficiency of the measurement.
Use a scale that is known to be highly reliable.[40]
Conduct a pretest - discover in advance the problem of reliability.
Exclude or modify items that are different in content or form from other items (e.g., reverse-scored items).
After data collection:
Remove the problematic items using "alpha if item deleted". However, this deletion should be accompanied by a theoretical rationale.
Use a more accurate reliability coefficient than . For example, is 0.02 larger than on average.[41]
Which reliability coefficient to use
is used in an overwhelming proportion. A study estimates that approximately 97% of studies use as a reliability coefficient.[3]
However, simulation studies comparing the accuracy of several reliability coefficients have led to the common result that is an inaccurate reliability coefficient.[42][43][6][44][45]
Methodological studies are critical of the use of . Simplifying and classifying the conclusions of existing studies are as follows.
Conditional use: Use only when certain conditions are met.[3][7][8]
Existing studies are practically unanimous in that they oppose the widespread practice of using unconditionally for all data. However, different opinions are given on which reliability coefficient should be used instead of .
Different reliability coefficients ranked first in each simulation study[42][43][6][44][45] comparing the accuracy of several reliability coefficients.[7]
The majority opinion is to use structural equation modeling or SEM-based reliability coefficients as an alternative to .[3][7][46][5][47][8][6][48]
However, there is no consensus on which of the several SEM-based reliability coefficients (e.g., uni-dimensional or multidimensional models) is the best to use.
Some people suggest [6] as an alternative, but shows information that is completely different from reliability. is a type of coefficient comparable to Reveille's .[49][6] They do not substitute, but complement reliability.[3]
Among SEM-based reliability coefficients, multidimensional reliability coefficients are rarely used, and the most commonly used is ,[3] also known as composite or congeneric reliability.
Software for SEM-based reliability coefficients
General-purpose statistical software such as SPSS and SAS include a function to calculate . Users who don't know the formula have no problem in obtaining the estimates with just a few mouse clicks.
SEM software such as AMOS, LISREL, and MPLUS does not have a function to calculate SEM-based reliability coefficients. Users need to calculate the result by inputting it to the formula. To avoid this inconvenience and possible error, even studies reporting the use of SEM rely on instead of SEM-based reliability coefficients.[3] There are a few alternatives to automatically calculate SEM-based reliability coefficients.
R (free): The psych package[50] calculates various reliability coefficients.
EQS (paid):[51] This SEM software has a function to calculate reliability coefficients.
RelCalc (free):[3] Available with Microsoft Excel. can be obtained without the need for SEM software. Various multidimensional SEM reliability coefficients and various types of can be calculated based on the results of SEM software.
Notes
^The true score is the difference between the score observed during the test or measurement and the error in that observation. See classical test theory for further information.
^This implicitly requires that the data can be ordered, and thus requires that it is not nominal.
References
^ abCronbach, Lee J. (1951). "Coefficient alpha and the internal structure of tests". Psychometrika. 16 (3). Springer Science and Business Media LLC: 297–334. doi:10.1007/bf02310555. hdl:10983/2196. S2CID13820448.
^ abCronbach, L. J. (1978). "Citation Classics"(PDF). Current Contents. 13: 263. Archived(PDF) from the original on 2022-01-20. Retrieved 2021-03-22.
^ abcdefgRevelle, W.; Zinbarg, R. E. (2009). "Coefficients alpha, beta, omega, and the glb: Comments on Sijtsma". Psychometrika. 74 (1): 145–154. doi:10.1007/s11336-008-9102-z. S2CID5864489.
^ abcCronbach, L. J.; Shavelson, R. J. (2004). "My Current Thoughts on Coefficient Alpha and Successor Procedures". Educational and Psychological Measurement. 64 (3): 391–418. doi:10.1177/0013164404266386. S2CID51846704.
^Jackson, R. W. B.; Ferguson, G. A. (1941). "Studies on the reliability of tests". University of Toronto Department of Educational Research Bulletin. 12: 132.
^Cortina, J. M. (1993). "What is coefficient alpha? An examination of theory and applications". Journal of Applied Psychology. 78 (1): 98–104. doi:10.1037/0021-9010.78.1.98.
^Green, S. B.; Lissitz, R. W.; Mulaik, S. A. (1977). "Limitations of coefficient alpha as an Index of test unidimensionality". Educational and Psychological Measurement. 37 (4): 827–838. doi:10.1177/001316447703700403. S2CID122986180.
^McDonald, R. P. (1981). "The dimensionality of tests and items". The British Journal of Mathematical and Statistical Psychology. 34 (1): 100–117. doi:10.1111/j.2044-8317.1981.tb00621.x.
^Schmitt, N. (1996). "Uses and abuses of coefficient alpha". Psychological Assessment. 8 (4): 350–3. doi:10.1037/1040-3590.8.4.350.
^Ten Berge, J. M. F.; Sočan, G. (2004). "The greatest lower bound to the reliability of a test and the hypothesis of unidimensionality". Psychometrika. 69 (4): 613–625. doi:10.1007/BF02289858. S2CID122674001.
^Raykov, T. (2007). "Reliability if deleted, not 'alpha if deleted': Evaluation of scale reliability following component deletion". The British Journal of Mathematical and Statistical Psychology. 60 (2): 201–216. doi:10.1348/000711006X115954. PMID17971267.
^ abLance, C. E.; Butts, M. M.; Michels, L. C. (2006). "What did they really say?". Organizational Research Methods. 9 (2): 202–220. doi:10.1177/1094428105284919. S2CID144195175.
^Cho, E. (2020). "A comprehensive review of so-called Cronbach's alpha". Journal of Product Research. 38 (1): 9–20.
^Humphreys, L. (1956). "The normal curve and the attenuation paradox in test theory". Psychological Bulletin. 53 (6): 472–6. doi:10.1037/h0041091. PMID13370692.
^Boyle, G. J. (1991). "Does item homogeneity indicate internal consistency or item redundancy in psychometric scales?". Personality and Individual Differences. 12 (3): 291–4. doi:10.1016/0191-8869(91)90115-R.
^Streiner, D. L. (2003). "Starting at the beginning: An introduction to coefficient alpha and internal consistency". Journal of Personality Assessment. 80 (1): 99–103. doi:10.1207/S15327752JPA8001_18. PMID12584072. S2CID3679277.
^Lee, H. (2017). Research Methodology (2nd ed.), Hakhyunsa.
^Peterson, R. A.; Kim, Y. (2013). "On the relationship between coefficient alpha and composite reliability". Journal of Applied Psychology. 98 (1): 194–8. doi:10.1037/a0030767. PMID23127213.
^ abKamata, A., Turhan, A., & Darandari, E. (2003). Estimating reliability for multidimensional composite scale scores. Annual Meeting of American Educational Research Association, Chicago, April 2003, April, 1–27.
^ abOsburn, H. G. (2000). "Coefficient alpha and related internal consistency reliability coefficients". Psychological Methods. 5 (3): 343–355. doi:10.1037/1082-989X.5.3.343. PMID11004872.
^ abTang, W., & Cui, Y. (2012). A simulation study for comparing three lower bounds to reliability. Paper Presented on April 17, 2012 at the AERA Division D: Measurement and Research Methodology, Section 1: Educational Measurement, Psychometrics, and Assessment, 1–25.
^ abPeters, G. Y. (2014). "The alpha and the omega of scale reliability and validity comprehensive assessment of scale quality". The European Health Psychologist. 1 (2): 56–69.
^ abYang, Y., & Green, S. B.Yanyun Yang; Green, Samuel B. (2011). "Coefficient alpha: A reliability coefficient for the 21st century?". Journal of Psychoeducational Assessment. 29 (4): 377–392. doi:10.1177/0734282911406668. S2CID119926199.
^Revelle, W. (1979). "Hierarchical cluster analysis and the internal structure of tests". Multivariate Behavioral Research. 14 (1): 57–74. doi:10.1207/s15327906mbr1401_4. PMID26766619.
Potret Paul A View of St. Louis: A Triptych, oil on canvas, 1898 After the Rain Gloucester Paul Cornoyer (15 Agustus 1864 – 17 Juni 1923) adalah seorang pelukis Amerika, yang saat ini terkenal karena lukisannya yang direproduksi secara populer dalam gaya Impresionis, tonalis, dan kadang-kadang pointillist. Lahir di St. Louis, Missouri, Cornoyer mulai melukis dengan gaya Barbizon dan pertama kali dipamerkan pada tahun 1887. Pada tahun 1889, ia melanjutkan studinya di Paris di A...
Dutch painter This article relies largely or entirely on a single source. Relevant discussion may be found on the talk page. Please help improve this article by introducing citations to additional sources.Find sources: Frans van Mieris the Younger – news · newspapers · books · scholar · JSTOR (February 2012) Frans van Mieris (II) (Jacob Houbraken) Frans van Mieris the Younger (24 November 1689 – 22 October 1763) was a Dutch painter. He was born in Le...
Juan Peralta Peralta in 2020 Persoonlijke informatie Volledige naam Juan Peralta Gascon Geboortedatum 17 mei 1990 Geboorteplaats Pamplona, Spanje Sportieve informatie Discipline(s) Baan Specialisatie(s) Sprintonderdelen Portaal Wielersport Juan Peralta Gascon (Pamplona, 17 mei 1990) is een Spaans baanwielrenner. Hij vertegenwoordigde zijn land tijdens de Olympische Spelen van 2012 waar hij tiende werd op de keirin en die van de 2016 waar hij een negentiende plaats behaalde op de ...
具葡萄牙風格的澳門政府總部前座 中華人民共和國澳門特別行政區政府總部(或前總督府,葡萄牙語:Sede do Governo da Região Administrativa Especial de Macau da República Popular da China)是位於澳門南灣大馬路的地標性官方建築物。其前身為澳門總督府(簡稱澳督府,葡萄牙語:Palácio da Praia Grande),澳葡時期曾為澳門總督辦公之地。昔日澳門總督被稱為「兵頭」,故亦有兵頭行之俗稱�...
Gaudium et Spes atau Konstitusi Pastoral tentang Gereja di Dunia Dewasa Ini adalah dokumen puncak dari Konsili Vatikan Kedua. Konstitusi ini disetujui oleh para Uskup dalam sebuah pemungutan suara 2.307 berbanding 75, dan diresmikan oleh Paus Paulus VI pada 7 Desember 1965. Judul Gaudium et Spes atau Kegembiraan dan Harapan (dalam Bahasa Inggris Joy and Hope) diambil dari baris pertama dokumen ini, sebagaimana umumnya dokumen Gereja Katolik dinamai. Daftar Isi Dokumen Dokumen Gaudium et Spes ...
هذه المقالة يتيمة إذ تصل إليها مقالات أخرى قليلة جدًا. فضلًا، ساعد بإضافة وصلة إليها في مقالات متعلقة بها. (يناير 2021) جزء من سلسلة مقالات حولاقتصاد تاريخأفرع تاريخ الاقتصاد مدارس الاقتصاد اقتصاد سائد اقتصاد بدعي المنهجية الاقتصادية اقتصاد سياسي الاقتصاد الجزئي الاقتصاد ا�...
Spanish general (1836-1915) In this Spanish name, the first or paternal surname is Jáudines and the second or maternal family name is Álvarez. The Most ExcellentFermín JáudenesOIC114th Governor-General of the PhilippinesIn officeJuly 24 – August 13, 1898Preceded byBasilio AugustínSucceeded byFrancisco Rizzo Personal detailsBorn(1836-07-07)July 7, 1836A Coruña, Province of A CoruñaDiedFebruary 11, 1915(1915-02-11) (aged 78)Logroño, La RiojaMilitary serviceAllegi...
Water company SunwaterTypeGovernment-owned corporationIndustryWater resourcesHeadquartersBrisbane, AustraliaArea servedQueenslandProductsWaterServicesFrom designing and building dams, managing and operating infrastructureTotal assetsA$10 billionParentGovernment of QueenslandWebsitesunwater.com.auFootnotes / references[1][2][3][4][5] Sunwater, the trading name of Sunwater Limited, is a statutory Queensland Government-owned corporation[6...
Bombus pratorum alimentándose en una inflorescencia de tipo espiga. Mariposa Pieris napi, ejemplo de psicofilia. En Botánica se llama zoofilia a la adaptación de las plantas fanerógamas que aseguran su polinización con ayuda de animales. En este contexto se llama vectores a los animales que realizan el transporte del polen. Las plantas zoófilas presentan flores o caracteres adaptativos con rasgos particulares, como pueden ser rasgos ópticos u olfativos llamativos para los animales poli...
Roman Catholic evangelical society Not to be confused with Order of Saint Paul the First Hermit. Missionary Society of Saint Paul the ApostleSocietas Sacerdotum Missionariorum a Sancto Paulo ApostoloAbbreviationCSPNicknamePaulist FathersFormationJuly 7, 1858; 165 years ago (July 7, 1858)FounderFr. Isaac Thomas Hecker, CSPFounded atNew York City, New York, USTypeSociety of apostolic life of Pontifical Right for menHeadquartersMotherhouse:415 West 59th Street, New York, NY, USARegi...
UFC mixed martial arts event in 2022 UFC Fight Night: Dern vs. YanThe poster for UFC Fight Night: Dern vs. YanInformationPromotionUltimate Fighting ChampionshipDateOctober 1, 2022 (2022-10-01)VenueUFC ApexCityEnterprise, Nevada, United StatesAttendanceNot announced[1]Event chronology UFC Fight Night: Sandhagen vs. Song UFC Fight Night: Dern vs. Yan UFC Fight Night: Grasso vs. Araújo UFC Fight Night: Dern vs. Yan (also known as UFC Fight Night 211, UFC on ESPN+ 69 and U...
Amusement park in Surfers Paradise, Australia Adrenalin Park in 2007 Funtime is a small amusement park in the centre of Surfers Paradise, City of Gold Coast, Queensland, Australia, operated by the Funtime amusement ride manufacturing company of Austria.[1] The park features installations of their Sling Shot and Vomatron rides.[2][3] The park was previously known as Adrenalin Park and featured several other rides and attractions including an Intamin Parachute Drop tower...
Guitar tuning This article needs additional citations for verification. Please help improve this article by adding citations to reliable sources. Unsourced material may be challenged and removed.Find sources: Drop C tuning – news · newspapers · books · scholar · JSTOR (March 2009) (Learn how and when to remove this template message) Drop C tuning (CGCFAD) Drop C tuning (CGCFAD) (listen) Drop C tuning is an alternative guitar tuning where at least one s...
Online web archive For a guide to using archive.today within Wikipedia, see Help:Using archive.today. archive.todayScreenshot of the archive.today home pageType of siteWeb archivingAvailable inMultilingualURL archive.today (main) archive.ph archive.is archive.li archive.vn archive.fo archive.md archiveiya74codqgiixo33q62qlrqtkgmcitqx5u2oeqnmn5bpcbiyd.onion (Accessing link help)[1] RegistrationNoLaunchedMay 16, 2012; 11 years ago (2012-05-16)[2] archive.t...
1991 Kosovan independence referendum 26–30 September 1991 Results Choice Votes % Yes 913,705 99.98% No 164 0.02% Valid votes 913,869 99.90% Invalid or blank votes 933 0.10% Total votes 914,802 100.00% Registered voters/turnout 1,051,357 87.01% Politics of Kosovo Constitution and law Executive President Vjosa Osmani Prime Minister Albin Kurti Second Kurti cabinet Government Legislature Assembly Chairman: Glauk Konjufca Judiciary Constitutional Court Kosovo Judicial Council Administrative div...
Class of green algae Chloropicophyceae Chloropicon sieburthii, type species Scientific classification Kingdom: Plantae Division: Chlorophyta Class: ChloropicophyceaeLopes dos Santos & Eikrem 2017 Order: ChloropicalesLopes dos Santos & Eikrem 2017 Family: ChloropicaceaeLopes dos Santos & Eikrem 2017 Genera Chloropicon Chloroparvula Wikimedia Commons has media related to Chloropicophyceae. Wikispecies has information related to Chloropicophyceae. Chloropicophyceae is a class of gree...