Calibration (statistics)

There are two main uses of the term calibration in statistics that denote special types of statistical inference problems. Calibration can mean

  • a reverse process to regression, where instead of a future dependent variable being predicted from known explanatory variables, a known observation of the dependent variables is used to predict a corresponding explanatory variable;[1]
  • procedures in statistical classification to determine class membership probabilities which assess the uncertainty of a given new observation belonging to each of the already established classes.

In addition, calibration is used in statistics with the usual general meaning of calibration. For example, model calibration can be also used to refer to Bayesian inference about the value of a model's parameters, given some data set, or more generally to any type of fitting of a statistical model. As Philip Dawid puts it, "a forecaster is well calibrated if, for example, of those events to which he assigns a probability 30 percent, the long-run proportion that actually occurs turns out to be 30 percent."[2]

In classification

Calibration in classification means transforming classifier scores into class membership probabilities. An overview of calibration methods for two-class and multi-class classification tasks is given by Gebel (2009).[3] A classifier might separate the classes well, but be poorly calibrated, meaning that the estimated class probabilities are far from the true class probabilities. In this case, a calibration step may help improve the estimated probabilities. A variety of metrics exist that are aimed to measure the extent to which a classifier produces well-calibrated probabilities. Foundational work includes the Expected Calibration Error (ECE).[4] Into the 2020s, variants include the Adaptive Calibration Error (ACE) and the Test-based Calibration Error (TCE), which address limitations of the ECE metric that may arise when classifier scores concentrate on narrow subset of the [0,1] range.[5][6]

A 2020s advancement in calibration assessment is the introduction of the Estimated Calibration Index (ECI).[7] The ECI extends the concepts of the Expected Calibration Error (ECE) to provide a more nuanced measure of a model's calibration, particularly addressing overconfidence and underconfidence tendencies. Originally formulated for binary settings, the ECI has been adapted for multiclass settings, offering both local and global insights into model calibration. This framework aims to overcome some of the theoretical and interpretative limitations of existing calibration metrics. Through a series of experiments, Famiglini et al. demonstrate the framework's effectiveness in delivering a more accurate understanding of model calibration levels and discuss strategies for mitigating biases in calibration assessment. An online tool has been proposed to compute both ECE and ECI.[8] The following univariate calibration methods exist for transforming classifier scores into class membership probabilities in the two-class case:

In probability prediction and forecasting

In prediction and forecasting, a Brier score is sometimes used to assess prediction accuracy of a set of predictions, specifically that the magnitude of the assigned probabilities track the relative frequency of the observed outcomes. Philip E. Tetlock employs the term "calibration" in this sense in his 2015 book Superforecasting.[16] This differs from accuracy and precision. For example, as expressed by Daniel Kahneman, "if you give all events that happen a probability of .6 and all the events that don't happen a probability of .4, your calibration is perfect but your discrimination is miserable".[16] In meteorology, in particular, as concerns weather forecasting, a related mode of assessment is known as forecast skill.

In regression

The calibration problem in regression is the use of known data on the observed relationship between a dependent variable and an independent variable to make estimates of other values of the independent variable from new observations of the dependent variable.[17][18][19] This can be known as "inverse regression";[20] there is also sliced inverse regression. The following multivariate calibration methods exist for transforming classifier scores into class membership probabilities in the case with classes count greater than two:

  • Reduction to binary tasks and subsequent pairwise coupling, see Hastie and Tibshirani (1998)[21]
  • Dirichlet calibration, see Gebel (2009)[3]

Example

One example is that of dating objects, using observable evidence such as tree rings for dendrochronology or carbon-14 for radiometric dating. The observation is caused by the age of the object being dated, rather than the reverse, and the aim is to use the method for estimating dates based on new observations. The problem is whether the model used for relating known ages with observations should aim to minimise the error in the observation, or minimise the error in the date. The two approaches will produce different results, and the difference will increase if the model is then used for extrapolation at some distance from the known results.

See also

References

  1. ^ Cook, Ian; Upton, Graham (2006). Oxford Dictionary of Statistics. Oxford: Oxford University Press. ISBN 978-0-19-954145-4.
  2. ^ Dawid, A. P (1982). "The Well-Calibrated Bayesian". Journal of the American Statistical Association. 77 (379): 605–610. doi:10.1080/01621459.1982.10477856.
  3. ^ a b Gebel, Martin (2009). Multivariate calibration of classifier scores into the probability space (PDF) (PhD thesis). University of Dortmund.
  4. ^ M.P. Naeini, G. Cooper, and M. Hauskrecht, Obtaining well calibrated probabilities using bayesian binning. In: Proceedings of the AAAI Conference on Artificial Intelligence, 2015.
  5. ^ J. Nixon, M.W. Dusenberry, L. Zhang, G. Jerfel, & D. Tran. Measuring Calibration in Deep Learning. In: CVPR workshops (Vol. 2, No. 7), 2019.
  6. ^ T. Matsubara, N. Tax, R. Mudd, & I. Guy. TCE: A Test-Based Approach to Measuring Calibration Error. In: Proceedings of the Thirty-Ninth Conference on Uncertainty in Artificial Intelligence (UAI), PMLR, 2023.
  7. ^ Famiglini, Lorenzo, Andrea Campagner, and Federico Cabitza. "Towards a Rigorous Calibration Assessment Framework: Advancements in Metrics, Methods, and Use." ECAI 2023. IOS Press, 2023. 645-652. Doi 10.3233/FAIA230327
  8. ^ Famiglini, Lorenzo; Campagner, Andrea; Cabitza, Federico (2023), "Towards a Rigorous Calibration Assessment Framework: Advancements in Metrics, Methods, and Use", ECAI 2023, IOS Press, pp. 645–652, doi:10.3233/faia230327, hdl:10281/456604, retrieved 25 March 2024
  9. ^ U. M. Garczarek "[1] Archived 2004-11-23 at the Wayback Machine," Classification Rules in Standardized Partition Spaces, Dissertation, Universität Dortmund, 2002
  10. ^ P. N. Bennett, Using asymmetric distributions to improve text classifier probability estimates: A comparison of new and standard parametric methods, Technical Report CMU-CS-02-126, Carnegie Mellon, School of Computer Science, 2002.
  11. ^ B. Zadrozny and C. Elkan, Transforming classifier scores into accurate multiclass probability estimates. In: Proceedings of the Eighth International Conference on Knowledge Discovery and Data Mining, 694–699, Edmonton, ACM Press, 2002.
  12. ^ D. D. Lewis and W. A. Gale, A Sequential Algorithm for Training Text classifiers. In: W. B. Croft and C. J. van Rijsbergen (eds.), Proceedings of the 17th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR '94), 3–12. New York, Springer-Verlag, 1994.
  13. ^ J. C. Platt, Probabilistic outputs for support vector machines and comparisons to regularized likelihood methods. In: A. J. Smola, P. Bartlett, B. Schölkopf and D. Schuurmans (eds.), Advances in Large Margin Classiers, 61–74. Cambridge, MIT Press, 1999.
  14. ^ Naeini MP, Cooper GF, Hauskrecht M. Obtaining Well Calibrated Probabilities Using Bayesian Binning. Proceedings of the . AAAI Conference on Artificial Intelligence AAAI Conference on Artificial Intelligence. 2015;2015:2901-2907.
  15. ^ Meelis Kull, Telmo Silva Filho, Peter Flach; Proceedings of the 20th International Conference on Artificial Intelligence and Statistics, PMLR 54:623-631, 2017.
  16. ^ a b "Edge Master Class 2015: A Short Course in Superforecasting, Class II". edge.org. Edge Foundation. 24 August 2015. Retrieved 13 April 2018. Calibration is when I say there's a 70 percent likelihood of something happening, things happen 70 percent of time.
  17. ^ Brown, P.J. (1994) Measurement, Regression and Calibration, OUP. ISBN 0-19-852245-2
  18. ^ Ng, K. H., Pooi, A. H. (2008) "Calibration Intervals in Linear Regression Models", Communications in Statistics - Theory and Methods, 37 (11), 1688–1696. [2]
  19. ^ Hardin, J. W., Schmiediche, H., Carroll, R. J. (2003) "The regression-calibration method for fitting generalized linear models with additive measurement error", Stata Journal, 3 (4), 361–372. link, pdf
  20. ^ Draper, N.L., Smith, H. (1998) Applied Regression analysis, 3rd Edition, Wiley. ISBN 0-471-17082-8
  21. ^ T. Hastie and R. Tibshirani, "[3]," Classification by pairwise coupling. In: M. I. Jordan, M. J. Kearns and S. A. Solla (eds.), Advances in Neural Information Processing Systems, volume 10, Cambridge, MIT Press, 1998.

Read other articles:

Serangan Dnieper HilirBagian dari Front Timur dari Perang Dunia IIPara prajurit Soviet sedang melintasi DnieperTanggal24 August 1943 — 23 Desember 1943LokasiSungai Dnieper, Uni SovietHasil Kemenangan SovietPerubahanwilayah Soviet mengklaim kembali tepi kiri Ukraina, termasuk kota Kiev dan cekungan DonetsPihak terlibat  Uni Soviet Brigade Independen Cekoslowakia  Germany RumaniaTokoh dan pemimpin Georgy Zhukov Aleksandr Vasilevsky Nikolai Vatutin Ivan Konev Rodion Malinovsky Fyodor...

 

Padma BhushanJenisSipilKategoriNasionalDiinstitusikan1954Penghargaan pertama1954Total yang diberi penghargaan205Dianugerahi olehPemerintah IndiaNama sebelumnyaPadma Vibhushan Dusra Warg (Kelas II)ObverseSebuah bunga teratai di bagian tengah dan tulisan Padma yang ditulis dalam aksara Devanagari ditempatkan di bagian atas dan tulisan Bhushan ditempatkan di bagian bawah teratai.ReverseSebuah Lambang Negara India platinum ditempatkan di tengah dengan slogan nasional India, Satyameva Ja...

 

Roy MartenLahirRoy Wicaksono Abdul Salam[1]1 Maret 1952 (umur 72)Salatiga, Jawa Tengah, IndonesiaNama lainTheodoros Roy MartenPekerjaanAktorTahun aktif1974—sekarangSuami/istri Farida Sabtijastuti (cerai) Anna Maria ​(m. 1985)​ Anak6, termauk Gading Marten dan Gibran MartenKerabat Rudy Salam (kakak) Chris Salam (adik) Tanda tangan Theodoros Roy Marten (lahir 1 Maret 1952) adalah pemeran Indonesia. Kehidupan pribadi Roy pernah menikah denga...

العلاقات الأردنية المالطية الأردن مالطا   الأردن   مالطا تعديل مصدري - تعديل   العلاقات الأردنية المالطية هي العلاقات الثنائية التي تجمع بين الأردن ومالطا.[1][2][3][4][5] مقارنة بين البلدين هذه مقارنة عامة ومرجعية للدولتين: وجه المقارنة الأردن ...

 

Part of the Fifth Crusade and the Reconquista (1217) Siege of Alcácer do SalPart of the Fifth Crusade and the ReconquistaBattlements of the castle of Alcácer do SalDate30 July – 18 October 1217LocationQaṣr Abī Dānis, al-Gharb38°22′21″N 8°30′49″W / 38.37250°N 8.51361°W / 38.37250; -8.51361Result Portuguese–crusader victoryBelligerents Kingdom of PortugalCrusaders from northern Europe Almohad CaliphateCommanders and leaders Soeiro II of LisbonWillia...

 

Indian materials physicist Kasturi Lal ChopraInaugurating a seminar at Rajiv Gandhi Technological UniversityDirector at Indian Institute of Technology KharagpurIn office1987–1997Preceded byG. S. SanyalSucceeded byAmitabha GhoshProfessor at Indian Institute of Technology DelhiIn office1970–1987 Personal detailsBorn (1933-07-31) 31 July 1933 (age 90)Chahal Kalan, Gujranwala District, Punjab Province, British IndiaDied18 May 2021SpouseAsha Suri ChopraChildren3OccupationAcademic, Materia...

Questa voce sull'argomento stagioni delle società calcistiche italiane è solo un abbozzo. Contribuisci a migliorarla secondo le convenzioni di Wikipedia. Segui i suggerimenti del progetto di riferimento. Voce principale: Unione Sportiva Avellino. U.S. AvellinoStagione 1967-1968Sport calcio Squadra Avellino Allenatore Domenico Rosati Presidente Annito Abate Serie C7º posto nel girone C. Maggiori presenzeCampionato: Cesero (32) Miglior marcatoreCampionato: Ghio (20) 1966-1967 1968...

 

Artikel ini sebatang kara, artinya tidak ada artikel lain yang memiliki pranala balik ke halaman ini.Bantulah menambah pranala ke artikel ini dari artikel yang berhubungan atau coba peralatan pencari pranala. Aljabar relasional adalah bagian dari ilmu komputer, cabang dari logika predikat tingkat pertama dan aljabar himpunan, yang menangani suatu set relasi hingga yang memiliki sifat ketertutupan dengan operator-operator tertentu. Operator ini bertindak dengan satu atau lebih relasi untuk men...

 

نادي الرجال السريملصق فيلم نادي الرجال السريمعلومات عامةالصنف الفني كوميديتاريخ الصدور23 يناير 2019 (2019-01-23) ( مصر)مدة العرض 105 دقيقةاللغة الأصلية العربيةالبلد  مصرالطاقمالمخرج خالد الحلفاوي الكاتب أيمن وتارالبطولة كريم عبد العزيزغادة عادلماجد الكدوانينسري�...

Humberto Vélez Humberto Vélez en 2010.Información personalNombre de nacimiento Francisco de Humberto Vélez-MontielNacimiento 30 de marzo de 1955 (69 años) Orizaba, Veracruz, MéxicoNacionalidad MexicanaFamiliaCónyuge Cony Madera (matr. 1995; div. 2006)Hijos Alicia VélezHumberto Vélez Jr.Información profesionalOcupación Actor y actor de voz Años activo 1979-presenteConocido por Homero Simpson en Los Simpson[editar datos en Wikidata] Humberto...

 

Iranian professor of Persian culture and literature (1923–1999) Abdulhussein Zarrinkoubعبدالحسین زرین‌کوبAbdolhossein ZarrinkoubBorn(1923-03-21)March 21, 1923Borujerd, PersiaDiedSeptember 15, 1999(1999-09-15) (aged 76)Tehran, IranNationalityIranianKnown forscholar of Iranian literature, history of literature, Persian culture and history Abdolhossein Zarrinkoub (Luri/Persian: عبدالحسین زرین‌کوب, also Romanized as Zarrinkoob, Zarrinkub, Persian pr...

 

British Army officer and courtier Sir Charles Beaumont Phipps Colonel Sir Charles Beaumont Phipps KCB (27 December 1801 – 24 February 1866), was a British soldier and courtier. He was the second son of Henry Phipps, 1st Earl of Mulgrave, and was born at the family estate of Mulgrave Castle in 1801. Educated at Harrow,[1] Phipps joined the army by purchasing a commission as an ensign and lieutenant in the Scots Fusilier Guards on 17 August 1820.[2] He ranked as lieutenant...

Coal-fired power plant located in Boardman, Oregon The Boardman plant. Interior of Boardman Plant showing coal grinding machines. The Boardman Coal Plant was a coal-fired power plant located in Boardman, Oregon. The facility had a nameplate capacity of 550 megawatts (MWs) and is owned by Portland General Electric.[1] In 2010, the plant was the only remaining coal powered plant in Oregon and received much attention from regional media due to its being the largest single source of green...

 

رسم متحرك لعملية الترابط الأيوني بين الصوديوم (Na) والكلور (Cl) لتشكيل كلوريد الصوديوم ، أو ملح الطعام العادي. الترابط الأيوني ينطوي على ذرة واحدة تأخذ إلكترونات التكافؤ من أخرى (على عكس التقاسم ، الذي يحدث في الترابط التساهمي). الرابطة الأيونية أو الرابطة الشاردية[1] هي ال...

 

Ini adalah nama Maluku, Ambon marganya adalah Sahuleka Daniël SahulekaDaniel Sahuleka (1981)Informasi latar belakangLahir6 Desember 1950 (umur 73) Semarang, Jawa Tengah, IndonesiaAsalHuizen, BelandaGenrePopsouldiskofunkPekerjaanPenyanyimusisiInstrumenVokalgitarTahun aktif1976–sekarangLabelSunflightSitus websahuleka.com Daniel Sahuleka (lahir 6 Desember 1950[1]) adalah seorang penyanyi Belanda berdarah Ambon, Indonesia.[2] Ia tinggal di Winterswijk, Provinsi Gelderland,...

Sven VandenbroeckVandenbroeck nel 2009Nazionalità Belgio Altezza181 cm Calcio RuoloAllenatore (ex centrocampista) Termine carriera2009 - giocatore CarrieraSquadre di club1 1996-2000 Malines? (?)2000-2005 Roda JC55 (0)2005 De Graafschap19 (0)2006 Akratītos1 (0)2006-2007 Lierse23 (0)2007 MVV? (?)2007-2008 Visé? (?)2009 Løv-Ham4 (0) Carriera da allenatore 2014 Nikī VoloVice2014 Nikī VoloInterim2015-2016 OH LovanioVice2016-2017 Cam...

 

此條目没有列出任何参考或来源。 (2017年8月5日)維基百科所有的內容都應該可供查證。请协助補充可靠来源以改善这篇条目。无法查证的內容可能會因為異議提出而被移除。 時間單位是測量時間所用的基本單位,是任何特定的时间间隔,用作测量或表达持续时间的标准方式。从大到小排列分别為千年、世紀、年代、年、季度、月、旬、星期、日、时辰、小时、刻、字(福建�...

 

Si ce bandeau n'est plus pertinent, retirez-le. Cliquez ici pour en savoir plus. Cet article ne cite pas suffisamment ses sources (mars 2020). Si vous disposez d'ouvrages ou d'articles de référence ou si vous connaissez des sites web de qualité traitant du thème abordé ici, merci de compléter l'article en donnant les références utiles à sa vérifiabilité et en les liant à la section « Notes et références ». En pratique : Quelles sources sont attendues ? Comm...

Cathedral city and county town in England This article is about the city in England. For other uses, see Carlisle (disambiguation). City in EnglandCarlisleCityThe city skyline, cathedral, old town hall, citadel and castleCarlisleLocation within CumbriaOS grid referenceNY395555• London261 mi (420 km) SSEUnitary authorityCumberlandCeremonial countyCumbriaRegionNorth WestCountryEnglandSovereign stateUnited KingdomPost townCARLISLEPostcode distr...

 

Una immagine di Sen no Rikyū dipinta da Hasegawa Tōhaku (長谷川等伯, 1539-1610). Sen no Rikyū (千利休, anche Sen Rikyū; Sakai, 1522 – 21 aprile 1591) è stato un monaco buddhista giapponese, zen, riformatore della cerimonia del tè giapponese, che codificò in maniera definitiva nella forma wabi-cha, e maestro del tè di personaggi politici di primo piano del suo tempo quali Oda Nobunaga e Toyotomi Hideyoshi. Indice 1 La vita 2 Il wabi-cha 3 L'eredità di Rikyu 4 Nella cultura d...