Platt scaling

In machine learning, Platt scaling or Platt calibration is a way of transforming the outputs of a classification model into a probability distribution over classes. The method was invented by John Platt in the context of support vector machines,[1] replacing an earlier method by Vapnik, but can be applied to other classification models.[2] Platt scaling works by fitting a logistic regression model to a classifier's scores.

Description

Consider the problem of binary classification: for inputs x, we want to determine whether they belong to one of two classes, arbitrarily labeled +1 and −1. We assume that the classification problem will be solved by a real-valued function f, by predicting a class label y = sign(f(x)).[a] For many problems, it is convenient to get a probability , i.e. a classification that not only gives an answer, but also a degree of certainty about the answer. Some classification models do not provide such a probability, or give poor probability estimates.

Standard logistic function where

.

Platt scaling is an algorithm to solve the aforementioned problem. It produces probability estimates

,

i.e., a logistic transformation of the classifier scores f(x), where A and B are two scalar parameters that are learned by the algorithm. Note that predictions can now be made according to if the probability estimates contain a correction compared to the old decision function y = sign(f(x)).[3]

The parameters A and B are estimated using a maximum likelihood method that optimizes on the same training set as that for the original classifier f. To avoid overfitting to this set, a held-out calibration set or cross-validation can be used, but Platt additionally suggests transforming the labels y to target probabilities

for positive samples (y = 1), and
for negative samples, y = -1.

Here, N+ and N are the number of positive and negative samples, respectively. This transformation follows by applying Bayes' rule to a model of out-of-sample data that has a uniform prior over the labels.[1] The constants 1 and 2, on the numerator and denominator respectively, are derived from the application of Laplace smoothing.

Platt himself suggested using the Levenberg–Marquardt algorithm to optimize the parameters, but a Newton algorithm was later proposed that should be more numerically stable.[4]

Analysis

Platt scaling has been shown to be effective for SVMs as well as other types of classification models, including boosted models and even naive Bayes classifiers, which produce distorted probability distributions. It is particularly effective for max-margin methods such as SVMs and boosted trees, which show sigmoidal distortions in their predicted probabilities, but has less of an effect with well-calibrated models such as logistic regression, multilayer perceptrons, and random forests.[2]

An alternative approach to probability calibration is to fit an isotonic regression model to an ill-calibrated probability model. This has been shown to work better than Platt scaling, in particular when enough training data is available.[2]

Platt scaling can also be applied to deep neural network classifiers. For image classification, such as CIFAR-100, small networks like LeNet-5 have good calibration but low accuracy, and large networks like ResNet has high accuracy but is overconfident in predictions. A 2017 paper proposed temperature scaling, which simply multiplies the output logits of a network by a constant before taking the softmax. During training, is set to 1. After training, is optimized on a held-out calibration set to minimize the calibration loss.[5]

See also

Notes

  1. ^ See sign function. The label for f(x) = 0 is arbitrarily chosen to be either zero, or one.

References

  1. ^ a b Platt, John (1999). "Probabilistic outputs for support vector machines and comparisons to regularized likelihood methods". Advances in Large Margin Classifiers. 10 (3): 61–74.
  2. ^ a b c Niculescu-Mizil, Alexandru; Caruana, Rich (2005). Predicting good probabilities with supervised learning (PDF). ICML. doi:10.1145/1102351.1102430.
  3. ^ Olivier Chapelle; Vladimir Vapnik; Olivier Bousquet; Sayan Mukherjee (2002). "Choosing multiple parameters for support vector machines" (PDF). Machine Learning. 46: 131–159. doi:10.1023/a:1012450327387.
  4. ^ Lin, Hsuan-Tien; Lin, Chih-Jen; Weng, Ruby C. (2007). "A note on Platt's probabilistic outputs for support vector machines" (PDF). Machine Learning. 68 (3): 267–276. doi:10.1007/s10994-007-5018-6.
  5. ^ Guo, Chuan; Pleiss, Geoff; Sun, Yu; Weinberger, Kilian Q. (2017-07-17). "On Calibration of Modern Neural Networks". Proceedings of the 34th International Conference on Machine Learning. PMLR: 1321–1330.

Read other articles:

Dalam artikel ini, nama keluarganya adalah Hoàng. Sesuai dengan kebiasaan Vietnam, tokoh ini seharusnya disebut dengan nama pemberian, Minh. Hoàng Cơ MinhHoàng Cơ Minh sebagai perwira angkatan laut muda.Lahir20 Juni 1935Hanoi, Tonkin, Indochina PrancisMeninggal28 Agustus 1987(1987-08-28) (umur 52)Provinsi Attapeu, LaosSebab meninggalBunuh diriKebangsaanVietnamPekerjaanKomodor, politikusDikenal atasKetua Việt Tân pertama Hoàng Cơ Minh (20 Juni 1935 – 28 Agus...

 

Alexander Mikhailovich ProkhorovLahir11 Juli 1916Atherton, Queensland, AustraliaMeninggal8 Januari 2002(2002-01-08) (umur 85)Moskwa, RusiaKebangsaanRusiaDikenal atasLaserPenghargaanNobel Fisika (1964)Karier ilmiahBidangFisika Untuk pemain sepak bola, lihat Aleksandr Vladimirovich Prokhorov. Aleksandr Mikhailovich Prokhorov (Rusia: Александр Михайлович Прохоровcode: ru is deprecated ; 11 Juli 1916 – 8 Januari 2002) ialah seorang fisikawan Uni So...

 

العلاقات الإكوادورية الجامايكية الإكوادور جامايكا   الإكوادور   جامايكا تعديل مصدري - تعديل   العلاقات الإكوادورية الجامايكية هي العلاقات الثنائية التي تجمع بين الإكوادور وجامايكا.[1][2][3][4][5] مقارنة بين البلدين هذه مقارنة عامة ومرجعية لل�...

Sistem Liga Nasional (Inggris: National League System) adalah sebuah sistem kompetisi yang dibuat dan dikontrol oleh Asosiasi Sepak Bola Inggris (FA) yang mengatur hubungan antara promosi dan degradasi di antara liga-liga yang ikut di dalamnya.[1] Sistem Liga Nasional ini terdiri dari tujuh tingkat (disebut dengan: step) dan langsung berada di bawah tingkat Liga Primer dan Football League. Sistem ini beranggotakan 91 kompetisi liga-liga regional dengan lebih dari 1.600 klub, dan o...

 

American baseball player (1934-1994) Baseball player Gordy ColemanColeman in 1961First basemanBorn: (1934-07-05)July 5, 1934Rockville, Maryland, U.S.Died: March 12, 1994(1994-03-12) (aged 59)Cincinnati, Ohio, U.S.Batted: LeftThrew: RightMLB debutSeptember 19, 1959, for the Cleveland IndiansLast MLB appearanceMay 3, 1967, for the Cincinnati RedsMLB statisticsBatting average.273Home runs98Runs batted in387 Teams Cleveland Indians (1959) Cincinnati Reds (1960�...

 

Chronologies Le 114e régiment d'infanterie à Paris le 14 juillet 1917Données clés 1914 1915 1916  1917  1918 1919 1920Décennies :1880 1890 1900  1910  1920 1930 1940Siècles :XVIIIe XIXe  XXe  XXIe XXIIeMillénaires :-Ier Ier  IIe  IIIe Chronologies géographiques Afrique Afrique du Sud, Algérie, Angola, Bénin, Botswana, Burkina Faso, Burundi, Cameroun, Cap-Vert, République centrafricaine, Comores, République du Congo, Républiq...

American science fiction writer (1904–1988) Simak redirects here. For the village in Iran, see Simak, Iran. Clifford D. SimakBornClifford Donald Simak(1904-08-03)August 3, 1904Millville, Wisconsin, U.S.DiedApril 25, 1988(1988-04-25) (aged 83)Minneapolis, Minnesota, U.S.OccupationJournalist, popular writerAlma materUniversity of Wisconsin–MadisonPeriod1931–1986 (fiction)GenreScience fiction, fantasySubjectPopular scienceNotable works Way Station City The Visitors Simak's first ...

 

Form of potassium feldspar SanidineSanidine from Puy de Sancy, Monts-Dore massif, Puy-de-Dôme, France. Size 5 cm × 4.5 cm (2.0 in × 1.8 in)GeneralCategoryFeldsparFormula(repeating unit)K(AlSi3O8)IMA symbolSa[1]Strunz classification9.FA.30Dana classification76.01.01.02Crystal systemMonoclinicCrystal classPrismatic (2/m) (same H-M symbol)Space groupC2/mIdentificationColorColorless to whiteCrystal habitTabular crystals, may be acicularTwinningCarlsbad twi...

 

追晉陸軍二級上將趙家驤將軍个人资料出生1910年 大清河南省衛輝府汲縣逝世1958年8月23日(1958歲—08—23)(47—48歲) † 中華民國福建省金門縣国籍 中華民國政党 中國國民黨获奖 青天白日勳章(追贈)军事背景效忠 中華民國服役 國民革命軍 中華民國陸軍服役时间1924年-1958年军衔 二級上將 (追晉)部队四十七師指挥東北剿匪總司令部參謀長陸軍�...

Indian newspaper Tripura Bani letter-headTripura Bani is the Bengali language mouthpiece of the Tripura State Committee of the All India Forward Bloc. Tripura Bani is published weekly.[1] It is published from Agartala.[2] As of 1983 Tripura Bani had a circulation of 1,900 copies.[3] As of 2007 it claimed a weekly circulation of 5,895 copies.[2] Brajagopal Roy served as the editor of Tripura Bani until his death in July 2022.[4] This section needs expans...

 

Cet article concerne le préfet de Judée. Pour le film sur le préfet de Judée, voir Ponce Pilate (film). Pour les articles homonymes, voir Ponce (homonymie) et Pilate. Si ce bandeau n'est plus pertinent, retirez-le. Cliquez ici pour en savoir plus. Cet article peut contenir un travail inédit ou des déclarations non vérifiées (mars 2021). Vous pouvez aider en ajoutant des références ou en supprimant le contenu inédit. Voir la page de discussion pour plus de détails. Si ce bande...

 

Voce principale: Law & Order: UK. La terza stagione della serie televisiva Law & Order: UK è stata trasmessa sul canale inglese ITV dal 9 settembre al 21 ottobre 2010. In Italia, la stagione è stata trasmessa in anteprima assoluta dal canale satellitare Fox Crime a partire dal 1º dicembre 2010. nº Titolo originale Titolo italiano Prima TV UK Prima TV Italia 1 Broken Vita spezzata 9 settembre 2010 1º dicembre 2010 2 Hounded Perseguitato 16 settembre 2010 3 Defence Difesa 23 sette...

Federico Baistrocchi Sottosegretario di Stato al Ministero della GuerraDurata mandato22 luglio 1933 –7 ottobre 1936 PredecessoreAngelo Manaresi SuccessoreAlberto Pariani LegislaturaXXIX Incarichi parlamentari XXVIII LegislaturaCommissione per l'esame dei bilanci e dei rendiconti consuntivi nella giunta generale del bilancio (2 maggio 1929-29 luglio 1933) Deputato del Regno d'ItaliaDurata mandato24-5-1924 –2-3-1939 LegislaturaXXVII, XXVIII, XXIX Col...

 

此條目可能包含不适用或被曲解的引用资料,部分内容的准确性无法被证實。 (2023年1月5日)请协助校核其中的错误以改善这篇条目。详情请参见条目的讨论页。 各国相关 主題列表 索引 国内生产总值 石油储量 国防预算 武装部队(军事) 官方语言 人口統計 人口密度 生育率 出生率 死亡率 自杀率 谋杀率 失业率 储蓄率 识字率 出口额 进口额 煤产量 发电量 监禁率 死刑 国债 ...

 

豪栄道 豪太郎 場所入りする豪栄道基礎情報四股名 澤井 豪太郎→豪栄道 豪太郎本名 澤井 豪太郎愛称 ゴウタロウ、豪ちゃん、GAD[1][2]生年月日 (1986-04-06) 1986年4月6日(38歳)出身 大阪府寝屋川市身長 183cm体重 160kgBMI 47.26所属部屋 境川部屋得意技 右四つ・出し投げ・切り返し・外掛け・首投げ・右下手投げ成績現在の番付 引退最高位 東大関生涯戦歴 696勝493敗...

Artikel ini berisi konten yang ditulis dengan gaya sebuah iklan. Bantulah memperbaiki artikel ini dengan menghapus konten yang dianggap sebagai spam dan pranala luar yang tidak sesuai, dan tambahkan konten ensiklopedis yang ditulis dari sudut pandang netral dan sesuai dengan kebijakan Wikipedia. Pemandangan Pantai Watu Leter Pantai Watu Leter adalah sebuah pantai di pesisir selatan yang terletak di tepi Samudera Hindia secara administratif berada di Dusun Rowotrate, Desa Sitiarjo, Kecamatan S...

 

Economic theory about capital structure The Modigliani–Miller theorem (of Franco Modigliani, Merton Miller) is an influential element of economic theory; it forms the basis for modern thinking on capital structure.[1] The basic theorem states that in the absence of taxes, bankruptcy costs, agency costs, and asymmetric information, and in an efficient market, the enterprise value of a firm is unaffected by how that firm is financed.[2][unreliable source?] This is not ...

 

Questa voce o sezione deve essere rivista e aggiornata appena possibile. Sembra infatti che questa voce contenga informazioni superate e/o obsolete. Se puoi, contribuisci ad aggiornarla. «Il cross più bello del mondo.» (G. W. Andersen) Cinque MuliniSport Atletica leggera Tipoindividuale FederazioneWorld Athletics Paese Italia LuogoSan Vittore Olona OrganizzatoreUnione Sportiva San Vittore Olona MottoIl cross più bello del mondo Cadenzaannuale DisciplineCorsa campestre Sito Internet5...

Arawakan language spoken in South America ArawakLokonoNative toFrench Guiana, Guyana, Suriname, Venezuela, Jamaica, BarbadosRegionGuianasEthnicityLokono (Arawak)Native speakers(2,500 cited 1990–2012)[1]Language familyArawakan NorthernTa-ArawakanArawakWriting systemLatin scriptLanguage codesISO 639-2arwISO 639-3arwGlottologaraw1276ELPLokonoArawakan languages in South America and the CaribbeanArawak is classified as Critically Endangered by the UNESCO Atlas of the World's Languag...

 

Stendardo della FlagellazioneAutoreLuca Signorelli Data1475 circa Tecnicatempera su tavola Dimensioni84×60 cm UbicazionePinacoteca di Brera, Milano Madonna del Latte Lo Stendardo della Flagellazione è un dipinto a tempera su tavola (84x60 cm) di Luca Signorelli, databile al 1475 e conservato nella Pinacoteca di Brera a Milano. L'opera era anticamente dipinta su due lati, oggi separati, con la Flagellazione, appunto, e la Madonna del Latte in gloria, entrambi nello stesso museo. Indice ...