Bhattacharyya distance

In statistics, the Bhattacharyya distance is a quantity which represents a notion of similarity between two probability distributions.[1] It is closely related to the Bhattacharyya coefficient, which is a measure of the amount of overlap between two statistical samples or populations.

It is not a metric, despite being named a "distance", since it does not obey the triangle inequality.

History

Both the Bhattacharyya distance and the Bhattacharyya coefficient are named after Anil Kumar Bhattacharyya, a statistician who worked in the 1930s at the Indian Statistical Institute.[2] He has developed this through a series of papers.[3][4][5] He developed the method to measure the distance between two non-normal distributions and illustrated this with the classical multinomial populations,[3] this work despite being submitted for publication in 1941, appeared almost five years later in Sankhya.[3][2] Consequently, Professor Bhattacharyya started working toward developing a distance metric for probability distributions that are absolutely continuous with respect to the Lebesgue measure and published his progress in 1942, at Proceedings of the Indian Science Congress[4] and the final work has appeared in 1943 in the Bulletin of the Calcutta Mathematical Society.[5]

Definition

For probability distributions and on the same domain , the Bhattacharyya distance is defined as

where

is the Bhattacharyya coefficient for discrete probability distributions.

For continuous probability distributions, with and where and are the probability density functions, the Bhattacharyya coefficient is defined as

.

More generally, given two probability measures on a measurable space , let be a (sigma finite) measure such that and are absolutely continuous with respect to i.e. such that , and for probability density functions with respect to defined -almost everywhere. Such a measure, even such a probability measure, always exists, e.g. . Then define the Bhattacharyya measure on by

It does not depend on the measure , for if we choose a measure such that and an other measure choice are absolutely continuous i.e. and , then

,

and similarly for . We then have

.

We finally define the Bhattacharyya coefficient

.

By the above, the quantity does not depend on , and by the Cauchy inequality . Using , and ,

Gaussian case

Let , , where is the normal distribution with mean and variance ; then

.

And in general, given two multivariate normal distributions ,

,

where [6] Note that the first term is a squared Mahalanobis distance.

Properties

and .

does not obey the triangle inequality, though the Hellinger distance does.

Bounds on Bayes error

The Bhattacharyya distance can be used to upper and lower bound the Bayes error rate:

where and is the posterior probability.[7]

Applications

The Bhattacharyya coefficient quantifies the "closeness" of two random statistical samples.

Given two sequences from distributions , bin them into buckets, and let the frequency of samples from in bucket be , and similarly for , then the sample Bhattacharyya coefficient is

which is an estimator of . The quality of estimation depends on the choice of buckets; too few buckets would overestimate , while too many would underestimate.

A common task in classification is estimating the separability of classes. Up to a multiplicative factor, the squared Mahalanobis distance is a special case of the Bhattacharyya distance when the two classes are normally distributed with the same variances. When two classes have similar means but significantly different variances, the Mahalanobis distance would be close to zero, while the Bhattacharyya distance would not be.

The Bhattacharyya coefficient is used in the construction of polar codes.[8]

The Bhattacharyya distance is used in feature extraction and selection,[9] image processing,[10] speaker recognition,[11] phone clustering,[12] and in genetics.[13]

See also

References

  1. ^ Dodge, Yadolah (2003). The Oxford Dictionary of Statistical Terms. Oxford University Press. ISBN 978-0-19-920613-1.
  2. ^ a b Sen, Pranab Kumar (1996). "Anil Kumar Bhattacharyya (1915-1996): A Reverent Remembrance". Calcutta Statistical Association Bulletin. 46 (3–4): 151–158. doi:10.1177/0008068319960301. S2CID 164326977.
  3. ^ a b c Bhattacharyya, A. (1946). "On a Measure of Divergence between Two Multinomial Populations". Sankhyā. 7 (4): 401–406. JSTOR 25047882.
  4. ^ a b Bhattacharyya, A (1942). "On discrimination and divergence". Proceedings of the Indian Science Congress. Asiatic Society of Bengal.
  5. ^ a b Bhattacharyya, A. (March 1943). "On a measure of divergence between two statistical populations defined by their probability distributions". Bulletin of the Calcutta Mathematical Society. 35: 99–109. MR 0010358.
  6. ^ Kashyap, Ravi (2019). "The Perfect Marriage and Much More: Combining Dimension Reduction, Distance Measures and Covariance". Physica A: Statistical Mechanics and Its Applications. 536: 120938. arXiv:1603.09060. Bibcode:2019PhyA..53620938K. doi:10.1016/j.physa.2019.04.174.
  7. ^ Devroye, L., Gyorfi, L. & Lugosi, G. A Probabilistic Theory of Pattern Recognition. Discrete Appl Math 73, 192–194 (1997).
  8. ^ Arıkan, Erdal (July 2009). "Channel polarization: A method for constructing capacity-achieving codes for symmetric binary-input memoryless channels". IEEE Transactions on Information Theory. 55 (7): 3051–3073. arXiv:0807.3917. doi:10.1109/TIT.2009.2021379. S2CID 889822.
  9. ^ Euisun Choi, Chulhee Lee, "Feature extraction based on the Bhattacharyya distance", Pattern Recognition, Volume 36, Issue 8, August 2003, Pages 1703–1709
  10. ^ François Goudail, Philippe Réfrégier, Guillaume Delyon, "Bhattacharyya distance as a contrast parameter for statistical processing of noisy optical images", JOSA A, Vol. 21, Issue 7, pp. 1231−1240 (2004)
  11. ^ Chang Huai You, "An SVM Kernel With GMM-Supervector Based on the Bhattacharyya Distance for Speaker Recognition", Signal Processing Letters, IEEE, Vol 16, Is 1, pp. 49-52
  12. ^ Mak, B., "Phone clustering using the Bhattacharyya distance", Spoken Language, 1996. ICSLP 96. Proceedings., Fourth International Conference on, Vol 4, pp. 2005–2008 vol.4, 3−6 Oct 1996
  13. ^ Chattopadhyay, Aparna; Chattopadhyay, Asis Kumar; B-Rao, Chandrika (2004-06-01). "Bhattacharyya's distance measure as a precursor of genetic distance measures". Journal of Biosciences. 29 (2): 135–138. doi:10.1007/BF02703410. ISSN 0973-7138. PMID 15295209.
  1. ^ Nielsen, Frank; Boltz, Sylvain (2011). "The Burbea-Rao and Bhattacharyya Centroids". IEEE Transactions on Information Theory. 57 (8): 5455–5466. arXiv:1004.5049. doi:10.1109/TIT.2011.2159046. ISSN 0018-9448. S2CID 14238708.
  2. ^ Kailath, T. (1967). "The Divergence and Bhattacharyya Distance Measures in Signal Selection". IEEE Transactions on Communications. 15 (1): 52–60. doi:10.1109/TCOM.1967.1089532. ISSN 0096-2244.
  3. ^ Djouadi, A.; Snorrason, O.; Garber, F.D. (1990). "The quality of training sample estimates of the Bhattacharyya coefficient". IEEE Transactions on Pattern Analysis and Machine Intelligence. 12 (1): 92–97. doi:10.1109/34.41388.

Read other articles:

Stasiun Kasei禾生駅Stasiun Kasei, Juni 2009Lokasi524-3 Furukawado, Tsuru-shi, Yamanashi-kenJepangKoordinat35°34′30″N 138°55′45″E / 35.57500°N 138.92917°E / 35.57500; 138.92917Koordinat: 35°34′30″N 138°55′45″E / 35.57500°N 138.92917°E / 35.57500; 138.92917Ketinggian710 meterOperator Fuji KyukoJalur■ Jalur FujikyukoLetak5.6 km dari ŌtsukiJumlah peron1 peron sampingJumlah jalur1Informasi lainStatusTanpa stafKode stasiu...

 

Shelley Moore Capito Portrait officiel de Shelley Moore Capito (2015). Fonctions Sénatrice des États-Unis En fonction depuis le 3 janvier 2015(9 ans, 3 mois et 10 jours) Élection 4 novembre 2014 Réélection 3 novembre 2020 Circonscription Virginie-Occidentale Législature 114e, 115e, 116e, 117e et 118e Groupe politique Républicain Prédécesseur Jay Rockefeller Représentante des États-Unis 3 janvier 2001 – 3 janvier 2015(14 ans) Élection 7 novembre 2000 Réélec...

 

Concrete that is manufactured in a batch plant, according to a set engineered mix design This article needs additional citations for verification. Please help improve this article by adding citations to reliable sources. Unsourced material may be challenged and removed.Find sources: Ready-mix concrete – news · newspapers · books · scholar · JSTOR (November 2009) (Learn how and when to remove this template message) This article is written like a persona...

N

This article is about the letter of the alphabet. For other uses, see N (disambiguation). 14th letter of the Latin alphabet NN nUsageWriting systemLatin scriptTypeAlphabetic and LogographicLanguage of originLatin languagePhonetic usage[n][ŋ][ɲ][ɳ][nˠ][ⁿ][◌̃]/ɛn/Unicode codepointU+004E, U+006EAlphabetical position14HistoryDevelopment Ν ν𐌍N nTime period~-700 to presentDescendants • ₦ • Ƞ • Ŋ • ɧ • ʩ...

 

Zambian daily newspaper This article relies excessively on references to primary sources. Please improve this article by adding secondary or tertiary sources. Find sources: Times of Zambia – news · newspapers · books · scholar · JSTOR (February 2009) (Learn how and when to remove this message) The Times of Zambia is a national daily newspaper published in Zambia and headquartered in Ndola. During the colonial period the newspaper was known firstly as T...

 

This article is about the men's team. For the women's team, see Belize women's national cricket team. BelizeAssociationBelize National Cricket AssociationPersonnelCaptainKenton YoungInternational Cricket CouncilICC statusAssociate member[1] (2017)ICC regionAmericasICC Rankings Current[2] Best-everT20I 72nd 44th (2 May 2019)International cricketFirst international British Honduras v. MCC (Belize City; 4 April 1960)Twenty20 InternationalsFirst T20Iv  Mexico at Reforma...

Protein-coding gene in the species Homo sapiens ATG7IdentifiersAliasesATG7, APG7-LIKE, APG7L, GSA7, autophagy related 7, SCAR31External IDsOMIM: 608760 MGI: 1921494 HomoloGene: 4662 GeneCards: ATG7 Gene location (Human)Chr.Chromosome 3 (human)[1]Band3p25.3Start11,272,309 bp[1]End11,557,665 bp[1]Gene location (Mouse)Chr.Chromosome 6 (mouse)[2]Band6|6 E3Start114,620,058 bp[2]End114,837,575 bp[2]RNA expression patternBgeeHumanMouse (ortholog)T...

 

土库曼斯坦总统土库曼斯坦国徽土库曼斯坦总统旗現任谢尔达尔·别尔德穆哈梅多夫自2022年3月19日官邸阿什哈巴德总统府(Oguzkhan Presidential Palace)機關所在地阿什哈巴德任命者直接选举任期7年,可连选连任首任萨帕尔穆拉特·尼亚佐夫设立1991年10月27日 土库曼斯坦土库曼斯坦政府与政治 国家政府 土库曼斯坦宪法 国旗 国徽 国歌 立法機關(英语:National Council of Turkmenistan) ...

 

Peta lokasi Sominot Sominot adalah munisipalitas yang terletak di provinsi Zamboanga del Sur, Filipina. Sominot terbagi menjadi 18 barangay. Bag-ong Baroy Bag-ong Oroquieta Barubuhan Bulanay Datagan Eastern Poblacion Lantawan Libertad Lumangoy New Carmen Picturan Poblacion Rizal San Miguel Santo Niño Sawa Tungawan Upper Sicpao Pranala luar Philippine Standard Geographic Code Diarsipkan 2012-04-13 di Wayback Machine. 2000 Philippine Census Information lbs Provinsi Zamboanga SelatanMunisipalit...

Olympic rowing event Men's coxed pairat the Games of the XXIV OlympiadA race during the competition; the United States team is in the foregroundVenueMisari RegattaDates20–25 SeptemberCompetitors42 from 14 nationsWinning time6:58.79Medalists Carmine AbbagnaleGiuseppe AbbagnaleGiuseppe Di Capua (cox) Italy Mario StreitDetlef KirchhoffRené Rensch (cox) East Germany Andy HolmesSteve RedgravePatrick Sweeney (cox) Great Britain← 19841992 → Rowing a...

 

German author, translator and publisher Zoë BeckBorn12 March 1975 (1975-03-12) (age 49)Lahn-Dill-Kreis, GermanyOccupationWriter, publisher, translator, dubbing directorNotable worksNormale Menschen, Fade to Black, Ein zufriedener Mann. Erzählungen, A Contented Man and Other StoriesWebsitezoebeck.blog Zoë Beck (born 12 March 1975 as Henrike Heiland in Ehringshausen in the Lahn-Dill district[1]) is a German writer, publisher, translator, dialogue book author and dubbing dir...

 

2010 single by Adele Rolling in the DeepSingle by Adelefrom the album 21 B-sideIf It Hadn't Been for LoveReleased29 November 2010 (2010-11-29)Recorded2010StudioEastcote (London, England)GenreRhythm and bluessoulLength3:48LabelXLColumbiaSongwriter(s)Adele AdkinsPaul EpworthProducer(s)Paul EpworthAdele singles chronology Water and a Flame (2009) Rolling in the Deep (2010) Someone like You (2011) Music videoRolling in the Deep on YouTube Rolling in the Deep is a song by English si...

Artikel ini membutuhkan penyuntingan lebih lanjut mengenai tata bahasa, gaya penulisan, hubungan antarparagraf, nada penulisan, atau ejaan. Anda dapat membantu untuk menyuntingnya. Ovulasi terjadi pada saat ditengah siklus menstruasi. setelah fase folikel. Fase ini dipengaruhi oleh hormon luteinizing hormone (LH) dan follicle-stimulating hormone (FSH) Pada diagram dapat dilihat adanya perubahan hormonal pada saat ovulasi Ovulasi adalah proses yang terjadi pada siklus menstruasi perempuan. Sel...

 

UK statutory authority This article is about the United Kingdom's Gambling Commission. For other jurisdictions, see Gaming Control Board. UKGC redirects here. For UCKG, see Universal Church of the Kingdom of God. Gambling CommissionRoyal Coat of Arms of the United Kingdom as used by HM GovernmentAgency overviewFormed1 September 2007; 16 years ago (2007-09-01)Preceding agencyGaming BoardTypeExecutive non-departmental public bodyJurisdictionGreat BritainHeadquartersVictoria Sq...

 

Cycling race 2015 Grand Prix Cycliste de Montréal2015 UCI World Tour, race 26 of 28Race detailsDates13 September 2015Stages1Distance205.7 km (127.8 mi)Winning time5h 20' 09Results  Winner  Tim Wellens (BEL) (Lotto–Soudal)  Second  Adam Yates (GBR) (Orica–GreenEDGE)  Third  Rui Costa (POR) (Lampre–Merida)← 2014 2016 → The 2015 Grand Prix Cycliste de Montréal was the sixth edition of the Grand Prix Cycliste de...

Viện Khoa học Xã hội Nhân văn Quân sựQuân đội Nhân dân Việt NamQuân kỳQuân hiệuChỉ huyĐại tá, PGS. TS Hoàng Văn PhaiQuốc gia Việt NamThành lập29 tháng 4 năm 1999; 25 năm trước (1999-04-29)Phân cấpViện Nghiên cứu (Nhóm 5)Nhiệm vụNghiên cứu, phát triển, ứng dụng kiến thức xã hội nhân văn vào lĩnh vực quân sựQuy mô200 ngườiBộ phận củaHọc viện Chính trịBộ chỉ huyBa Đình, ...

 

Disambiguazione – Marconi rimanda qui. Se stai cercando altri significati, vedi Marconi (disambigua) o Guglielmo Marconi (disambigua). Guglielmo MarconiGuglielmo Marconi nel 1908 Senatore del Regno d'ItaliaDurata mandato30 aprile 1914 –20 luglio 1937 Tipo nominaCategoria: 20 Sito istituzionale Dati generaliPartito politicoPartito Nazionale Fascista Professionescienziato, imprenditore Firma Premio Nobel per la fisica 1909Guglielmo Giovanni Maria Marconi ...

 

تُعد السياحة في كرواتيا واحدة من أكثر [1] الوجهات السياحية في العالم جاذبية فهي تحتل المركز 18 على مستوى العالم من حيث الشعبية. وتمتاز كرواتيا بتنوع وتعدد منتجعاتها التي تقدم العديد من خدمات العلاج والرفاهية، كما تحتوي على معالم السياحة الجماعية إلى الأسواق المتخصصة وم�...

Sleeping and breathing disorder Medical conditionObstructive sleep apneaOther namesObstructive sleep apnoeaObstructive sleep apnea: As soft tissue falls to the back of the throat, it impedes the passage of air (blue arrows) through the trachea.SpecialtySleep medicine Obstructive sleep apnea (OSA) is the most common sleep-related breathing disorder and is characterized by recurrent episodes of complete or partial obstruction of the upper airway leading to reduced or absent breathing during sle...

 

1965 studio album by Billy PrestonEarly Hits of 1965Studio album by Billy PrestonReleased15 December 1965RecordedMarch–September, 1965GenreSoulLength24:20LabelVee-JayVJLP/VJS 1145ProducerSteve DouglasCompilerBilly PrestonBilly Preston chronology The Most Exciting Organ Ever(1965) Early Hits of 1965(1965) Wildest Organ in Town!(1966) Early Hits of 1965, subtitled A Million Dollars Worth of Music!!! Played by the Greatest Organist Ever, is an album by Billy Preston performing soul arr...