Fisher information metric

In information geometry, the Fisher information metric[1] is a particular Riemannian metric which can be defined on a smooth statistical manifold, i.e., a smooth manifold whose points are probability distributions. It can be used to calculate the distance between probability distributions.[2]

The metric is interesting in several aspects. By Chentsov’s theorem, the Fisher information metric on statistical models is the only Riemannian metric (up to rescaling) that is invariant under sufficient statistics.[3][4]

It can also be understood to be the infinitesimal form of the relative entropy (i.e., the Kullback–Leibler divergence); specifically, it is the Hessian of the divergence. Alternately, it can be understood as the metric induced by the flat space Euclidean metric, after appropriate changes of variable. When extended to complex projective Hilbert space, it becomes the Fubini–Study metric; when written in terms of mixed states, it is the quantum Bures metric.[clarification needed]

Considered purely as a matrix, it is known as the Fisher information matrix. Considered as a measurement technique, where it is used to estimate hidden parameters in terms of observed random variables, it is known as the observed information.

Definition

Given a statistical manifold with coordinates , one writes for the likelihood, that is the probability density of x as a function of . Here is drawn from the value space R for a (discrete or continuous) random variable X. The likelihood is normalized over but not : .

The Fisher information metric then takes the form:[clarification needed]

The integral is performed over all values x in R. The variable is now a coordinate on a Riemann manifold. The labels j and k index the local coordinate axes on the manifold.

When the probability is derived from the Gibbs measure, as it would be for any Markovian process, then can also be understood to be a Lagrange multiplier; Lagrange multipliers are used to enforce constraints, such as holding the expectation value of some quantity constant. If there are n constraints holding n different expectation values constant, then the dimension of the manifold is n dimensions smaller than the original space. In this case, the metric can be explicitly derived from the partition function; a derivation and discussion is presented there.

Substituting from information theory, an equivalent form of the above definition is:

To show that the equivalent form equals the above definition note that

and apply on both sides.

Examples

The Fisher information metric is particularly simple for the exponential family, which has The metric is The metric has a particularly simple form if we are using the natural parameters. In this case, , so the metric is just .

Normal distribution

Multivariate normal distribution Let be the precision matrix.

The metric splits to a mean part and a precision/variance part, because . The mean part is the precision matrix: . The precision part is .

In particular, for single variable normal distribution, . Let , then . This is the Poincaré half-plane model.

The shortest paths (geodesics) between two univariate normal distributions are either parallel to the axis, or half circular arcs centered on the -axis.

The geodesic connecting has formula where , and the arc-length parametrization is .

Relation to the Kullback–Leibler divergence

Alternatively, the metric can be obtained as the second derivative of the relative entropy or Kullback–Leibler divergence.[5] To obtain this, one considers two probability distributions and , which are infinitesimally close to one another, so that

with an infinitesimally small change of in the j direction. Then, since the Kullback–Leibler divergence has an absolute minimum of 0 when , one has an expansion up to second order in of the form

.

The symmetric matrix is positive (semi) definite and is the Hessian matrix of the function at the extremum point . This can be thought of intuitively as: "The distance between two infinitesimally close points on a statistical differential manifold is the informational difference between them."

Relation to Ruppeiner geometry

The Ruppeiner metric and Weinhold metric are the Fisher information metric calculated for Gibbs distributions as the ones found in equilibrium statistical mechanics.[6][7]

Change in free entropy

The action of a curve on a Riemannian manifold is given by

The path parameter here is time t; this action can be understood to give the change in free entropy of a system as it is moved from time a to time b.[7] Specifically, one has

as the change in free entropy. This observation has resulted in practical applications in chemical and processing industry[citation needed]: in order to minimize the change in free entropy of a system, one should follow the minimum geodesic path between the desired endpoints of the process. The geodesic minimizes the entropy, due to the Cauchy–Schwarz inequality, which states that the action is bounded below by the length of the curve, squared.

Relation to the Jensen–Shannon divergence

The Fisher metric also allows the action and the curve length to be related to the Jensen–Shannon divergence.[7] Specifically, one has

where the integrand dJSD is understood to be the infinitesimal change in the Jensen–Shannon divergence along the path taken. Similarly, for the curve length, one has

That is, the square root of the Jensen–Shannon divergence is just the Fisher metric (divided by the square root of 8).

As Euclidean metric

For a discrete probability space, that is, a probability space on a finite set of objects, the Fisher metric can be understood to simply be the Euclidean metric restricted to a positive orthant (e.g. "quadrant" in ) of a unit sphere, after appropriate changes of variable.[8]

Consider a flat, Euclidean space, of dimension N+1, parametrized by points . The metric for Euclidean space is given by

where the are 1-forms; they are the basis vectors for the cotangent space. Writing as the basis vectors for the tangent space, so that

,

the Euclidean metric may be written as

The superscript 'flat' is there to remind that, when written in coordinate form, this metric is with respect to the flat-space coordinate .

An N-dimensional unit sphere embedded in (N + 1)-dimensional Euclidean space may be defined as

This embedding induces a metric on the sphere, it is inherited directly from the Euclidean metric on the ambient space. It takes exactly the same form as the above, taking care to ensure that the coordinates are constrained to lie on the surface of the sphere. This can be done, e.g. with the technique of Lagrange multipliers.

Consider now the change of variable . The sphere condition now becomes the probability normalization condition

while the metric becomes

The last can be recognized as one-fourth of the Fisher information metric. To complete the process, recall that the probabilities are parametric functions of the manifold variables , that is, one has . Thus, the above induces a metric on the parameter manifold:

or, in coordinate form, the Fisher information metric is:

where, as before,

The superscript 'fisher' is present to remind that this expression is applicable for the coordinates ; whereas the non-coordinate form is the same as the Euclidean (flat-space) metric. That is, the Fisher information metric on a statistical manifold is simply (four times) the Euclidean metric restricted to the positive orthant of the sphere, after appropriate changes of variable.

When the random variable is not discrete, but continuous, the argument still holds. This can be seen in one of two different ways. One way is to carefully recast all of the above steps in an infinite-dimensional space, being careful to define limits appropriately, etc., in order to make sure that all manipulations are well-defined, convergent, etc. The other way, as noted by Gromov,[8] is to use a category-theoretic approach; that is, to note that the above manipulations remain valid in the category of probabilities. Here, one should note that such a category would have the Radon–Nikodym property, that is, the Radon–Nikodym theorem holds in this category. This includes the Hilbert spaces; these are square-integrable, and in the manipulations above, this is sufficient to safely replace the sum over squares by an integral over squares.

As Fubini–Study metric

The above manipulations deriving the Fisher metric from the Euclidean metric can be extended to complex projective Hilbert spaces. In this case, one obtains the Fubini–Study metric.[9] This should perhaps be no surprise, as the Fubini–Study metric provides the means of measuring information in quantum mechanics. The Bures metric, also known as the Helstrom metric, is identical to the Fubini–Study metric,[9] although the latter is usually written in terms of pure states, as below, whereas the Bures metric is written for mixed states. By setting the phase of the complex coordinate to zero, one obtains exactly one-fourth of the Fisher information metric, exactly as above.

One begins with the same trick, of constructing a probability amplitude, written in polar coordinates, so:

Here, is a complex-valued probability amplitude; and are strictly real. The previous calculations are obtained by setting . The usual condition that probabilities lie within a simplex, namely that

is equivalently expressed by the idea the square amplitude be normalized:

When is real, this is the surface of a sphere.

The Fubini–Study metric, written in infinitesimal form, using quantum-mechanical bra–ket notation, is

In this notation, one has that and integration over the entire measure space X is written as

The expression can be understood to be an infinitesimal variation; equivalently, it can be understood to be a 1-form in the cotangent space. Using the infinitesimal notation, the polar form of the probability above is simply

Inserting the above into the Fubini–Study metric gives:

Setting in the above makes it clear that the first term is (one-fourth of) the Fisher information metric. The full form of the above can be made slightly clearer by changing notation to that of standard Riemannian geometry, so that the metric becomes a symmetric 2-form acting on the tangent space. The change of notation is done simply replacing and and noting that the integrals are just expectation values; so:

The imaginary term is a symplectic form, it is the Berry phase or geometric phase. In index notation, the metric is:

Again, the first term can be clearly seen to be (one fourth of) the Fisher information metric, by setting . Equivalently, the Fubini–Study metric can be understood as the metric on complex projective Hilbert space that is induced by the complex extension of the flat Euclidean metric. The difference between this, and the Bures metric, is that the Bures metric is written in terms of mixed states.

Continuously-valued probabilities

A slightly more formal, abstract definition can be given, as follows.[10]

Let X be an orientable manifold, and let be a measure on X. Equivalently, let be a probability space on , with sigma algebra and probability .

The statistical manifold S(X) of X is defined as the space of all measures on X (with the sigma-algebra held fixed). Note that this space is infinite-dimensional, and is commonly taken to be a Fréchet space. The points of S(X) are measures.

Pick a point and consider the tangent space . The Fisher information metric is then an inner product on the tangent space. With some abuse of notation, one may write this as

Here, and are vectors in the tangent space; that is, . The abuse of notation is to write the tangent vectors as if they are derivatives, and to insert the extraneous d in writing the integral: the integration is meant to be carried out using the measure over the whole space X. This abuse of notation is, in fact, taken to be perfectly normal in measure theory; it is the standard notation for the Radon–Nikodym derivative.

In order for the integral to be well-defined, the space S(X) must have the Radon–Nikodym property, and more specifically, the tangent space is restricted to those vectors that are square-integrable. Square integrability is equivalent to saying that a Cauchy sequence converges to a finite value under the weak topology: the space contains its limit points. Note that Hilbert spaces possess this property.

This definition of the metric can be seen to be equivalent to the previous, in several steps. First, one selects a submanifold of S(X) by considering only those measures that are parameterized by some smoothly varying parameter . Then, if is finite-dimensional, then so is the submanifold; likewise, the tangent space has the same dimension as .

With some additional abuse of language, one notes that the exponential map provides a map from vectors in a tangent space to points in an underlying manifold. Thus, if is a vector in the tangent space, then is the corresponding probability associated with point (after the parallel transport of the exponential map to .) Conversely, given a point , the logarithm gives a point in the tangent space (roughly speaking, as again, one must transport from the origin to point ; for details, refer to original sources). Thus, one has the appearance of logarithms in the simpler definition, previously given.

See also

Notes

  1. ^ Nielsen, Frank (2023). "A Simple Approximation Method for the Fisher–Rao Distance between Multivariate Normal Distributions". Entropy. 25 (4): 654. arXiv:2302.08175. Bibcode:2023Entrp..25..654N. doi:10.3390/e25040654. PMC 10137715. PMID 37190442.
  2. ^ Atkinson, Colin (1981). "Rao's Distance Measure". Sankhyā: The Indian Journal of Statistics, Series A.
  3. ^ Amari, Shun-ichi; Nagaoka, Horishi (2000). "Chentsov's theorem and some historical remarks". Methods of Information Geometry. New York: Oxford University Press. pp. 37–40. ISBN 0-8218-0531-2.
  4. ^ Dowty, James G. (2018). "Chentsov's theorem for exponential families". Information Geometry. 1 (1): 117–135. arXiv:1701.08895. doi:10.1007/s41884-018-0006-4. S2CID 5954036.
  5. ^ Cover, Thomas M.; Thomas, Joy A. (2006). Elements of Information Theory (2nd ed.). Hoboken: John Wiley & Sons. ISBN 0-471-24195-4.
  6. ^ Brody, Dorje; Hook, Daniel (2008). "Information geometry in vapour-liquid equilibrium". Journal of Physics A. 42 (2): 023001. arXiv:0809.1166. doi:10.1088/1751-8113/42/2/023001. S2CID 118311636.
  7. ^ a b c Crooks, Gavin E. (2009). "Measuring thermodynamic length". Physical Review Letters. 99 (10): 100602. arXiv:0706.0559. doi:10.1103/PhysRevLett.99.100602. PMID 17930381. S2CID 7527491.
  8. ^ a b Gromov, Misha (2013). "In a search for a structure, Part 1: On entropy". European Congress of Mathematics. Zürich: European Mathematical Society. pp. 51–78. doi:10.4171/120-1/4. ISBN 978-3-03719-120-0. MR 3469115.
  9. ^ a b Facchi, Paolo; et al. (2010). "Classical and Quantum Fisher Information in the Geometrical Formulation of Quantum Mechanics". Physics Letters A. 374 (48): 4801–4803. arXiv:1009.5219. Bibcode:2010PhLA..374.4801F. doi:10.1016/j.physleta.2010.10.005. S2CID 55558124.
  10. ^ Itoh, Mitsuhiro; Shishido, Yuichi (2008). "Fisher information metric and Poisson kernels" (PDF). Differential Geometry and Its Applications. 26 (4): 347–356. doi:10.1016/j.difgeo.2007.11.027. hdl:2241/100265.

References

  • Feng, Edward H.; Crooks, Gavin E. (2009). "Far-from-equilibrium measurements of thermodynamic length". Physical Review E. 79 (1 Pt 1): 012104. arXiv:0807.0621. Bibcode:2009PhRvE..79a2104F. doi:10.1103/PhysRevE.79.012104. PMID 19257090. S2CID 8210246.
  • Shun'ichi Amari (1985) Differential-geometrical methods in statistics, Lecture Notes in Statistics, Springer-Verlag, Berlin.
  • Shun'ichi Amari, Hiroshi Nagaoka (2000) Methods of information geometry, Translations of mathematical monographs; v. 191, American Mathematical Society.
  • Paolo Gibilisco, Eva Riccomagno, Maria Piera Rogantin and Henry P. Wynn, (2009) Algebraic and Geometric Methods in Statistics, Cambridge U. Press, Cambridge.

Read other articles:

This article is about the conservation center. For the wild, see wilderness and the bush. For other uses, see Wild. Not to be confused with Muskingum County Animal Farm or The Wilds (TV series). Private, non-profit safari park and conservation center in east-central Ohio The Wilds39°49′46″N 81°43′58″W / 39.82944°N 81.73278°W / 39.82944; -81.73278Date opened1994LocationCumberland, Ohio, United StatesLand area9,154 acres (3,704 ha)No. of animals>300No...

 

Harbi, daerah musuh Darul Harbi merupakan daerah wilayah, atau negara musuh.[1] Istilah ini merujuk pada suatu daerah yang sedang dalam situasi perang di sebuah negara Islam.[1] Rakyat dan pemerintah daerah musuh ini mengancam, memengaruhi, memaksa agar orang-orang Islam di sana meninggalkan agamanya.[1] Kaum Muslim menganggap orang-orang yang berada di daerah Harbi adalah musuh.[1] Maka, kaum Muslim diizinkan untuk melawan dan berperang dengan mereka sampai me...

 

Artikel ini sudah memiliki daftar referensi, bacaan terkait, atau pranala luar, tetapi sumbernya belum jelas karena belum menyertakan kutipan pada kalimat. Mohon tingkatkan kualitas artikel ini dengan memasukkan rujukan yang lebih mendetail bila perlu. (Pelajari cara dan kapan saatnya untuk menghapus pesan templat ini) Ahwil LuthanPotret resmi, c. 2002 Duta Besar Indonesia untuk Meksiko ke-13Masa jabatan30 September 2002 – November 2005PresidenMegawati SoekarnoputriSusilo...

Rishi Sunak Portrait officiel de Rishi Sunak en 2022. Fonctions Premier ministre du Royaume-Uni En fonction depuis le 25 octobre 2022(1 an, 5 mois et 8 jours) Monarque Charles III Vice-Premier ministre Dominic RaabOliver Dowden Gouvernement Sunak Législature 58e Coalition Tories Prédécesseur Liz Truss Chef du Parti conservateur En fonction depuis le 24 octobre 2022(1 an, 5 mois et 9 jours) Élection 24 octobre 2022 Prédécesseur Liz Truss Député britanniqu...

 

Colonial States Athletic ConferenceFormerlyPennsylvania Athletic ConferenceAssociationNCAAFounded1992Ceased2023CommissionerMarie Stroman (final)Sports fielded 16 men's: 7 women's: 9 DivisionDivision IIINo. of teams10 (final)HeadquartersVillanova, Pennsylvania, U.S.RegionMid-AtlanticLocations The Colonial States Athletic Conference (CSAC) was a NCAA Division III collegiate athletic conference in the Mid-Atlantic United States that existed from 1992 to 2023. There were nine full member institut...

 

Educational Launch of Nanosatellites (ELaNa) is an initiative created by NASA to attract and retain students in the science, technology, engineering and mathematics disciplines.[1] The program is managed by the Launch Services Program (LSP) at NASA's Kennedy Space Center in Florida. Overview Engineers processing a CubeSat at a facility of Rocket Lab. The ELaNa initiative has made partnerships with universities in the US to design and launch small research satellites called CubeSats (...

Disambiguazione – Se stai cercando l'opera giudaico-cristiana, vedi Oracoli sibillini. I Libri sibillini erano una raccolta di responsi oracolari scritti in lingua greca e conservati nel tempio di Giove Capitolino sul Campidoglio, poi trasferiti da Augusto nel Tempio di Apollo Palatino. Indice 1 Religione romana 2 Note 3 Bibliografia 4 Voci correlate 5 Collegamenti esterni Religione romana La storia della religione romana tramanda di come la Sibilla Cumana (secondo altre fonti la Sibilla E...

 

ZDF

Si ce bandeau n'est plus pertinent, retirez-le. Cliquez ici pour en savoir plus. Cet article ne cite pas suffisamment ses sources (mai 2018). Si vous disposez d'ouvrages ou d'articles de référence ou si vous connaissez des sites web de qualité traitant du thème abordé ici, merci de compléter l'article en donnant les références utiles à sa vérifiabilité et en les liant à la section « Notes et références ». En pratique : Quelles sources sont attendues ? Comme...

 

2023 television miniseries based on the novel by Bonnie Garmus This article is about the television series. For the novel, see Lessons in Chemistry (novel). Lessons in ChemistryGenreHistorical dramaBased onLessons in Chemistryby Bonnie GarmusDeveloped byLee EisenbergStarring Brie Larson Lewis Pullman Aja Naomi King Stephanie Koenig Patrick Walker Theme music composerCarlos Rafael Rivera[1]Country of originUnited StatesOriginal languageEnglishNo. of episodes8ProductionExecutive produce...

1979 single by Cliff Richard Green LightSingle by Cliff Richardfrom the album Green Light B-sideImagine loveReleased16 February 1979 (1979-02-16)[1]Recorded17 & 25 April 1978[1]StudioAbbey Road Studios, London[2]GenrePop rockLength 3:45 (single version) 4:05 (album version) LabelEMISongwriter(s)Alan Tarney[3]Producer(s)Bruce WelchCliff Richard singles chronology Can't Take the Hurt Anymore (1978) Green Light (1979) We Don't Talk Anymore (1979...

 

P3P

Platform for website privacy preferences For the PlayStation Portable video game, see Persona 3 Portable. P3PPlatform for Privacy PreferencesAbbreviationP3PNative namePlatform for Privacy PreferencesStatusRetiredFirst published16 April 2002 (2002-04-16)[1][2]Latest version1.1 [2]CommitteeP3P Specification Working Group[2]Editors Rigo Wenning[2] Matthias Schunter[2] Authors Lorrie Cranor[2] Brooks Dobbs[2] Serge Ege...

 

Grand Prix PrancisSirkuit Paul Ricard(2018–2019, 2021–2022)Informasi lombaJumlah gelaran90Pertama digelar1906Terakhir digelar2022Terbanyak menang (pembalap) Michael Schumacher (8)Terbanyak menang (konstruktor) Ferrari (17)Panjang sirkuit5.842 km (3.630 mi)Jarak tempuh309.690 km (192.432 mi)Lap53Balapan terakhir (2022)Pole position Charles LeclercFerrari1:30.872Podium 1. Max VerstappenRed Bull Racing-RBPT1:30:02.112 2. Lewis HamiltonMercedes+10.587 3. George RussellMer...

Soviet, Russian and American theoretical physicist For his father, physician, see Alexei Ivanovich Abrikosov. In this name that follows Eastern Slavic naming customs, the patronymic is Alexeyevich and the family name is Abrikosov.Alexei AbrikosovАлексей АбрикосовAbrikosov in 2003Born(1928-06-25)June 25, 1928Moscow, Russian SFSR, Soviet UnionDiedMarch 29, 2017(2017-03-29) (aged 88)Palo Alto, California, United StatesCitizenship Soviet Union (1928–1991) Russia (since ...

 

Municipality of Slovakia Location of Levice District in the Nitra Region Bátovce (Hungarian: Bát, pronounced [ˈbaːt], German: Frauenmarkt) is a village and municipality in the Levice District in the Nitra Region of Slovakia. History In historical records the village was first mentioned in 1037 as FORUM REGINE. The second time Batovce was mentioned as MERKATUM REGINE. In 1327 Hungarian King Karoly named Batovce as The Royal City. In the medieval times it was known as The city of Qu...

 

1959 Italian film General della RovereDirected byRoberto RosselliniWritten bySergio AmideiDiego FabbriIndro MontanelliProduced byMoris ErgasStarringVittorio De SicaHannes MessemerCinematographyCarlo CarliniEdited byCesare CavagnaMusic byRenzo RosselliniProductioncompaniesZebra FilmSociété Nouvelle des Etablissements GaumontRelease dates 31 August 1959 (1959-08-31) (Venice International Film Festival) 7 October 1959 (1959-10-07) (Italy) Running time137 m...

لودفيغ الثاني ملك بافاريا (بالألمانية: Ludwig II. von Bayern)‏  معلومات شخصية اسم الولادة (بالألمانية: Ludwig II. Otto Friedrich Wilhelm von Bayern)‏  الميلاد 25 أغسطس 1845(1845-08-25)ميونخ الوفاة 13 يونيو 1886 (40 سنة)بحيرة شتارنبيرغير[1]  سبب الوفاة غرق  مواطنة مملكة بافاريا  الطول 193 سنتيمتر  ا�...

 

No debe confundirse con Antillas Neerlandesas. Compañía Neerlandesa de las Indias Occidentales Geoctroyeerde West-Indische Compagnie Bandera de la compañía La Casa de las Indias Occidentales en el Herenmarkt en Ámsterdam, sede del WIC desde 1623 hasta 1647Acrónimo WICTipo Empresa públicaIndustria ComercioForma legal empresa de capital abiertoFundación 3 de junio de 1621Fundador Willem UsselincxJoannes de LaetDisolución 1792Sede central Ámsterdam Países BajosÁrea de operación Amé...

 

Tournoi Apertura2019 Généralités Sport Football Organisateur(s) FESFUT Édition 43e Date du 27 juillet 2019au 22 décembre 2019 Participants 12 équipes Matchs joués 141 Site web officiel Site officiel Hiérarchie Hiérarchie 1er échelon Niveau inférieur Segunda División Palmarès Tenant du titre CD Águila Vainqueur Alianza FC Deuxième CD FAS Meilleur(s) buteur(s) Nicolás Muñoz (19) Navigation Saison précédente Saison suivante modifier Le Tournoi Apertura 2019 est le quaran...

Australian runner Adam SpencerSpencer at the 2024 NCAA Division I Indoor Track and Field ChampionshipsPersonal informationNationalityAustralianBorn (2001-10-04) 4 October 2001 (age 22)McKinnon, Victoria, AustraliaSportSportMiddle, long-distance runningEvent800 metres – 3000 metresCollege teamWisconsin Badgers Adam Spencer (born 4 October 2001) is an Australian middle and long-distance runner. He competes for the Wisconsin Badgers.[1] At the 2023 London Diamond League, he ran a ...

 

Parts of this article (those related to Luca Visentini) need to be updated. Please help update this article to reflect recent events or newly available information. (December 2022) Global trade union federation International Trade Union ConfederationAbbreviationITUCFormation1 November 2006; 17 years ago (2006-11-01)Merger ofInternational Confederation of Free Trade UnionsWorld Confederation of LabourTypeTrade union centreHeadquartersBrussels, BelgiumMembership (2018) 20...