Quantities of information

A misleading[1] information diagram showing additive and subtractive relationships among Shannon's basic quantities of information for correlated variables and . The area contained by both circles is the joint entropy . The circle on the left (red and violet) is the individual entropy , with the red being the conditional entropy . The circle on the right (blue and violet) is , with the blue being . The violet is the mutual information .

The mathematical theory of information is based on probability theory and statistics, and measures information with several quantities of information. The choice of logarithmic base in the following formulae determines the unit of information entropy that is used. The most common unit of information is the bit, or more correctly the shannon,[2] based on the binary logarithm. Although bit is more frequently used in place of shannon, its name is not distinguished from the bit as used in data processing to refer to a binary value or stream regardless of its entropy (information content). Other units include the nat, based on the natural logarithm, and the hartley, based on the base 10 or common logarithm.

In what follows, an expression of the form is considered by convention to be equal to zero whenever is zero. This is justified because for any logarithmic base.[3]

Self-information

Shannon derived a measure of information content called the self-information or "surprisal" of a message :

where is the probability that message is chosen from all possible choices in the message space . The base of the logarithm only affects a scaling factor and, consequently, the units in which the measured information content is expressed. If the logarithm is base 2, the measure of information is expressed in units of shannons or more often simply "bits" (a bit in other contexts is rather defined as a "binary digit", whose average information content is at most 1 shannon).

Information from a source is gained by a recipient only if the recipient did not already have that information to begin with. Messages that convey information over a certain (P=1) event (or one which is known with certainty, for instance, through a back-channel) provide no information, as the above equation indicates. Infrequently occurring messages contain more information than more frequently occurring messages.

It can also be shown that a compound message of two (or more) unrelated messages would have a quantity of information that is the sum of the measures of information of each message individually. That can be derived using this definition by considering a compound message providing information regarding the values of two random variables M and N using a message which is the concatenation of the elementary messages m and n, each of whose information content are given by and respectively. If the messages m and n each depend only on M and N, and the processes M and N are independent, then since (the definition of statistical independence) it is clear from the above definition that .

An example: The weather forecast broadcast is: "Tonight's forecast: Dark. Continued darkness until widely scattered light in the morning." This message contains almost no information. However, a forecast of a snowstorm would certainly contain information since such does not happen every evening. There would be an even greater amount of information in an accurate forecast of snow for a warm location, such as Miami. The amount of information in a forecast of snow for a location where it never snows (impossible event) is the highest (infinity).

Entropy

The entropy of a discrete message space is a measure of the amount of uncertainty one has about which message will be chosen. It is defined as the average self-information of a message from that message space:

where

denotes the expected value operation.

An important property of entropy is that it is maximized when all the messages in the message space are equiprobable (e.g. ). In this case .

Sometimes the function is expressed in terms of the probabilities of the distribution:

where each and

An important special case of this is the binary entropy function:

Joint entropy

The joint entropy of two discrete random variables and is defined as the entropy of the joint distribution of and :

If and are independent, then the joint entropy is simply the sum of their individual entropies.

(Note: The joint entropy should not be confused with the cross entropy, despite similar notations.)

Conditional entropy (equivocation)

Given a particular value of a random variable , the conditional entropy of given is defined as:

where is the conditional probability of given .

The conditional entropy of given , also called the equivocation of about is then given by:

This uses the conditional expectation from probability theory.

A basic property of the conditional entropy is that:

Kullback–Leibler divergence (information gain)

The Kullback–Leibler divergence (or information divergence, information gain, or relative entropy) is a way of comparing two distributions, a "true" probability distribution , and an arbitrary probability distribution . If we compress data in a manner that assumes is the distribution underlying some data, when, in reality, is the correct distribution, Kullback–Leibler divergence is the number of average additional bits per datum necessary for compression, or, mathematically,

It is in some sense the "distance" from to , although it is not a true metric due to its not being symmetric.

Mutual information (transinformation)

It turns out that one of the most useful and important measures of information is the mutual information, or transinformation. This is a measure of how much information can be obtained about one random variable by observing another. The mutual information of relative to (which represents conceptually the average amount of information about that can be gained by observing ) is given by:

A basic property of the mutual information is that:

That is, knowing , we can save an average of bits in encoding compared to not knowing . Mutual information is symmetric:


Mutual information can be expressed as the average Kullback–Leibler divergence (information gain) of the posterior probability distribution of given the value of to the prior distribution on :

In other words, this is a measure of how much, on the average, the probability distribution on will change if we are given the value of . This is often recalculated as the divergence from the product of the marginal distributions to the actual joint distribution:

Mutual information is closely related to the log-likelihood ratio test in the context of contingency tables and the multinomial distribution and to Pearson's χ2 test: mutual information can be considered a statistic for assessing independence between a pair of variables, and has a well-specified asymptotic distribution.

Differential entropy

The basic measures of discrete entropy have been extended by analogy to continuous spaces by replacing sums with integrals and probability mass functions with probability density functions. Although, in both cases, mutual information expresses the number of bits of information common to the two sources in question, the analogy does not imply identical properties; for example, differential entropy may be negative.

The differential analogies of entropy, joint entropy, conditional entropy, and mutual information are defined as follows:

where is the joint density function, and are the marginal distributions, and is the conditional distribution.

See also

References

  1. ^ D.J.C. Mackay (2003). Information theory, inferences, and learning algorithms. Bibcode:2003itil.book.....M.: 141 
  2. ^ Stam, A.J. (1959). "Some inequalities satisfied by the quantities of information of Fisher and Shannon". Information and Control. 2 (2): 101–112. doi:10.1016/S0019-9958(59)90348-1.
  3. ^ "Three approaches to the definition of the concept "quantity of information"" (PDF).

Read other articles:

Institute of Physics and TechnologyФизико-технологический институтFormer namesPhysical Engineering faculty of USTU-UPITypeTechnical instituteEstablished1949DirectorVladimir IvanovStudents2000Location Yekaterinburg, Sverdlovsk Oblast, RussiaCampusurbanAffiliationsUral Federal UniversityWebsitehttps://fizteh.urfu.ru/ru/, http://ustu.ru/en/home/faculties/fti/ Institute of Physics and Technology (IPT) is one of leading institutions of Ural Federal University. IPT was ...

 

Compsa Compsa albopicta Klasifikasi ilmiah Kerajaan: Animalia Filum: Arthropoda Kelas: Insecta Ordo: Coleoptera Famili: Cerambycidae Genus: Compsa Compsa adalah genus kumbang tanduk panjang yang tergolong famili Cerambycidae. Genus ini juga merupakan bagian dari ordo Coleoptera, kelas Insecta, filum Arthropoda, dan kingdom Animalia. Larva kumbang dalam genus ini biasanya mengebor ke dalam kayu dan dapat menyebabkan kerusakan pada batang kayu hidup atau kayu yang telah ditebang. Referensi TIT...

 

Mohammad Reza Khanzadeh Piala Dunia 2014: Iran vs. AngolaInformasi pribadiNama lengkap Mohammad Reza Khanzadeh[1]Tanggal lahir 11 Mei 1991 (umur 32)Tempat lahir Tehran, IranTinggi 186 cm (6 ft 1 in)[1]Posisi bermain BekInformasi klubKlub saat ini PadidehNomor 3Karier senior*Tahun Tim Tampil (Gol)2017 – Padideh 20 (4)Tim nasional2012 – Iran 11 (1) * Penampilan dan gol di klub senior hanya dihitung dari liga domestik Mohammad Reza Khanzadeh (lahir 11 Mei...

This article is about the 1999 film. For other uses, see Florentine (disambiguation). 1999 American filmThe FlorentineTheatrical release posterDirected byNick StaglianoWritten byTom BensonDamien GrayProduced byFrancis Ford CoppolaNick StaglianoSteven WeismanStarringJeremy DaviesMichael MadsenChris PennLuke PerryTom SizemoreVirginia MadsenMary Stuart MastersonHal HolbrookBurt YoungJames BelushiCinematographyStephen KazmierskiEdited byPlummy TuckerMusic byMarco BeltramiProductioncompaniesAmeric...

 

This article needs additional citations for verification. Please help improve this article by adding citations to reliable sources. Unsourced material may be challenged and removed.Find sources: Barry Island Pleasure Park – news · newspapers · books · scholar · JSTOR (April 2023) (Learn how and when to remove this template message) Amusement park in Glamorgan, Wales Barry Island Pleasure ParkPreviously known as The New Evesham Pleasure Park (1929–195...

 

GreenGT H24, a hydrogen-powered racing car competing in the 2022 Road to Le Mans. The issue of environmentalism in motorsport surrounds the whole of auto racing to reduce its carbon dioxide emissions contributing to global warming. Initial reception The first series to respond to the call to make motorsport more environmentally friendly was the International Formula Master series, who planned to use a petrol–electric hybrid and regenerative braking systems in their cars for the 2007 se...

Памятник культуры Малопольского воеводства[1]: регистрационный номер А7 ДостопримечательностьЧасовня СигизмундаKaplica Zygmuntowska Часо́вня Сигизму́нда 50°03′16″ с. ш. 19°56′08″ в. д.HGЯO Страна  Польша Краков Краков и Дзельница I Старе-Място Конфессия католиче...

 

Questa voce sull'argomento calciatori messicani è solo un abbozzo. Contribuisci a migliorarla secondo le convenzioni di Wikipedia. Segui i suggerimenti del progetto di riferimento. Jorge Castañeda Reyes Nazionalità  Messico Altezza 175 cm Peso 75 kg Calcio Ruolo Centrocampista Termine carriera 2000 Carriera Squadre di club1 1987-1996 Atlas? (?)1997 Colorado Rapids24 (0)1999 Cruz Azul4 (0)2000 Tecos de la UAG1 (0) Nazionale 1990-1992 Messico? (?)1992 Messico&#...

 

Magnesium iodida Nama Nama IUPAC Magnesium iodida Penanda Nomor CAS 10377-58-9 (anhidrat) Y75535-11-4 (heksahidrat) N7790-31-0 (oktahidrat) N Model 3D (JSmol) Gambar interaktif 3DMet {{{3DMet}}} ChemSpider 59700 N Nomor EC PubChem CID 66322 Nomor RTECS {{{value}}} UNII W74QE3H320 N CompTox Dashboard (EPA) DTXSID6065056 InChI InChI=1S/2HI.Mg/h2*1H;/q;;+2/p-2 NKey: BLQJIBCZHWBKSL-UHFFFAOYSA-L NInChI=1/2HI.Mg/h2*1H;/q;;+2/p-2Key: BLQJI...

土库曼斯坦总统土库曼斯坦国徽土库曼斯坦总统旗現任谢尔达尔·别尔德穆哈梅多夫自2022年3月19日官邸阿什哈巴德总统府(Oguzkhan Presidential Palace)機關所在地阿什哈巴德任命者直接选举任期7年,可连选连任首任萨帕尔穆拉特·尼亚佐夫设立1991年10月27日 土库曼斯坦土库曼斯坦政府与政治 国家政府 土库曼斯坦宪法 国旗 国徽 国歌 立法機關(英语:National Council of Turkmenistan) ...

 

Village near Longford town, Ireland This article may require cleanup to meet Wikipedia's quality standards. The specific problem is: Excessive replication of content from other sources. Excessive links of loosely related sources. Please help improve this article if you can. (April 2013) (Learn how and when to remove this message) Village in Leinster, IrelandMoydow Maigh DumhaVillageMoydow (Castlerea) CastleMoydowLocation in IrelandCoordinates: 53°39′50″N 7°47′17″W / ...

 

莎拉·阿什頓-西里洛2023年8月,阿什頓-西里洛穿著軍服出生 (1977-07-09) 1977年7月9日(46歲) 美國佛羅里達州国籍 美國别名莎拉·阿什頓(Sarah Ashton)莎拉·西里洛(Sarah Cirillo)金髮女郎(Blonde)职业記者、活動家、政治活動家和候選人、軍醫活跃时期2020年—雇主內華達州共和黨候選人(2020年)《Political.tips》(2020年—)《LGBTQ國度》(2022年3月—2022年10月)烏克蘭媒�...

American college football season 1889 Princeton Tigers footballNational championConferenceIndependentRecord10–0Head coachNoneCaptainEdgar Allan PoeSeasons← 18881890 → 1889 Eastern college football independents records vte Conf Overall Team W   L   T W   L   T Princeton   –   10 – 0 – 0 Massachusetts   –   2 – 0 – 0 Yale   –   15 – 1 – 0 Harvard   – &#...

 

Disambiguazione – Se stai cercando l'omonima edizione di pallanuoto maschile, vedi A1 Ethniki 2014-2015 (pallanuoto maschile). A1 Ethniki 2014-2015Dettagli della competizioneSport Pallacanestro OrganizzatoreA1 Ethniki Federazione HEBA Periodo12 ottobre 2014 —14 giugno 2015 Squadre14 VerdettiCampione Olympiakos(11º titolo) Retrocessioni Panelefsiniakos Paniōnios Non ammesse allastagione successiva Drama MVP Aleksandăr Vezenkov Miglior allenatore Giannīs Sfairopoulo...

 

17th-century play sometimes attributed to Shakespeare For other uses, see Puritan (disambiguation). Title page of the 1607 quarto The Puritan, or the Widow of Watling Street, also known as The Puritan Widow, is an anonymous Jacobean stage comedy, first published in 1607. It is often attributed to Thomas Middleton, but also belongs to the Shakespeare Apocrypha due to its title page attribution to W.S.. Date and authorship The Puritan probably dates from the year 1606. Some of its incidents are...

Cet article est une ébauche concernant le radioamateurisme. Vous pouvez partager vos connaissances en l’améliorant (comment ?) selon les recommandations des projets correspondants. La bande 7 MHz désignée aussi par sa longueur d'onde, 40 mètres, est une bande du service radioamateur destinée à établir des radiocommunications de loisir. Cette bande est utilisable de jour pour le trafic radio régional et national. Cette bande est utilisable pour les radiocommunications intercon...

 

Rulers of Junagarh State in the British Raj Nawab Bhadur Khan III in 1885, with officials Nawab of Junagarh or Junagadh refers to the now defunct ex-lineage of rulers of the princely Junagarh State in British Raj, nowadays Junagadh district in the state of Gujarat in India.[citation needed] There are still several forts and palaces in India which were owned by princely Junagarh family but after Partition of India, this property was claimed by the Indian Government.[1][2 ...

 

Traditional Japanese music This article needs additional citations for verification. Please help improve this article by adding citations to reliable sources. Unsourced material may be challenged and removed.Find sources: Min'yō – news · newspapers · books · scholar · JSTOR (March 2010) (Learn how and when to remove this message) A Japanese folkswoman with her shamisen, 1904 Min'yō (民謡), Nihon min'yō, Japanese min'yō or Japanese folk music is a...

Irish-American cardinal His EminenceJohn Murphy FarleyCardinal Archbishop of New YorkSeeNew YorkAppointedSeptember 15, 1902Term endedSeptember 17, 1918PredecessorMichael CorriganSuccessorPatrick Joseph HayesOther post(s)Cardinal-Priest of S. Maria sopra MinervaOrdersOrdinationJune 11, 1870by Costantino Patrizi NaroConsecrationDecember 21, 1895by Michael CorriganCreated cardinalNovember 27, 1911by Pius XRankCardinal-PriestPersonal detailsBorn(1842-04-20)April 20, 1842Newtownhamilton,...

 

Untuk kegunaan lain, lihat Brownis. BrowniesSutradaraHanung BramantyoProduserLeo SutantoSkenario Hanung Bramantyo Salman Aristo Eric Sasono Cerita Salman Aristo Lina Nurmalina Pemeran Marcella Zalianty Bucek Phillip Yusuf Penata musikDewa BudjanaSinematograferTommy JepangPenyuntingCesa David LuckmansyahPerusahaanproduksiSinemArt PicturesTanggal rilis 9 Desember 2004 (2004-12-09) (Indonesia) Durasi108 menitNegaraIndonesiaBahasaBahasa Indonesia Penghargaan Festival Film Indonesia...