Directional statistics

Directional statistics (also circular statistics or spherical statistics) is the subdiscipline of statistics that deals with directions (unit vectors in Euclidean space, Rn), axes (lines through the origin in Rn) or rotations in Rn. More generally, directional statistics deals with observations on compact Riemannian manifolds including the Stiefel manifold.

The overall shape of a protein can be parameterized as a sequence of points on the unit sphere. Shown are two views of the spherical histogram of such points for a large collection of protein structures. The statistical treatment of such data is in the realm of directional statistics.[1]

The fact that 0 degrees and 360 degrees are identical angles, so that for example 180 degrees is not a sensible mean of 2 degrees and 358 degrees, provides one illustration that special statistical methods are required for the analysis of some types of data (in this case, angular data). Other examples of data that may be regarded as directional include statistics involving temporal periods (e.g. time of day, week, month, year, etc.), compass directions, dihedral angles in molecules, orientations, rotations and so on.

Circular distributions

Any probability density function (pdf) on the line can be "wrapped" around the circumference of a circle of unit radius.[2] That is, the pdf of the wrapped variable is

This concept can be extended to the multivariate context by an extension of the simple sum to a number of sums that cover all dimensions in the feature space: where is the -th Euclidean basis vector.

The following sections show some relevant circular distributions.

von Mises circular distribution

The von Mises distribution is a circular distribution which, like any other circular distribution, may be thought of as a wrapping of a certain linear probability distribution around the circle. The underlying linear probability distribution for the von Mises distribution is mathematically intractable; however, for statistical purposes, there is no need to deal with the underlying linear distribution. The usefulness of the von Mises distribution is twofold: it is the most mathematically tractable of all circular distributions, allowing simpler statistical analysis, and it is a close approximation to the wrapped normal distribution, which, analogously to the linear normal distribution, is important because it is the limiting case for the sum of a large number of small angular deviations. In fact, the von Mises distribution is often known as the "circular normal" distribution because of its ease of use and its close relationship to the wrapped normal distribution.[3]

The pdf of the von Mises distribution is: where is the modified Bessel function of order 0.

Circular uniform distribution

The probability density function (pdf) of the circular uniform distribution is given by

It can also be thought of as of the von Mises above.

Wrapped normal distribution

The pdf of the wrapped normal distribution (WN) is: where μ and σ are the mean and standard deviation of the unwrapped distribution, respectively and is the Jacobi theta function: where and

Wrapped Cauchy distribution

The pdf of the wrapped Cauchy distribution (WC) is: where is the scale factor and is the peak position.

Wrapped Lévy distribution

The pdf of the wrapped Lévy distribution (WL) is: where the value of the summand is taken to be zero when , is the scale factor and is the location parameter.

Projected normal distribution

The projected normal distribution is a circular distribution representing the direction of a random variable with multivariate normal distribution, obtained by radial projection of the variable over the unit (n-1)-sphere. Due to this, and unlike other commonly used circular distributions, it is not symmetric nor unimodal.

Distributions on higher-dimensional manifolds

Three points sets sampled from different Kent distributions on the sphere.

There also exist distributions on the two-dimensional sphere (such as the Kent distribution[4]), the N-dimensional sphere (the von Mises–Fisher distribution[5]) or the torus (the bivariate von Mises distribution[6]).

The matrix von Mises–Fisher distribution[7] is a distribution on the Stiefel manifold, and can be used to construct probability distributions over rotation matrices.[8]

The Bingham distribution is a distribution over axes in N dimensions, or equivalently, over points on the (N − 1)-dimensional sphere with the antipodes identified.[9] For example, if N = 2, the axes are undirected lines through the origin in the plane. In this case, each axis cuts the unit circle in the plane (which is the one-dimensional sphere) at two points that are each other's antipodes. For N = 4, the Bingham distribution is a distribution over the space of unit quaternions (versors). Since a versor corresponds to a rotation matrix, the Bingham distribution for N = 4 can be used to construct probability distributions over the space of rotations, just like the Matrix-von Mises–Fisher distribution.

These distributions are for example used in geology,[10] crystallography[11] and bioinformatics.[1] [12] [13]

Moments

The raw vector (or trigonometric) moments of a circular distribution are defined as

where is any interval of length , is the PDF of the circular distribution, and . Since the integral is unity, and the integration interval is finite, it follows that the moments of any circular distribution are always finite and well defined.

Sample moments are analogously defined:

The population resultant vector, length, and mean angle are defined in analogy with the corresponding sample parameters.

In addition, the lengths of the higher moments are defined as:

while the angular parts of the higher moments are just . The lengths of all moments will lie between 0 and 1.

Measures of location and spread

Various measures of central tendency and statistical dispersion may be defined for both the population and a sample drawn from that population.[3]

Central tendency

The most common measure of location is the circular mean. The population circular mean is simply the first moment of the distribution while the sample mean is the first moment of the sample. The sample mean will serve as an unbiased estimator of the population mean.

When data is concentrated, the median and mode may be defined by analogy to the linear case, but for more dispersed or multi-modal data, these concepts are not useful.

Dispersion

The most common measures of circular spread are:

  • The circular variance. For the sample the circular variance is defined as: and for the population Both will have values between 0 and 1.
  • The circular standard deviation with values between 0 and infinity. This definition of the standard deviation (rather than the square root of the variance) is useful because for a wrapped normal distribution, it is an estimator of the standard deviation of the underlying normal distribution. It will therefore allow the circular distribution to be standardized as in the linear case, for small values of the standard deviation. This also applies to the von Mises distribution which closely approximates the wrapped normal distribution. Note that for small , we have .
  • The circular dispersion with values between 0 and infinity. This measure of spread is found useful in the statistical analysis of variance.

Distribution of the mean

Given a set of N measurements the mean value of z is defined as:

which may be expressed as

where

or, alternatively as:

where

The distribution of the mean angle () for a circular pdf P(θ) will be given by:

where is over any interval of length and the integral is subject to the constraint that and are constant, or, alternatively, that and are constant.

The calculation of the distribution of the mean for most circular distributions is not analytically possible, and in order to carry out an analysis of variance, numerical or mathematical approximations are needed.[14]

The central limit theorem may be applied to the distribution of the sample means. (main article: Central limit theorem for directional statistics). It can be shown[14] that the distribution of approaches a bivariate normal distribution in the limit of large sample size.

Goodness of fit and significance testing

For cyclic data – (e.g., is it uniformly distributed) :

See also

References

  1. ^ a b Hamelryck, Thomas; Kent, John T.; Krogh, Anders (2006). "Hamelryck, T., Kent, J., Krogh, A. (2006) Sampling realistic protein conformations using local structural bias. PLoS Comput. Biol., 2(9): e131". PLOS Computational Biology. 2 (9): e131. Bibcode:2006PLSCB...2..131H. doi:10.1371/journal.pcbi.0020131. PMC 1570370. PMID 17002495.
  2. ^ Bahlmann, C., (2006), Directional features in online handwriting recognition, Pattern Recognition, 39
  3. ^ a b Fisher 1993.
  4. ^ Kent, J (1982) The Fisher–Bingham distribution on the sphere[permanent dead link]. J Royal Stat Soc, 44, 71–80.
  5. ^ Fisher, RA (1953) Dispersion on a sphere. Proc. Roy. Soc. London Ser. A., 217, 295–305
  6. ^ Mardia, KM. Taylor; CC; Subramaniam, GK. (2007). "Protein Bioinformatics and Mixtures of Bivariate von Mises Distributions for Angular Data". Biometrics. 63 (2): 505–512. doi:10.1111/j.1541-0420.2006.00682.x. PMID 17688502. S2CID 14293602.
  7. ^ Pal, Subhadip; Sengupta, Subhajit; Mitra, Riten; Banerjee, Arunava (September 2020). "Conjugate Priors and Posterior Inference for the Matrix Langevin Distribution on the Stiefel Manifold". Bayesian Analysis. 15 (3): 871–908. doi:10.1214/19-BA1176. ISSN 1936-0975. S2CID 209974627.
  8. ^ Downs (1972). "Orientational statistics". Biometrika. 59 (3): 665–676. doi:10.1093/biomet/59.3.665.
  9. ^ Bingham, C. (1974). "An Antipodally Symmetric Distribution on the Sphere". Ann. Stat. 2 (6): 1201–1225. doi:10.1214/aos/1176342874.
  10. ^ Peel, D.; Whiten, WJ.; McLachlan, GJ. (2001). "Fitting mixtures of Kent distributions to aid in joint set identification" (PDF). J. Am. Stat. Assoc. 96 (453): 56–63. doi:10.1198/016214501750332974. S2CID 11667311.
  11. ^ Krieger Lassen, N. C.; Juul Jensen, D.; Conradsen, K. (1994). "On the statistical analysis of orientation data". Acta Crystallogr. A50 (6): 741–748. Bibcode:1994AcCrA..50..741K. doi:10.1107/S010876739400437X.
  12. ^ Kent, J.T., Hamelryck, T. (2005). Using the Fisher–Bingham distribution in stochastic models for protein structure Archived 2024-01-20 at the Wayback Machine. In S. Barber, P.D. Baxter, K.V.Mardia, & R.E. Walls (Eds.), Quantitative Biology, Shape Analysis, and Wavelets, pp. 57–60. Leeds, Leeds University Press
  13. ^ Boomsma, Wouter; Mardia, Kanti V.; Taylor, Charles C.; Ferkinghoff-Borg, Jesper; Krogh, Anders; Hamelryck, Thomas (2008). "A generative, probabilistic model of local protein structure". Proceedings of the National Academy of Sciences. 105 (26): 8932–8937. Bibcode:2008PNAS..105.8932B. doi:10.1073/pnas.0801715105. PMC 2440424. PMID 18579771.
  14. ^ a b Jammalamadaka & Sengupta 2001.

Books on directional statistics

Read other articles:

2007 DreamWorks Animation film This article is about the film. For the video game based on the film, see Bee Movie Game. Barry Benson redirects here. For the Mississippi politician, see Barry W. Benson. Not to be confused with Maya the Bee (film) or B movie. Bee MovieTheatrical release posterDirected by Simon J. Smith Steve Hickner Written by Jerry Seinfeld Spike Feresten Barry Marder Andy Robin Produced by Jerry Seinfeld Christina Steinberg Starring Jerry Seinfeld Renée Zellweger Matthew Br...

 

Konsulat Jenderal Republik Indonesia di VancouverConsulate General of the Republic of Indonesia in Vancouver Koordinat49°17′25″N 123°07′55″W / 49.290268°N 123.132037°W / 49.290268; -123.132037Lokasi Vancouver, KanadaAlamat1630 Alberni StreetVancouver, British Columbia, KanadaYurisdiksi Daftar Alberta British Columbia Wilayah Barat Laut Yukon Konsul JenderalHendra HalimSitus webkemlu.go.id/vancouver/id Konsulat Jenderal Republik Indonesia di Vancouver (KJRI ...

 

Geostationary communications satellite BulgariaSat-1BulgariaSat-1 launches on a Falcon 9Mission typeCommunicationsOperatorBulgaria SatCOSPAR ID2017-038A SATCAT no.42801Websitewww.bulgariasat.comMission durationPlanned: 15+ years Elapsed: 6 years, 9 months, 12 days Spacecraft propertiesBusSSL 1300[1]ManufacturerSSL[1]Launch mass3,669 kg (8,089 lb)[1]Power10 kW[2] Start of missionLaunch date23 June 2017, 19:10 (23 June 2017, 19:10)&...

العلاقات الأوزبكستانية البوروندية أوزبكستان بوروندي   أوزبكستان   بوروندي تعديل مصدري - تعديل   العلاقات الأوزبكستانية البوروندية هي العلاقات الثنائية التي تجمع بين أوزبكستان وبوروندي.[1][2][3][4][5] مقارنة بين البلدين هذه مقارنة عامة ومرجعي...

 

كوستارازيون   الإحداثيات 40°26′09″N 21°19′51″E / 40.43583333°N 21.33083333°E / 40.43583333; 21.33083333   تقسيم إداري  البلد اليونان[1]  عدد السكان  عدد السكان 609 (2021)0 (2001)903 (1991)742 (2011)  رمز جيونيمز 734951  تعديل مصدري - تعديل   كوستارازيون (Κωσταράζιον) هي مدينة في كاستوري�...

 

Синелобый амазон Научная классификация Домен:ЭукариотыЦарство:ЖивотныеПодцарство:ЭуметазоиБез ранга:Двусторонне-симметричныеБез ранга:ВторичноротыеТип:ХордовыеПодтип:ПозвоночныеИнфратип:ЧелюстноротыеНадкласс:ЧетвероногиеКлада:АмниотыКлада:ЗавропсидыКласс:Пт�...

追晉陸軍二級上將趙家驤將軍个人资料出生1910年 大清河南省衛輝府汲縣逝世1958年8月23日(1958歲—08—23)(47—48歲) † 中華民國福建省金門縣国籍 中華民國政党 中國國民黨获奖 青天白日勳章(追贈)军事背景效忠 中華民國服役 國民革命軍 中華民國陸軍服役时间1924年-1958年军衔 二級上將 (追晉)部队四十七師指挥東北剿匪總司令部參謀長陸軍�...

 

Irish footballer (born 1989) For people of a similar name, see James McLean. James McClean McClean with the Republic of Ireland in 2013Personal informationFull name James Joseph McClean[1]Date of birth (1989-04-22) 22 April 1989 (age 35)[2]Place of birth Derry, Northern IrelandHeight 1.80 m (5 ft 11 in)[3][4]Position(s) WingerTeam informationCurrent team WrexhamNumber 23Youth career Trojans InstituteSenior career*Years Team Apps (Gls)2007–...

 

LGBT films By decade 1896–1959 1895–1919 1920s 1930s 1940s 1950s 1960s 1960 1961 1962 1963 1964 1965 1966 1967 1968 1969 1970s 1970 1971 1972 1973 1974 1975 1976 1977 1978 1979 1980s 1980 1981 1982 1983 1984 1985 1986 1987 1988 1989 1990s 1990 1991 1992 1993 1994 1995 1996 1997 1998 1999 2000s 2000 2001 2002 2003 2004 2005 2006 2007 2008 2009 2010s 2010 2011 2012 2013 2014 2015 2016 2017 2018 2019 2020s 2020 2021 2022 2023 2024 vte This is a list of lesbian, gay, bisexual or transgender-...

Estonian ballet dancer and ballet master Helmi Puur1954 Swan Lake debut, photo from the Estonian National Archives, with Artur KoitBorn(1933-12-20)20 December 1933Tallinn, EstoniaDied6 July 2014(2014-07-06) (aged 80)Tallinn, EstoniaNationalityEstonianOther namesHelmi KiikOccupationballerinaYears active1953-2010 Helmi Puur (20 December 1933 – 6 July 2014) was an Estonian prima ballerina, dance master and coach. One of the first ballerinas to study in the Tallinn Ballet Sch...

 

العلاقات الكاميرونية المجرية الكاميرون المجر   الكاميرون   المجر تعديل مصدري - تعديل   العلاقات الكاميرونية المجرية هي العلاقات الثنائية التي تجمع بين الكاميرون والمجر.[1][2][3][4][5] مقارنة بين البلدين هذه مقارنة عامة ومرجعية للدولتين: وجه ال...

 

For the communities in the United States, see Dunnville, Kentucky and Dunnville, Wisconsin. Unincorporated community in Ontario, CanadaDunnvilleUnincorporated communitySt. Michael's Catholic ChurchMotto: Grand Living in a Great TownDunnvilleShow map of OntarioDunnvilleShow map of CanadaCoordinates: 42°54′10″N 79°37′00″W / 42.90278°N 79.61667°W / 42.90278; -79.61667CountryCanadaProvinceOntarioCountyHaldimandIncorporated as Village of DunnvilleJanuary 1,...

Russian author and revolutionary (1812–1870) Alexander HerzenPortrait of Herzen by Nikolai Ge (1867)BornAleksandr Ivanovich Herzen6 April 1812 (1812-04-06)Moscow, Moskovsky Uyezd, Moscow Governorate, Russian EmpireDied21 January 1870 (1870-01-22) (aged 57)Paris, FranceAlma materMoscow UniversityEra19th-century philosophyRegionRussian philosophySchoolWesternizersAgrarian populismMain interestsPolitics, economics, class struggleNotable ideasAgrarianism Signature Alexander ...

 

Polish banker Ivan Bliokh redirects here. Not to be confused with Iwan Bloch. Jan Gotlib BlochИван Станиславович БлиохBloch, c. 1902Born(1836-07-24)July 24, 1836Radom, PolandDiedJanuary 7, 1902(1902-01-07) (aged 65)Warsaw, PolandSpouseEmilia Julia Kronenberg h. Koroniec Jan Gotlib (Bogumił) Bloch (Russian: Иван Станиславович Блиох or Блох) (July 24, 1836 – January 7, 1902) was a Polish banker and railway financier who devoted his private...

 

American consumer products company This article contains content that is written like an advertisement. Please help improve it by removing promotional content and inappropriate external links, and by adding encyclopedic content written from a neutral point of view. (September 2018) (Learn how and when to remove this message) Newell Brands Inc.Formerly Newell Company (1903–1999) Newell Rubbermaid (1999–2016) Company typePublicTraded asNasdaq: NWLS&P 600 componentIndustryConsumer g...

つるしま のあ鶴嶋 乃愛 横浜国際映画祭にて(2023年5月3日)プロフィール愛称 のあにゃん、姫、にゃんこ生年月日 2001年5月24日現年齢 23歳出身地 日本高知県高知市[1]血液型 B型[2][3]公称サイズ([4]時点)身長 / 体重 164 cm / ― kgスリーサイズ 78 - 58 - 84.5 cm靴のサイズ 23.5 cm 単位系換算身長 / 体重5′ 5″ / ― lbスリーサイズ31 - 23 - 33 in活動デビュー 2...

 

Italian cardinal Giovanni Vincenzo Acquaviva d'Aragona Giovanni Vincenzo Acquaviva d'Aragona (born between 1490 and 1495 in Naples in Italy, died 16 August 1546 in Itri) was a Cardinal of the Roman Catholic Church. He became bishop of Melfi and Rapolla in 1537. Life Belonging to an illustrious and powerful noble family from the south, Giovanni Vincenzo Acquaviva d'Aragona was born in Naples, the son of Andrea Matteo III Acquaviva d'Aragona, eighth duke of Atri, 15th Count of Conversano and Co...

 

إسماعيل كامل باشا معلومات شخصية تاريخ الميلاد سنة 1795   الوفاة سنة 1822 (26–27 سنة)  شندي  مواطنة الدولة العثمانية  الأب محمد علي باشا  عائلة الأسرة العلوية  تعديل مصدري - تعديل   إسماعيل باشا بن محمد علي باشا. ثالث أبناء محمد علي باشا، وقائد الحملة التي جردها �...

German painter Ernst Wilhelm Nay (right). Ernst Wilhelm Nay (June 11, 1902 – April 8, 1968) was a German painter and graphic designer of classical modernism. He is considered one of the most important painters of German post-war art. Biography Nay came from a Berlin civil servant's family.[1] He was born the second son of six children. His father Johannes Nay fell in 1914 as a captain in Belgium. Nay completed his humanistic education with the Abitur at the provincial school Pforta ...

 

Geographical and cultural region of Southern Italy Irpinia (Modern Latin Hirpinia) is a geographical and cultural region of Southern Italy. It was the inland territory of the ancient Hirpini tribe, and its extent matches approximately today's province of Avellino. A typical landscape of Irpinia Geography The territory is largely mountainous, with an intricate network of hills and valleys and a predominantly limestone Karst topography. To the north-east, however, the rocks are mostly sandstone...