Rich-club coefficient

The rich-club coefficient is a metric on graphs and networks, designed to measure the extent to which well-connected nodes also connect to each other. Networks which have a relatively high rich-club coefficient are said to demonstrate the rich-club effect and will have many connections between nodes of high degree. The rich-club coefficient was first introduced in 2004 in a paper studying Internet topology.[1][2]

The "Rich-club" effect has been measured and noted on scientific collaboration networks and air transportation networks. It has been shown to be significantly lacking on protein interaction networks.

Definition

Non-normalized form

The rich-club coefficient was first introduced as an unscaled metric parametrized by node degree ranks.[1] More recently, this has been updated to be parameterized in terms of node degrees k, indicating a degree cut-off. The rich-club coefficient for a given network N is then defined as:

(1)

[3][4][5]

where is the number of edges between the nodes of degree greater than or equal to k, and is the number of nodes with degree greater than or equal to k. This measures how many edges are present between nodes of degree at least k, normalized by how many edges there could be between these nodes in a complete graph. When this value is close to 1 for values of k close to , it is interpreted that high degree nodes of the network are well connected. The associated subgraph of nodes with degree at least k is also called the "Rich Club" graph.

Normalized for topology randomization

A criticism of the above metric is that it does not necessarily imply the existence of the rich-club effect, as it is monotonically increasing even for random networks. In certain degree distributions, it is not possible to avoid connecting high degree hubs. To account for this, it is necessary to compare the above metric to the same metric on a degree distribution preserving randomized version of the network. This updated metric is defined as:

(2)

where is the rich-club metric on a maximally randomized network with the same degree distribution of the network under study. This new ratio discounts unavoidable structural correlations that are a result of the degree distribution, giving a better indicator of the significance of the rich-club effect.

For this metric, if for certain values of k we have , this denotes the presence of the rich-club effect.

Generalizations

General richness properties

The natural definition of a node's "richness" is its number of neighbours. If instead we replace this with a generic richness metric on nodes r, then we can rewrite the unscaled Rich-Club coefficient as:

(3)

Where we are instead considering the sub graph on only nodes with a richness measure of at least r. For example, on scientific collaboration networks, replacing the degree richness (number of coauthors) with a strength richness (number of published papers), the topology of the rich club graph changes dramatically.

Assortativity

The Assortativity of a network is a measurement of how connected similar nodes are, where similarity is typically viewed in terms of node degree. Rich-club can be viewed as a more specific notation of assortativity, where we are only concerned with the connectivity of nodes beyond a certain richness metric. For example, if a network consisted of a collection of hub and spokes, where the hubs were well connected, such a network would be considered disassortative. However, due to the strong connectedness of the hubs in the network, the network would demonstrate the rich-club effect.

An example of a network which is both disassortative and demonstrates the Rich Club effect. The red nodes are hubs and form the "Rich Club."

Applications

The rich-club coefficient of a network is useful as a heuristic measurement of the robustness of a network. A high rich-club coefficient implies that the hubs are well connected, and global connectivity is resilient to any one hub being removed. It is also useful for verifying theories that generalize to other networks. For example, the consistent observation of high rich-club coefficients for scientific collaboration networks adds evidence to the theory that within social groups, the elite tend to associate with one another.

Implementations

The rich-club coefficient has been implemented in NetworkX, a Python library for network analysis. This implementation includes both the non-normalized and normalized forms as described above.

See also

References

  1. ^ a b Zhou, Shi & Mondragón, Raúl J. (2004). "The Rich-Club Phenomenon In The Internet Topology". IEEE Communications Letters. 8 (3): 180–182. arXiv:cs/0308036. doi:10.1109/lcomm.2004.823426. S2CID 7007263.
  2. ^ Mattia Gasparini, Javier Luis Canovas Izquierdo, Robert Clariso, Marco Brambilla, Jordi Cabot: Analyzing Rich-Club Behavior in Open Source Projects. OpenSym 2019 proceedings
  3. ^ Colizza, V. and Flammini, A. and Serrano, M. A. and Vespignani, A. (2006). "Detecting rich-club ordering in complex networks". Nature Physics. 2. 2 (2): 110–115. arXiv:physics/0602134. Bibcode:2006NatPh...2..110C. doi:10.1038/nphys209. S2CID 2418153.{{cite journal}}: CS1 maint: multiple names: authors list (link)
  4. ^ McAuley, Julian J. and da Fontoura Costa, Luciano and Caetano, Tibério S. (2007). "Rich-club phenomenon across complex network hierarchies". Applied Physics Letters. 91 (8): 084103. arXiv:physics/0701290. Bibcode:2007ApPhL..91h4103M. doi:10.1063/1.2773951. S2CID 16544534.{{cite journal}}: CS1 maint: multiple names: authors list (link)
  5. ^ Opsahl, Tore; Colizza, Vittoria; Panzarasa, Pietro; Ramasco, José J. (2008). "Prominence and Control: The Weighted Rich-Club Effect". Physical Review Letters. 101 (16): 168702. arXiv:0804.0417. Bibcode:2008PhRvL.101p8702O. doi:10.1103/physrevlett.101.168702. PMID 18999722. S2CID 29349737.