2 Introduction To Network Analysis-Network Structure - Jasmine Mondolo
2 Introduction To Network Analysis-Network Structure - Jasmine Mondolo
analysis
Jasmine Mondolo
[email protected]
Network structure/topology: an introduction
• It defines the way different nodes are placed and linked to each other and the
emerging overall patterns
• Each network has its own topology or structure which affects the actions and
capabilities of the nodes at the local level, the “cost” for travelling throughout the
network (e.g., easiness of people’s movements in a transportation network, easiness
of water’s flow in a hydraulic network) and the diffusion/ contagion of a
phenomenon (e.g., idea, disease), which is also influenced by the node(s) from which it
starts
Degree Distribution
• The larger the range of degrees, the more the degree distribution
approximates a curve
• It measures how many edges exist versus how many edges there could
possibly be
• It consists in the ratio between the number of “realized” edges and the number
of all the possible edges it is a measure of network connectiveness
⋕ 𝑒𝑑𝑔𝑒𝑠
𝑑𝑒𝑛𝑠𝑖𝑡𝑦 =
⋕ 𝑝𝑜𝑠𝑠𝑖𝑏𝑙𝑒 𝑒𝑑𝑔𝑒𝑠
About 43% of all the possible edges exist/taking two nodes at random, the
probability of an existing edge between them is about 43%
Density
• Another example: two networks with the same nodes but different density
• Degree Centrality
• Closeness Centrality
• Betweenness Centrality
• Local Clustering Coefficient
• Network-level metrics
• Magnitude/Scale
• Density
• Diameter
• Average degree and average weighted degree
• Average (shortest) path length
• Average/global clustering coefficient
• Modularity «cluster-level» metrics
Node-level measures
• Local (first-order) centrality measures they account only for the “nearest
neighbourood” of a node
• Degree centrality
• Local clustering coefficient
• Other measures (e.g., average neighbours’ degree)
DCD =5
DCR = 9
DCF = 3
DCB =4
• It can be practical to normalise this value (for comparing networks of different node-
size but also for an easier interpretation using N-1 (all the other nodes) as normalized
factor range 0 -1
• High levels of local clustering coefficients are often associated with low
levels of degree centrality
• The average of the local clustering coefficients of all the nodes in the
network is a useful structural measure
Closeness Centrality
• Closeness centrality measures how close (in terms of topological distance) a node
is to every other node in the network
• It can be regarded as a measure of how long it will take to spread something (e.g.,
information or a disease) from the node of interest to all the other nodes
sequentially
• It is often calculated as the average of the shortest path lengths (d) from one
σ𝑛
𝑗=1 𝑑 𝑖,𝑗
node (i) to every other node (j) in the network 𝐶𝐶 𝑖 = (lower values,
𝑛−1
higher centrality)
• It captures how frequently a node (i) lies on the shortest paths (G) linking
pairs of the other n-1 nodes
• Density
• Diameter how many steps it takes for the two most distant nodes in the network to
reach one another
• Average weighted degree the average degree that accounts for the weights of the
nodes’ edges
• Average (shortest) path length/average geodesic distance the average of the shortest
paths how many steps it takes, on average, to get from one node to another one; it is
sometimes regarded as a measure of communication efficiency
Cluster-level metrics
• Modularity
Modularity
• Random networks
• Regular networks
• «Small worlds»
Random network
• The average shortest path length tends to be short -like random networks-
considering the network magnitude (generally very significant)
• Examples: several social networks, neural networks, the power grid networks… (When
Facebook had 721 million users, some researchers calculated that the average shortest path was
equal to just 4.74)
Centralized or scale-free network
• It is very heterogeneous and unequal in terms of how connected and influential the
different nodes in the network are it has a very uneven/skewed/asymmetric
degree distribution (there is a power relationship between the degree a node has and
the frequency of its occurrence)
• Network basics
• Definition of network structure and some basic examples
• Degree distribution
• Main network metrics at the node level, network level and cluster
level
• One-mode vs two-mode networks
• Main types of network
• Things to consider when building a network
• Main software used for performing network anaysis