Course 5-6
Course 5-6
NETWORKS 5-6
Centrality measures
Outline
2
Part 1
Basic Centrality Concepts
Degree Centrality
Betweenness Centrality
Closeness Centrality
Eigenvector Centrality
Centralization
Part 2
Hub and Authorities (HITS Algorithm)
PageRank
PART 1
Basic centrality
Centrality
4
Degree Centrality
Degree Centrality
9
Example:
Degree Centrality
1
1
When to use?
Whom to ask for favor?
People you can talk to
Degree Centrality
1
2
Can be deceiving
Why?
Local measure
1
3
Betweenness Centrality
Betweenness Centrality
1
4
BC of a node 𝑢 is the ratio of the shortest paths
between all other nodes, that pass through node 𝑢
Quantifies the control of a node on the
communication between other nodes
First introduced by Freeman
δ 𝑠t ( 𝑢 )
δ𝑠
𝐶𝐵 ( 𝑢) = ∑ 𝑠 ≠ 𝑣 ≠ 𝑡
t
𝑠 = source
𝑡 = destination
δ𝑠𝑡 = number of shortest paths between (𝑠, 𝑡)
δ𝑠𝑡(𝑢) = number of shortest paths between (𝑠, 𝑡) that pass
through 𝑢
Betweenness Centrality
1
5
Example:
A B C D E
½+½ = 1
Can you figure out why B
D
has betweenness 3.5 while
E has betweenness 0.5?
1
Betweenness Centrality
7
2
Divide by 𝑛−1 for undirected graph
Normalized example:
Betweenness Centrality
1
9
Normalized example:
Red circled node
has low centrality
value. Why?
Green circled node
has high value.
Why?
2
0
Closeness Centrality
Closeness Centrality
2
1
A B C D E
N 1
d(A, j) 1
1
'
Cc (A) j1
1 2 3 4
10
N 1 4
0.4
4
Closeness Centrality
2
3
More example:
Comparison
2
Comparing across 3 centrality values
4
Generally, the 3 types will be positively correlated
When they are not, it tells you something
interesting!
Low Low Closeness Low
Degree Betweenness
Embedded in Ego's
cluster that is far connections are
High Degree
from the rest of the redundant -
network communication
bypasses
him/her
Key player Probably
High tied to multiple paths
Closeness important/acti in the network,
ve alters ego is near
many people,
but so are
many others
Ego's few ties Very rare cell.
Eigenvector Centrality
2
5 Measure of the influence of a node in a network
Connections to high-scoring nodes contribute more
“An important node is connected to important
neighbor”
Google’s PageRank is a variant of eigenvector
centrality
Eigenvector centrality of 𝑣,
𝐶𝐷
network
𝑛 (𝑁
−−𝐶𝐷1)(𝑁
(𝑖)] −
= 2)
theoretically largest such sum of differences in any network of
the same degree
Centralization of Network
2
7
CD = 0.167 CD = 1.0
CD = 0.167
Centralization of Network
2
8
2
How to rank pages
From in-links?
Intuition:
3 1
Implicit endorsement
Single vs aggregate endorsement
Page referred by most preferred
How about out-links
3
3
3 1
4
An example (Kleinberg)
3
4
In-links to pages for the query newspaper
Recursive dependency:
𝑎(𝑣)
Σ 𝑤𝜖𝜖𝜖𝜖𝜖𝑛𝑡[𝑣]
ℎ(𝑣 ℎ(𝑤)
)
Σ 𝑤𝜖𝑐𝑐𝑖𝑐𝑑𝜖𝜖𝑛[𝑣] 𝑎(𝑤
)
Hub-Authority (HITS Algorithm)
3
8
Authority: highly endorsed answers to queries
Hub: high value lists for the query
6 1
3 1
7
4
Hub-Authority (HITS Algorithm)
3
9
starts with all hub and authority scores equal to 1
chooses a number of steps K
performs a sequence of K Authority and Hub updates in this order.
6 1
3 1
7
4
Hub-Authority (HITS Algorithm)
starts with all hub and authority scores equal to 1
chooses a number of steps K
performs a sequence of K Authority and Hub updates in this order.
Problems
Score grows to very large
numbers
Actually converges?
Hub-Authority (HITS Algorithm)
4
1
Problems
Score grows to very large
numbers
normalization
Actually converges?
Equilibrium
Effect of initial values
PageRank
4
2
Keys:
Mode of endorsement form the basis
of PageRank
Starts with simple voting on in-links
Pass endorsement across out-links
Repeated improvement
PageRank (contd.)
4
3
Computation procedure:
1
Each node with initial PageRank 𝑛
A number of steps K
K updates of PageRank values
Each node/page divides it current PageRank
value equally across its out-links
Each page updates its new PageRank value to be
the sum of what it receives
PageRank (contd.)
4
4
Computation procedure:
1
Each node with initial pagerank 8
Step 1: PR(A) = ½*PR(D) + ½*PR(E) + PR(H) +
PR(F) + PR(G)=1/16+1/16+1/8+1/8+1/8=1/2
PageRank (contd.)
4
6
Convergence/equlibrium?
Is there any?
How to check?
PageRank (contd.)
4
7
It’s leaking!
PageRank (contd.)
4
9
Random walk
Choose a page at random
Pick each edge with equal
probability
Follow links for a sequence of k
steps
Pick a random out-links
Follow it to where it leads
5
PageRank: alternate definition
4