14 Link 1
14 Link 1
• Homeworks:
• HW2 (due: 11/08)
• HW3 (will be posted on 11/06)
Link Analysis 1 • Note: Each homework has its own claim session
EE412: Foundation of Big Data Analytics • Textbook vs. slides:
• Prioritize the slides over the textbook.
Recap Outline
• UV Decomposition 1. Web Search as a Graph
• UV Decomposition: Computation 2. PageRank
• UV Decomposition: Variants 3. PageRank: Implementation
n k n
𝑓
✖ 𝑉! k
m R ≈ U 𝑓 𝑦 + 𝛻𝑓(𝑦)
𝑦
Jaemin Yoo 3 Jaemin Yoo 4
Graphs Graph Data: Social Networks
• Data structure that represents connections and relationships.
• Consists of nodes and edges.
• Can be directed or undirected.
• Represented as a sparse adjacency matrix.
Source: GeeksforGeeks
Source: [Backstrom et al., 2011]
CS224W: Computer
I teach a Classes are Science Stanford
class on in the Department University
Networks.
Gates at Stanford
router building
domain3
dog
… raining cats Documents
and dogs ...
1.6
1.6 1.6 1.6 1.6
rj/3 rj/3
• The recursive flow equation can be written as 𝑟 = 𝑀𝑟.
• Then, 𝑀𝑟 $
= 𝑀 𝑐% 𝑥% + 𝑐& 𝑥& + ⋯ + 𝑐' 𝑥' • Since 𝜆" /𝜆% #
= 0 as 𝑘 → ∞, we prove 𝑀 𝑟 # $
= 𝑐% 𝜆%# 𝑥% .
= 𝑐% 𝑀𝑥% + 𝑐& 𝑀𝑥& + ⋯ + 𝑐' 𝑀𝑥' • May not converge if 𝜆) = 𝜆* (discussed later).
• Here the PageRank leaks out since the matrix is not stochastic.
• Surfer will teleport out of a trap within a few time steps. • PageRank equation [Brin and Page, 1998]:
𝑟" 1
𝑟! = : 𝛽 + 1−𝛽
y y
𝑑" 𝑁
"→!
a m a m
• 𝑑, is the out-degree of node 𝑖.
1/
5
7/1
15
7/1
0×0
1/
15
y 7/15 7/15 1/15
13/15
• 𝑀 is preprocessed to be column-stochastic. a 7/15 1/15 1/15
7/15
a m 1/15 7/15 13/15
• In practice, 𝛽 = 0.8 or 0.9 (surfer jumps every 5 to 10 steps). 1/15
m
A
• Note: 𝐴 is stochastic, diagonalizable, and satisfies 𝜆% > 𝜆& .
• 𝜆) and 𝜆* are the two largest eigenvalues. y 1/3 0.33 0.24 0.26 7/33
a = 1/3 0.20 0.20 0.18 ... 5/33
m 1/3 0.46 0.52 0.56 21/33
Jaemin Yoo 33 34
Summary
1. Web Search as a Graph
2. PageRank
• Recursive formulation
• Power iteration
3. PageRank: Implementation
• Spider traps
• Dead ends
• Random teleports
• Sparse matrix computation
Jaemin Yoo 39