Lecture 4
http://msande317.stanford.edu.
Instructors: Ashish Goel and Reza Zadeh, Stanford University.
4.1 Outline
1. Matrix Vector Multiply (Av)
2. PageRank
• on MapReduce
• on RDDs / Spark
4.3 PageRank
For a graph G with n nodes, we define the transition matrix Q = D^{-1} A, where A ∈ R^{n×n} is the adjacency matrix and D ∈ R^{n×n} is the diagonal matrix whose i-th diagonal entry is the out-degree of node i (its number of outgoing edges).
We use Power Iteration to estimate importance values for webpages as v^(k+1) = v^(k) Q, where v ∈ R^n is a row vector and k is the iteration index. We set v^(0) = 1, the vector with every element equal to one.
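As a concrete illustration, here is a minimal NumPy sketch of this construction on a made-up 4-node graph (not the graph of Figure 1); it assumes every node has at least one outgoing edge, so that D is invertible.

import numpy as np

# Adjacency matrix of a toy 4-node directed graph: A[i, j] = 1 iff there is an edge i -> j.
A = np.array([
    [0, 1, 1, 0],
    [0, 0, 1, 0],
    [1, 0, 0, 1],
    [0, 0, 1, 0],
], dtype=float)

out_deg = A.sum(axis=1)          # out-degree of each node (assumed positive)
D_inv = np.diag(1.0 / out_deg)
Q = D_inv @ A                    # row-stochastic transition matrix Q = D^{-1} A

v = np.ones(A.shape[0])          # v^(0) = 1, as above
for _ in range(20):
    v = v @ Q                    # v^(k+1) = v^(k) Q (row vector multiplied on the left)
print(v)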
Figure 1: Graph G (a directed graph on nodes 1–7; nodes 2 and 7 have no outgoing edges)
Using Q as the transition matrix of the random walk is a problem when G contains dead ends, i.e. “sink” nodes with no outgoing edges (nodes 2 and 7 in Figure 1), since their rows of Q are not well defined. We introduce the idea of random teleports: with probability α, where 0 < α < 1, the random walker teleports to a random webpage, and with probability 1 − α it continues walking along an outgoing edge. Then we have a new matrix:
P = (1 − α)Q + αΛ
where Λ ∈ R^{n×n} is the matrix each of whose n rows equals the row vector λ (i.e. Λ = 1λ, with 1 ∈ R^n the all-ones column vector), and λ ∈ R^n is the probability distribution of teleporting to each webpage (typically uniform, λ_j = 1/n).
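To make the construction of P concrete, here is a minimal sketch that assumes a uniform teleport distribution λ_j = 1/n and, as one common convention not fixed in these notes, replaces the undefined sink rows of Q by λ.

import numpy as np

alpha = 0.15                       # teleport probability (an illustrative value)

# Toy 4-node graph in which node 3 is a sink (no outgoing edges).
A = np.array([
    [0, 1, 1, 0],
    [0, 0, 1, 1],
    [1, 0, 0, 1],
    [0, 0, 0, 0],
], dtype=float)

n = A.shape[0]
lam = np.full(n, 1.0 / n)          # uniform teleport distribution λ (row vector)
Lam = np.tile(lam, (n, 1))         # Λ: every row equals λ

out_deg = A.sum(axis=1)
Q = np.zeros_like(A)
for i in range(n):
    Q[i] = A[i] / out_deg[i] if out_deg[i] > 0 else lam   # sink rows teleport uniformly (assumption)

P = (1 - alpha) * Q + alpha * Lam  # P = (1 − α)Q + αΛ
assert np.allclose(P.sum(axis=1), 1.0)                    # P is row-stochastic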
The Power Iteration applies again, v^(k+1) = v^(k) P, and the iterates converge to the stationary distribution π of P.
Theorem 4.1
‖π − v^(k)‖_2 ≤ e^(−ak)
for some constant a > 0.
According to Theorem 4.1, for n = 10^9, around 9 iterations are enough to obtain the correct ranking.
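To see where such an estimate comes from, note that requiring the error to fall below roughly 1/n (so that individual pages can be ranked reliably) gives e^(−ak) ≤ 1/n, i.e. k ≥ ln(n)/a; the constant a is graph-dependent, and with the purely illustrative choice a ≈ ln 10 this yields k ≥ log_10(10^9) = 9 for n = 10^9.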
Algorithm 2 PageRank Computation on MapReduce, Step 2
function map(⟨ i, v_i^(k), {(j, P_ij)} ⟩)
    for (j, P_ij) ∈ links do        ▷ links = {(j, P_ij)}, the out-links of node i
        Emit(j, P_ij · v_i^(k))
    end for
end function

function reduce(i, values)          ▷ values = {P_ji · v_j^(k) : j an in-neighbor of i}
    v_i^(k+1) = Σ_{v ∈ values} v
    Emit(i, v_i^(k+1))
end function
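As a sanity check, the following is a minimal single-machine Python simulation of this map/reduce round; the function names and the in-memory shuffle are illustrative stand-ins, not an actual MapReduce API.

from collections import defaultdict

def pagerank_map(record):
    # record = (i, v_i, [(j, P_ij), ...]); emit (j, P_ij * v_i) for every out-link.
    i, v_i, links = record
    for j, p_ij in links:
        yield (j, p_ij * v_i)

def pagerank_reduce(key, values):
    # Sum the incoming contributions for node `key` to obtain v_key^(k+1).
    return (key, sum(values))

def one_iteration(records):
    grouped = defaultdict(list)           # shuffle phase: group mapper output by key
    for record in records:
        for key, value in pagerank_map(record):
            grouped[key].append(value)
    return dict(pagerank_reduce(k, vs) for k, vs in grouped.items())   # reduce phase

# Toy input: one record per node, holding v_i^(k) = 1 and that node's row of P (3-node graph).
records = [
    (0, 1.0, [(1, 0.5), (2, 0.5)]),
    (1, 1.0, [(0, 1.0)]),
    (2, 1.0, [(0, 0.5), (1, 0.5)]),
]
print(one_iteration(records))             # {1: 1.0, 2: 0.5, 0: 1.5}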