Pageranking Numerical
Pageranking Numerical
Module 2
PageRank (PR) is an algorithm used by Google Search to rank websites in their search
engine results. PageRank was named after Larry Page, one of the founders of Google.
PageRank is a way of measuring the importance of website pages.
According to Google: PageRank works by counting the number and quality of links to a
page to determine a rough estimate of how important the website is. The underlying
assumption is that more important websites are likely to receive more links from other
websites.
It is not the only algorithm used by Google to order search engine results, but it is the
first algorithm that was used by the company, and it is the best-known.
The PageRank computations require several passes, called “iterations”, through the
collection to adjust approximate PageRank values to more closely reflect the theoretical
true value.
Pages Iteration 0
A 0.25
B 0.25
C 0.25
D 0.25
Calculation for 1+ i iterations are as follows:
Page Rank distribution value of a page A = sum of all pages that are pointing to page A
(last iteration of that page/no. of outgoing links of that particular page)
Let’s continue with iteration 1st Iteration of above example
For Page A:
Pages pointing to A = C
PR distribution value of A = last iteration value of C/no. of outgoing links of C
= 0.25/3 = 0.0833333
Thus we can see, Page C is the most “voted” or followed page found by calculations.