Eigenvector Centrality and HITS Algorithm

Eigenvector Centrality and Hyperlink-Induced Topic Search (HITS)
Eigenvector Centrality: Revisited
❑The eigenvector centrality $x_v$ of a node $v$ in a network $G(V, E)$ is given by

$$x_v = \frac{1}{\lambda} \sum_{t \in N(v)} x_t = \frac{1}{\lambda} \sum_{t \in V} a_{vt} \, x_t$$

where $N(v)$ is the set of neighbours of $v$ and $\lambda$ is the largest eigenvalue of the matrix $A = (a_{ij})$, the adjacency matrix of the network $G$

❑The largest eigenvalue $\lambda$ is obtained by solving the equation

$$A X = \lambda X$$

❑$X$ above is a column vector whose $v$-th entry is $x_v$, the eigenvector centrality of the node $v$
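The eigenvalue equation above can be solved numerically by power iteration. The following is a minimal sketch, assuming NumPy and a connected, non-bipartite undirected graph (on a bipartite graph plain power iteration may oscillate). The function name `eigenvector_centrality` and the four-node example graph are illustrative choices, not part of the original slides.

```python
import numpy as np

def eigenvector_centrality(A, max_iter=1000, tol=1e-10):
    """Approximate the largest eigenvalue of A and its eigenvector X.

    Repeatedly applies X <- A X / ||A X|| until X stops changing.
    """
    A = np.asarray(A, dtype=float)
    n = A.shape[0]
    x = np.ones(n) / np.sqrt(n)      # unit-length starting vector
    lam = 0.0
    for _ in range(max_iter):
        y = A @ x                    # one application of A
        lam = np.linalg.norm(y)      # estimate of lambda, since ||x|| = 1
        if lam == 0:
            raise ValueError("graph has no edges")
        x_new = y / lam
        if np.linalg.norm(x_new - x) < tol:
            x = x_new
            break
        x = x_new
    return lam, x

# Hypothetical example: a triangle (nodes 0, 1, 2) with node 3 attached to node 0.
A = np.array([
    [0, 1, 1, 1],
    [1, 0, 1, 0],
    [1, 1, 0, 0],
    [1, 0, 0, 0],
])
lam, x = eigenvector_centrality(A)
print("largest eigenvalue:", lam)
print("centralities:", x)
```

In this example node 0 has the most neighbours and also ends up with the highest centrality, while nodes 1 and 2 receive equal scores by symmetry.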
Hyperlink-Induced Topic Search (HITS)
❑Based on the concept of Hub nodes and Authority nodes.
❑In response to a query, instead of an ordered list of pages each meeting the
query, find two sets of inter-related pages:

❑Hub pages are good lists of links on a subject.

❑Authority pages occur recurrently on good hubs for the subject.

❑Thus, a good hub page for a topic points to many authoritative pages for that
topic.

❑A good authority page for a topic is pointed to by many good hubs for that topic.
[Figure: example of hubs and authorities for the topic "mobile telecom companies". Node labels: Alice, Bob, AT&T, ITIM, O2.]
How to compute hub and authority scores
❑ Do a regular web search first

❑ Call the search result the root set

❑ Find all pages that are linked to or linked from pages in the root set

❑ Call this larger set the base set

❑ Finally, compute hubs and authorities for the base set (which we’ll view as a
small web graph)

• Root set typically has 200–1000 nodes.
• Base set may have up to 5000 nodes.
[Figure: the root set drawn inside the larger base set.]
▪ Given a broad search query, q, HITS collects a set of pages as
follows:
▪ It sends the query q to a search engine.
▪ It then collects t (t = 200 is used in the HITS paper) highest ranked
pages. This set is called the root set W.
▪ It then grows W by including any page pointed to by a page in W and
any page that points to a page in W. This gives a larger set S, called the
base set.
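The expansion from the root set W to the base set S can be sketched as follows. This is only an illustration: it assumes the link structure is already available as a dictionary `out_links` mapping each page to the pages it points to, whereas a real system would query a search engine and a link index. The function name `build_base_set` and the page names are hypothetical.

```python
def build_base_set(root_set, out_links):
    """Grow the root set W into the base set S.

    out_links maps a page to the list of pages it links to.
    """
    root = set(root_set)
    base = set(root)
    # Add every page pointed to by a page in W.
    for page in root:
        base.update(out_links.get(page, ()))
    # Add every page that points to a page in W.
    for page, targets in out_links.items():
        if any(target in root for target in targets):
            base.add(page)
    return base

# Hypothetical toy link structure.
out_links = {
    "p1": ["p2", "p3"],
    "p4": ["p1"],
    "p5": ["p6"],
    "p2": [],
}
S = build_base_set({"p1"}, out_links)
print(sorted(S))
```

Here `p2` and `p3` enter S because `p1` links to them, and `p4` enters because it links to `p1`; `p5` and `p6` have no connection to the root set and stay out.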
▪ HITS works on the pages in S, and assigns every page in S an
authority score and a hub score.
▪ Let the number of pages in S be n.
▪ We use G = (V, E) to denote the hyperlink graph of S.
▪ We use L to denote the adjacency matrix of the graph.
Let the authority score of page i be a(i), and the hub score of page i be h(i).
The mutually reinforcing relationship of the two scores is represented as follows:

$$a(i) = \sum_{(j,i) \in E} h(j)$$

$$h(i) = \sum_{(i,j) \in E} a(j)$$
We use $a$ to denote the column vector with all the authority scores,
$a = [a(1), a(2), \ldots, a(n)]^T$, and
use $h$ to denote the column vector with all the hub scores,
$h = [h(1), h(2), \ldots, h(n)]^T$.
Then,

$$a = L^T h$$

$$h = L a$$
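▪ The authority and hub scores are computed by power iteration, the same method used to compute PageRank scores.

▪ If we use $a_k$ and $h_k$ to denote the authority and hub vectors at the $k$-th iteration, substituting $h = L a$ into $a = L^T h$ (and vice versa) gives the iterations for generating the final solutions:

$$a_k = L^T L \, a_{k-1}, \qquad h_k = L L^T h_{k-1}$$

The vectors are normalized after each iteration so that the scores stay bounded.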
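The iteration can be sketched in a few lines of NumPy. This is a minimal illustration, not a reference implementation: the function name `hits`, the sum-to-one normalization, the stopping tolerance, and the four-page example graph are all assumptions made here. `L[i, j] = 1` means page i links to page j, matching the adjacency matrix L used above.

```python
import numpy as np

def hits(L, max_iter=100, tol=1e-10):
    """Return (authority, hub) score vectors for adjacency matrix L."""
    L = np.asarray(L, dtype=float)
    n = L.shape[0]
    a = np.ones(n)                   # initial authority scores
    h = np.ones(n)                   # initial hub scores
    for _ in range(max_iter):
        a_new = L.T @ h              # a = L^T h
        h_new = L @ a_new            # h = L a
        if a_new.sum() == 0 or h_new.sum() == 0:
            raise ValueError("graph has no edges")
        a_new = a_new / a_new.sum()  # normalize so the scores sum to 1
        h_new = h_new / h_new.sum()
        change = np.abs(a_new - a).sum() + np.abs(h_new - h).sum()
        a, h = a_new, h_new
        if change < tol:
            break
    return a, h

# Hypothetical example: page 0 links to page 2; page 1 links to pages 2 and 3.
L = np.array([
    [0, 0, 1, 0],
    [0, 0, 1, 1],
    [0, 0, 0, 0],
    [0, 0, 0, 0],
])
a, h = hits(L)
print("authority scores:", a)
print("hub scores:", h)
```

In this toy graph page 2 is the strongest authority because both hubs point to it, and page 1 is the stronger hub because it points to both authorities.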
[Figures: worked examples]
Exercise: Compute the hub and authority scores for the graph below.
[Figure: exercise graph]
Co-citation and Bibliographic Coupling
Another area of research concerned with links is citation analysis of scholarly publications.
◦ A scholarly publication cites related prior work to acknowledge the origins of some ideas and to compare the new proposal with existing work.

When a paper cites another paper, a relationship is established between the publications.
◦ Citation analysis uses these relationships (links) to perform various types of analysis.

We discuss two types of citation analysis, co-citation and bibliographic coupling. The HITS algorithm is related to these two types of analysis.
Co-citation links papers that are cited by the same articles: if papers i and j are both cited by paper k, then they may be related in some sense to one another.

The more papers cite both of them, the stronger their relationship is.
Bibliographic coupling
Bibliographic coupling operates on a similar principle.
Bibliographic coupling links papers that cite the same articles
◦ if papers i and j both cite paper k, they may be related.
The more papers they both cite, the stronger their similarity is.
Relationships with co-citation and bibliographic coupling
Co-citation of pages i and j, denoted by $C_{ij}$, is

$$C_{ij} = \sum_{k=1}^{n} L_{ki} L_{kj} = (L^T L)_{ij}$$

The authority matrix ($L^T L$) of HITS is the co-citation matrix $C$.

Bibliographic coupling of two pages i and j, denoted by $B_{ij}$, is

$$B_{ij} = \sum_{k=1}^{n} L_{ik} L_{jk} = (L L^T)_{ij}$$

The hub matrix ($L L^T$) of HITS is the bibliographic coupling matrix $B$.
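Both matrices are simple products of the adjacency matrix, which a short NumPy sketch makes concrete. The four-paper citation graph and the helper name `cocitation_by_sum` are made up for illustration; `L[i, j] = 1` means paper i cites paper j.

```python
import numpy as np

# Hypothetical citation graph: L[i, j] = 1 means paper i cites paper j.
L = np.array([
    [0, 0, 1, 1],   # paper 0 cites papers 2 and 3
    [0, 0, 1, 1],   # paper 1 cites papers 2 and 3
    [0, 0, 0, 1],   # paper 2 cites paper 3
    [0, 0, 0, 0],   # paper 3 cites nothing
])

C = L.T @ L   # co-citation: C[i, j] = number of papers citing both i and j
B = L @ L.T   # bibliographic coupling: B[i, j] = number of papers cited by both i and j

def cocitation_by_sum(L, i, j):
    """Evaluate C_ij directly from the summation form of the definition."""
    n = L.shape[0]
    return sum(L[k, i] * L[k, j] for k in range(n))

print("C =\n", C)
print("B =\n", B)
```

Papers 2 and 3 are co-cited by papers 0 and 1, so `C[2, 3]` is 2. Papers 0 and 1 both cite papers 2 and 3, so `B[0, 1]` is 2. The diagonal of `C` holds each paper's in-degree and the diagonal of `B` its out-degree.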
Strengths and weaknesses of HITS
Strength: its ability to rank pages according to the query topic, which may be able to provide more relevant authority and hub pages.
Weaknesses:
◦ It is easily spammed. It is in fact quite easy to influence HITS since adding out-links in one's own page is so easy.
◦ Topic drift. Many pages in the expanded set may not be on topic.
◦ Inefficiency at query time: The query time evaluation is slow. Collecting the root set, expanding it and performing eigenvector computation are all expensive operations.
Katz Centrality
❑An extension of eigenvector centrality

❑Can be used to compute centrality in directed networks such as citation networks and the World
Wide Web

❑Mostly suitable in the analysis of directed acyclic graphs

❑Computes the relative influence of a node in a network by considering all immediate neighbors
and all further nodes connected to the node

❑Connections with distant neighbors are, however, penalized by an attenuation factor

Katz Centrality: Attenuation Factor
❑ Let us consider the influence of Jose in the network, and let the attenuation factor be $\alpha$, $0 < \alpha < 1$.

❑ Immediate neighbours of Jose are Diego, Aziz, Bob, Priya, and Sri. The influence of these neighbours on Jose is attenuated by a factor of $\alpha$.

❑ Second-order neighbours of Jose are Agneta, John, Samantha, and Kim. The influence of these neighbours on Jose is attenuated by a factor of $\alpha^2$.

❑ The (only) third-order neighbour of Jose is Jane. The influence of this neighbour on Jose is attenuated by a factor of $\alpha^3$.

[Figure: example network centred on Jose. Source: https://www.geeksforgeeks.org/katz-centrality-centrality-measure/]
Katz Centrality
❑The Katz centrality of a node $v_i$ in a network $G(V, E)$, denoted $C_{Katz}(i)$, is defined as

$$C_{Katz}(i) = \sum_{k=1}^{\infty} \sum_{j=1}^{|V|} \alpha^k \, (A^k)_{ji}$$

where $A$ is the adjacency matrix of $G$

❑Entry $(A^k)_{ji}$ of the matrix $A^k$ gives the number of paths (walks) of length $k$ from node $v_j$ to node $v_i$; it is zero when no such path exists
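The double sum can be evaluated either by truncating the series or through the standard closed form $\left((I - \alpha A^T)^{-1} - I\right)\mathbf{1}$, which is valid when the series converges (that is, when $\alpha$ is smaller than the reciprocal of the largest eigenvalue of $A$, or for any $\alpha$ on a directed acyclic graph, where the series is finite). The sketch below assumes NumPy; the function names and the three-node acyclic example graph are illustrative assumptions, not part of the slides.

```python
import numpy as np

def katz_series(A, alpha, k_max=200):
    """Katz centrality from the series, truncated after k_max terms."""
    A = np.asarray(A, dtype=float)
    n = A.shape[0]
    scores = np.zeros(n)
    term = np.ones(n)
    for _ in range(k_max):
        # After k passes, term[i] = alpha^k * sum_j (A^k)[j, i].
        term = alpha * (A.T @ term)
        scores += term
    return scores

def katz_closed_form(A, alpha):
    """Katz centrality as ((I - alpha A^T)^{-1} - I) 1; needs a convergent series."""
    A = np.asarray(A, dtype=float)
    n = A.shape[0]
    ones = np.ones(n)
    return np.linalg.solve(np.eye(n) - alpha * A.T, ones) - ones

# Hypothetical directed acyclic graph with edges 0 -> 1, 1 -> 2, 0 -> 2.
A = np.array([
    [0, 1, 1],
    [0, 0, 1],
    [0, 0, 0],
])
print(katz_series(A, alpha=0.5))       # [0.   0.5  1.25]
print(katz_closed_form(A, alpha=0.5))  # same values
```

Node 2 receives $2\alpha$ from its two direct in-links plus $\alpha^2$ from the length-2 path 0 → 1 → 2, giving 1.25 for $\alpha = 0.5$; node 0 has no incoming paths and scores 0.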
Slide Credits and Reference Material:
1) Social Network Analysis by Tanmoy Chakraborty
2) Slides by: CS583, Bing Liu, UIC
3) Slides by: CS276, Information Retrieval and Web Search, Chris Manning and Pandu Nayak
