0% found this document useful (0 votes)

11 views6 pages

3.2 Detecting Communities in Social Networks

Uploaded by

sshanjay1

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

11 views6 pages

3.2 Detecting Communities in Social Networks

Uploaded by

sshanjay1

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 6

Social Network Analysis (3 - 3) Extraction & Mining Communities in Web Social Networks

With transpose

t 0 0 0
A = 0 0 0
1 1 0
Assume the initial hub weight vector is :
1
u = 1
1
Ans. :
We compute the authority weight vector by :

t
0 0 0 1 0
v = A u = 0 0 01 = 0
1 1 0 1 2
Then, the updated hub weight is :
0 0 1 0 2
u = Av = 0 0 10 = 2
0 0 0 2 0
This already corresponds to our intuition that node 3 is the most authoritative, since it is the
only one with incoming edges, and that nodes 1 and 2 are equally important hubs. If we repeat
the process further, we will only obtain scalar multiples of the vectors v and u computed at
step 1. So the relative weights of the nodes remain the same.

 3.2 Detecting Communities in Social Networks

 Network is used to represent real world entity. For example social network is connected
with friendships or co-authors. In most of the example, real social network contains two
parts : denser and sparser part.
 In denser sub-network, group of peoples are closely connected to each other. This type of
denser sub-network called as communities.
 Detecting communities from given social networks are practically important for the
following reasons:
1. For information recommendation, communities are used. In communities, members
have similar preferences and tests.
2. Communities will help us understand the structures of given social networks.
3. Communities will play important roles when we visualize large-scale social networks
 Communities, also known as modules and clusters, are sets of nodes which are relatively
more connected, and are believed to be the intrinsic structures in networks in the nature.
 Nodes in the same community often share interesting properties such as a common
function, interest, or purpose. Thus, community detection is one of the most important
problems in network analysis.

TECHNICAL PUBLICATIONS® - An up thrust for knowledge

Social Network Analysis (3 - 4) Extraction & Mining Communities in Web Social Networks

 Why Community Detection ?

1. Communities in a citation network might represent related papers on a single topic;
2. Communities on the web might represent pages of related topics;
3. Community can be considered as a summary of the whole network thus easy to visualize
and understand.
4. Sometimes, community can reveal the properties without releasing the individual
privacy information.

 3.3 Definition of Community

 Definition is divided into three parts:
1. Local definitions
2. Global definitions
3. Definitions based on vertex similarity.

 3.3.1 Local Definition

 The attention is focused on the vertices of the sub-network under investigation and on its
immediate neighborhood.
 A local definition of community is divided into two types : self-referring ones and
comparative ones.
 The examples of self referring definitions are clique , n-clique and k-plex.
1. Clique : A maximal sub-networks where each vertex is adjacent to all the others.
2. n-clique : A maximal sub-network such that the distance of each pair of vertices is not
larger than n
3. k-plex : A maximal sub-network such that each vertex is adjacent to all the others except
at most k of them.
 The examples of comparative definitions are LS set and weak community
 LS set : sub-network where each vertex has more neighbors inside than outside of the sub-
network
 Weak community : The total degree of the vertices inside the community exceeds the
number of edges lying between the community and the rest of the network.
 Fig. 3.3.1 shows a graph and a listing of the cliques contained in it. The sub-graphs are in
fact cliques, and that there are no remaining cliques in the graph. Notice that cliques in a
graph may overlap. The same node or set of nodes might belong to more than one clique.
Cliques = {1, 2, 3}, {1, 3, 5} and {3, 4, 5, 6}
 For example, in figure node 3 belongs in all three cliques. Also, there may be nodes that do
not belong to any cliques (for example node 7). However, no clique can be entirely
contained within another clique, because if it were the smaller clique would not be
maximal.
TECHNICAL PUBLICATIONS® - An up thrust for knowledge
Social Network Analysis (3 - 5) Extraction & Mining Communities in Web Social Networks

Fig. 3.3.1 : Graph and cliques

 3.3.2 Global Definitions

 A global definition of community is related to a sub-network with respect to the network as
a whole. It starts from a null model.
 Network which matches the original network in some of its topological features, but which
does not display community structure. Then, the linking properties of sub-networks of the
initial network are compared with those of the corresponding sub-networks in the null
model. If there is a wide difference between them, the sub-networks are regarded as
communities.
 Null model is designed by using randomness in the distribution of edges among vertices.
The most popular null model is that proposed by Newman and Girvan.
 Null model consists of a randomized version of the original network, where edges are
rewired at random, under the constraint that each vertex keeps its degree. This null model is
the basic concept behind the definition of modularity, a function which evaluates the
goodness of partitions of a network into communities.

 3.3.3 Definitions Based on Vertex Similarity

 Last type of definition is based on an assumption that communities are groups of vertices
which are similar to each other. To evaluate the similarity between each pair of vertices,
some calculation is used.
 Similarity measures are based on hierarchical clustering. Hierarchical clustering is a way to
find several layers of communities that are composed of vertices similar to each other.
 Repetitive merges of similar vertices based on some quantitative similarity measures will
generate a structure shown in Fig. 3.3.2. This structure is called dendrogram.
 Decompose data objects into a several levels of nested partitioning (tree of clusters), called
a dendrogram. A clustering of the data objects is obtained by cutting the dendrogram at
the desired level, then each connected component forms a cluster.
TECHNICAL PUBLICATIONS® - An up thrust for knowledge
Social Network Analysis (3 - 6) Extraction & Mining Communities in Web Social Networks

 Highly similar vertices are connected in the lower part of the dendrogram. Subtrees
obtained by cutting the dendrogram with horizontal line correspond to communities.
Communities of different granularity will be obtained by changing the position of the
horizontal line.

Fig. 3.3.2 : Dendrogram

 The horizontal axis of the dendrogram represents the distance or dissimilarity between
clusters. The vertical axis represents the objects and clusters.
 Each joining of two clusters is represented on the graph by the splitting of a horizontal line
into two horizontal lines. The horizontal position of the split, shown by the short vertical
bar, gives the distance (dissimilarity) between the two clusters.
 A cross-section of the tree at any level, as indicated by the dotted line, will give the
communities at that level

 3.4 Evaluating Communities

 Various methods are used for partitioning given network into communities. It is necessary
to establish which partition exhibit a real community structure.
 Quality function supports for finding good partitions. The most popular quality function is
the modularity.
 Newman and Girvan were among the first to address this issue and proposed modularity to
quantify the strength of community structure.
 This metric, based on the intuition that nodes within the same community should be more
tightly connected than they would be by chance, has been adopted for a variety of uses
including the validation and comparison of community structures, but also as an objective
function for optimization algorithms to identify communities.

TECHNICAL PUBLICATIONS® - An up thrust for knowledge

Social Network Analysis (3 - 7) Extraction & Mining Communities in Web Social Networks

 Fig. 3.4.1 shows a small network with community structure. In this case there are three
communities, denoted by the dashed circles, which have dense internal links but between
which there are only a lower density of external links.

Fig. 3.4.1 : Small network with community structure

 A graph can be split into communities in numerous ways, i.e. for each graph there are many
possible community structures. In the simple case, a community structure is defined as a
graph partition into a set of node sets C = {Ci}.
 To provide a measure of the quality of a community structure, we make use of modularity.
 Modularity quantifies the extent to which a given graph partition into communities presents
a systematic tendency to have more intra-community links than the same community
structure would present if the links would be rewired under ER (Erdos-Renyi) graph model.
 Modularity (Q) is defined in several ways.
k
Q =  ( eii – a i )
2

i=1
Where eii = Probability edge is in module i
2
ai = Probability a random edge would fall into module i
 Another View of Modularity

TECHNICAL PUBLICATIONS® - An up thrust for knowledge

Social Network Analysis (3 - 8) Extraction & Mining Communities in Web Social Networks

 Modularity measures the strength of a community partition by taking into account the
degree distribution. A larger value indicates a good community structure
 One advantage of modularity is that it can be computed using only connectivity of the
network, in the absence of any node labels or other information. However, this property can
also be considered a weakness because modularity is unable to incorporate metadata (e.g.
node labels) even if it is available.
 Modularity measures internal and not external connectivity, but it does so with reference to
a randomized null model.
 The modularity can be either positive or negative. Positive values indicate the possible
presence of community structure

 3.5 Methods for Community Detection and Mining

 The classical methods for dividing given networks into sub-networks are graph partitioning,
hierarchical clustering, and k-means clustering.
 All these methods depend upon the numbers of clusters or their size in advance. It is
necessary to find suitable methods that have abilities of extracting complete information
about the community structure of networks.
 The methods for detecting communities are roughly classified into the following categories:
1. Divisive algorithms
2. Modularity optimization
3. Spectral algorithms

 3.5.1 Divisive Algorithm

 Simple method to identify communities in a network is to find the edges that can connect
vertices of different communities and remove them, so that the communities get
disconnected from each other.
 Newman-Girvan algorithm was has two best features :
1. They involve iterative removal of edges from the network to split it into communities,
the edges removed being identified using “betweenness” measure which represents
number of shortest paths between pair of nodes that pass through the links
2. These measures are recalculated after each removal.
 Newman-Girvan algorithms are highly effective at discovering community structure in both
computer-generated and real-world network data, and they can be also used for complex
structure of networked systems. Fig. 3.5.1 shows detecting communities based on edge
betweenness.
 It uses the idea that “bridges” between communities must have high edge betweenness. The
edge with higher betweenness tends to be the bridge between two communities.
 The edge betweenness of an edge is the number of shortest paths between pairs of vertices
run along it. Iteratively removing the edges with highest betweenness, we can determine a
hierarchical tree and then communities.
TECHNICAL PUBLICATIONS® - An up thrust for knowledge

Matrices - PREVIOUS PAPER WITH SOLUTIONS PDF
82% (17)
Matrices - PREVIOUS PAPER WITH SOLUTIONS PDF
10 pages
Mathematics: Key Notes Terms Definitions Formulae
91% (11)
Mathematics: Key Notes Terms Definitions Formulae
463 pages
QT Study Pack 2006 Wip
100% (1)
QT Study Pack 2006 Wip
446 pages
Continuity in Metric Spaces
100% (1)
Continuity in Metric Spaces
6 pages
BCOM Maths Practice Questions
No ratings yet
BCOM Maths Practice Questions
4 pages
SNS Unit-Iii Notes
No ratings yet
SNS Unit-Iii Notes
33 pages
Chapter 2
No ratings yet
Chapter 2
26 pages
Maths Mock Test
No ratings yet
Maths Mock Test
2 pages
02 SNA Network Measures Basic 2
No ratings yet
02 SNA Network Measures Basic 2
19 pages
Learning Module in General Mathematics: Quarter 1 - Week 3
100% (1)
Learning Module in General Mathematics: Quarter 1 - Week 3
4 pages
CES521 - 3 - Stifeness Method (Truss)
No ratings yet
CES521 - 3 - Stifeness Method (Truss)
49 pages
Andriychuk M. Matrix Theory. Classics and Advances 2023
No ratings yet
Andriychuk M. Matrix Theory. Classics and Advances 2023
249 pages
Social Network Security 2 Marks
No ratings yet
Social Network Security 2 Marks
3 pages
1 Web Community 1
No ratings yet
1 Web Community 1
3 pages
Social Network Analysis
No ratings yet
Social Network Analysis
82 pages
Chapter 10 11
No ratings yet
Chapter 10 11
200 pages
Module3 Communitynetworks
No ratings yet
Module3 Communitynetworks
102 pages
Statistical Properties of Community Structure in Large Social &amp Information Networks
100% (2)
Statistical Properties of Community Structure in Large Social &amp Information Networks
10 pages
Polynomials Assignment 4
No ratings yet
Polynomials Assignment 4
6 pages
UNIT7-Community Detection
No ratings yet
UNIT7-Community Detection
91 pages
Signal Flow Graph
No ratings yet
Signal Flow Graph
34 pages
FALLSEM2018-19 - CSE3021 - ETH - SJT824 - VL2018191006149 - Reference Material I - Module3 - CommunityNetworks1
No ratings yet
FALLSEM2018-19 - CSE3021 - ETH - SJT824 - VL2018191006149 - Reference Material I - Module3 - CommunityNetworks1
98 pages
Social Network Analysis Unit-3
No ratings yet
Social Network Analysis Unit-3
28 pages
AI HL Functions
No ratings yet
AI HL Functions
93 pages
A Comprehensive Survey On Community Detection Methods and Applications in Complex Information Networks
No ratings yet
A Comprehensive Survey On Community Detection Methods and Applications in Complex Information Networks
47 pages
7 CommunityStructure Lastupdate2324
No ratings yet
7 CommunityStructure Lastupdate2324
80 pages
Ruteo Vehicular
No ratings yet
Ruteo Vehicular
65 pages
Module-1 Lecture-2
No ratings yet
Module-1 Lecture-2
60 pages
SIN Research Paper-1-33
No ratings yet
SIN Research Paper-1-33
33 pages
04 Communities
No ratings yet
04 Communities
78 pages
Community Detection in Social Media: Symeon Papadopoulos
No ratings yet
Community Detection in Social Media: Symeon Papadopoulos
75 pages
3.1 Extracting Evolution of Web Community From A Series of Web Archive
No ratings yet
3.1 Extracting Evolution of Web Community From A Series of Web Archive
18 pages
Sma 2321 Numerical Analysis 1
No ratings yet
Sma 2321 Numerical Analysis 1
3 pages
Community Detection
No ratings yet
Community Detection
72 pages
Module VI - Mining Social Network Graph
No ratings yet
Module VI - Mining Social Network Graph
88 pages
Social Network Analysis
No ratings yet
Social Network Analysis
22 pages
Networks BigData 1
No ratings yet
Networks BigData 1
43 pages
Community Structure
No ratings yet
Community Structure
30 pages
Module 3
No ratings yet
Module 3
36 pages
Cs8451 Design and Analysis of Algorithms: Unit-I Part-A 1. Define Algorithm
No ratings yet
Cs8451 Design and Analysis of Algorithms: Unit-I Part-A 1. Define Algorithm
21 pages
Sna It Unit3
No ratings yet
Sna It Unit3
19 pages
E-Communities - Part1
No ratings yet
E-Communities - Part1
80 pages
Week 2 - Social Network Analysis
No ratings yet
Week 2 - Social Network Analysis
30 pages
SNA-Community Detection
No ratings yet
SNA-Community Detection
38 pages
An Analysis On Measuring Graph Patterns in Social Networks
No ratings yet
An Analysis On Measuring Graph Patterns in Social Networks
6 pages
Community Detection in Social Network Ver4
No ratings yet
Community Detection in Social Network Ver4
23 pages
Mahoney Mmds08
No ratings yet
Mahoney Mmds08
30 pages
Unit 3
No ratings yet
Unit 3
18 pages
Communities 2
No ratings yet
Communities 2
24 pages
Homework#3
No ratings yet
Homework#3
21 pages
SNS Unit Iii
No ratings yet
SNS Unit Iii
21 pages
Module Iii
No ratings yet
Module Iii
18 pages
SocialNetworkAnalysis FullNote
No ratings yet
SocialNetworkAnalysis FullNote
10 pages
Data Science 5th Assignment
No ratings yet
Data Science 5th Assignment
13 pages
Unit 3
No ratings yet
Unit 3
11 pages
An Introduction To Banach Space Theory Softcover Reprint of The Original 1st Ed 1998 Robert E Megginson PDF Download
No ratings yet
An Introduction To Banach Space Theory Softcover Reprint of The Original 1st Ed 1998 Robert E Megginson PDF Download
84 pages
Network Centrality Measures in A Graph
No ratings yet
Network Centrality Measures in A Graph
16 pages
Unit 6 Mining Social Network Graph
No ratings yet
Unit 6 Mining Social Network Graph
9 pages
Community Detection
No ratings yet
Community Detection
41 pages
Exclusive Sum Labeling On Gear Graphs
No ratings yet
Exclusive Sum Labeling On Gear Graphs
5 pages
WEEK 3 - GRAPH OF A FUNCTION - Answer Sheet
No ratings yet
WEEK 3 - GRAPH OF A FUNCTION - Answer Sheet
6 pages
Community Detection and Evaluation
No ratings yet
Community Detection and Evaluation
46 pages
IMOmath - Applications of Calculus
No ratings yet
IMOmath - Applications of Calculus
4 pages
Overlapping Community Detection at Scale: A Nonnegative Matrix Factorization Approach
No ratings yet
Overlapping Community Detection at Scale: A Nonnegative Matrix Factorization Approach
10 pages
HCMUT MATHS4CS 055263 Assignment Community Structure Identification IMP
No ratings yet
HCMUT MATHS4CS 055263 Assignment Community Structure Identification IMP
10 pages
Sna Unit III
No ratings yet
Sna Unit III
10 pages
SN Notes
No ratings yet
SN Notes
28 pages
Communities and Bottlenecks Trees and Treelike Networks Have High Modularity
No ratings yet
Communities and Bottlenecks Trees and Treelike Networks Have High Modularity
9 pages
Community Detection in Social Networks An Overview
No ratings yet
Community Detection in Social Networks An Overview
6 pages
Graph Communities in Neo4j: Four Algorithms at Work
No ratings yet
Graph Communities in Neo4j: Four Algorithms at Work
11 pages
The Gamma Function
No ratings yet
The Gamma Function
7 pages
Section 5
No ratings yet
Section 5
21 pages
Comparative Analysis of Community Detection Algorithms
No ratings yet
Comparative Analysis of Community Detection Algorithms
5 pages
Standard Greedy Algorithms PDF
No ratings yet
Standard Greedy Algorithms PDF
3 pages
BDA Unit - 05
No ratings yet
BDA Unit - 05
7 pages
Review On Community Detection Algorithms in Social Network
No ratings yet
Review On Community Detection Algorithms in Social Network
5 pages
Community-Affiliation Graph Model For Overlapping Network Community Detection
No ratings yet
Community-Affiliation Graph Model For Overlapping Network Community Detection
6 pages
Soc - Net - Week 3
No ratings yet
Soc - Net - Week 3
3 pages
Lesson 5 Properties of Exponential Graphs
No ratings yet
Lesson 5 Properties of Exponential Graphs
4 pages
Definition: A Function F From A Set A To Set B Is A Rule of Correspondence That Assigns To Each
No ratings yet
Definition: A Function F From A Set A To Set B Is A Rule of Correspondence That Assigns To Each
4 pages
Community Detection
No ratings yet
Community Detection
5 pages
3 Community Detection Methods and Mining
No ratings yet
3 Community Detection Methods and Mining
3 pages
ANSWER KEY OF FIRST Quarterly Exam IN GENERAL MATHEMATICS-2023-2024-MOJARES & ROCES
No ratings yet
ANSWER KEY OF FIRST Quarterly Exam IN GENERAL MATHEMATICS-2023-2024-MOJARES & ROCES
3 pages
MATA33H3S Calculus For Management II Winter 2022 Syllabus and Lecture Schedule - 2 Pages
No ratings yet
MATA33H3S Calculus For Management II Winter 2022 Syllabus and Lecture Schedule - 2 Pages
2 pages
1 MATH-122 Course Outline
No ratings yet
1 MATH-122 Course Outline
3 pages
Lec4 Duality Exercise
No ratings yet
Lec4 Duality Exercise
1 page
Assisgnment
No ratings yet
Assisgnment
1 page
Troanary Photonic Storage Blueprint - How Light Based Logic can Redefine Computation and Data Storage
From Everand
Troanary Photonic Storage Blueprint - How Light Based Logic can Redefine Computation and Data Storage
Ylia Callan
No ratings yet
Social Media Data Mining and Analytics
From Everand
Social Media Data Mining and Analytics
Gabor Szabo
No ratings yet
The Tech Interview Playbook: From DSA to System Design
From Everand
The Tech Interview Playbook: From DSA to System Design
Chinmoy Mukherjee
No ratings yet

3.2 Detecting Communities in Social Networks

Uploaded by

3.2 Detecting Communities in Social Networks

Uploaded by

Social Network Analysis (3 - 3) Extraction & Mining Communities in Web Social Networks

 3.2 Detecting Communities in Social Networks

TECHNICAL PUBLICATIONS® - An up thrust for knowledge

 Why Community Detection ?

 3.3 Definition of Community

 3.3.1 Local Definition

Fig. 3.3.1 : Graph and cliques

 3.3.2 Global Definitions

 3.3.3 Definitions Based on Vertex Similarity

Fig. 3.3.2 : Dendrogram

 3.4 Evaluating Communities

TECHNICAL PUBLICATIONS® - An up thrust for knowledge

Fig. 3.4.1 : Small network with community structure

TECHNICAL PUBLICATIONS® - An up thrust for knowledge

 3.5 Methods for Community Detection and Mining

 3.5.1 Divisive Algorithm

You might also like