0% found this document useful (0 votes)

12 views6 pages

Graph Mining: A Survey of Graph Mining Techniques: August 2012

Uploaded by

2jieguojie2

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

12 views6 pages

Graph Mining: A Survey of Graph Mining Techniques: August 2012

Uploaded by

2jieguojie2

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 6

See discussions, stats, and author profiles for this publication at: https://www.researchgate.

net/publication/233801707

Graph mining: A survey of graph mining techniques

Conference Paper · August 2012

DOI: 10.1109/ICDIM.2012.6360146

CITATIONS READS
70 16,295

3 authors:

Saif ur Rehman Khan Asmat Ullah

Pir Mehr Ali Shah Arid Agriculture University 1 PUBLICATION 70 CITATIONS
52 PUBLICATIONS 607 CITATIONS
SEE PROFILE
SEE PROFILE

Simon Fong
University of Macau
721 PUBLICATIONS 11,815 CITATIONS

SEE PROFILE

All content following this page was uploaded by Simon Fong on 20 May 2014.

The user has requested enhancement of the downloaded file.

Graph Mining: A Survey of Graph Mining
Techniques

Saif Ur Rehman Asmat Ullah Khan Simon Fong

CORD: Center for Research in Data Department of Computer Science Department of Computer and
Engineering Shaheed Zulifiqar Ali Bhutto Institute of Information Science
Mohammad Ali Jinnah University Science and Technology (SZABIST) University of Macau
Islamabad, Pakistan Islamabad, Pakistan Taipa, Macau SAR
[email protected] [email protected] [email protected]

Abstract-Data mining is comprised of many data analysis vertices of a given input graph into clusters [22] graph
techniques. Its basic objective is to discover the hidden and useful clustering is based on unsupervised learning technique in
data pattern from very large set of data. Graph mining, which which the classes are not known in prior to clustering. The
has gained much attention in the last few decades, is one of the
graph clusters are formed based on some similarities in the
novel approaches for mining the dataset represented by graph
underlying graph structured data graph. (2) Graph
structure. Graph mining finds its applications in various problem
domains, including: bioinformatics, chemical reactions, Program
Classification; in graph classification the main task is to
flow structures, computer networks, social networks etc. classify separate, individual graphs in a graph database into
Different data mining approaches are used for mining the graph two or more categories/classes [22]. Classification is based on
based data and performing useful analysis on these mined data. supervised/semi supervised learning technique in which the
In literature various graph mining approaches have been classes of the data are defined in prior. (3) Sub graph mining;
proposed. Each of these approaches is based on either sub graph is a graph whose vertices and edges are subsets of
classification; clustering or decision trees data mining techniques. another graph. The frequent sub graph mining problem is to
In this study, we present a comprehensive review of various
produce the set of sub graphs occurring in at least some given
graph mining techniq ues. These different graph mining
threshold of the given n input example graphs [23].
techniques have been critically evalnated in this study. This
evalnation is based on different parameters. In our future work,
In this study we have provided comprehensive summary
we will provide our own classification based graph mining details of the different graph mining techniques. Each of these
technique which will efficiently and accurately perform mining techniques has been outlined with their techniques details, their
on the graph structured data. major research contributions along with the limitation of the
proposed techniques. These techniques have been further
Index Terms-Graph Mining, Sub graphs, frequent graphs, critically evaluated.
Data Mining
The rest of this paper is organized as follow: In section II
I. INTRODUCTION the underlying terminologies used in graph theory is provided.
In section III a detailed literature review is provided on the
graph mining techniques proposed in the last few decades.
Over the last few years there has been a number of research Section IV focuses on the critical analysis of these different
work on data mining in seeking for better performance and graph mining techniques, whose details are discussed in section
innovation. One innovation includes mining from structured II. This study will end with the conclusion of our work with
data, which is a new challenge. Since a structure is represented some future directions in section V
by proper relations and a graph can easily represent such
relations, knowledge discovery from graph-structured data II. BASIC GRAPH THEORY
poses a general problem for mining from structured data. Some A graph G is a pair of sets G = (V, E). V is the set of vertices
examples amenable to graph mining are finding typical web and the number of vertices n = IVI is the order of the graph.
browsing patterns, identifying typical substructures of chemical The set E contains the edges of the graph. In an undirected
compounds, finding typical subsequences of DNA and graph, each edge is an unordered pair {v, w}. In a directed
discovering diagnostic rules from patient history records [21]. graph (also called a digraph in much literature), edges are
Graph mining techniques have been categorized into ordered pairs. The vertices v and w are called the endpoints of
following groups. (1) Graph clustering; is the task of grouping the edge. The edge count lEI = m is the size of the graph. In a
the vertices of the graph into clusters taking into consideration
weighted graph, a weight functions (j) : E -? R is defined that
the edge structure of the graph in such a way that there should
assigns a weight on each edge. A graph is planar if it can be
be many edges within each cluster and relatively few between
drawn in a plane without any of the edges crossing [22]. The
the clusters? Graph clustering in the sense of grouping the

density of a graph G = (V, E) is defined as the ratio of the In [1], Callut et al. have proposed a new technique called
number of edges present in a graph to the maximum possible, D-Walks. This technique can efficiently handle the semi
supervised classification issues associated with the graphs of
8(G)=!!!.. For ne{O,l} , we set8(G) = O.
n large size. Their technique is based on the betweeness
2
A graph of density one is called complete [22].
The adjacency matrix AG of a given graph G = (V, E) of order
measures. The detail of betweeness can be found in [1]. The D
walks can classify the unlabeled nodes of different types of
graphs including directed or undirected graphs. This
n is an n x n matrix AG = (a.C: ) where classification has a linear time complexity with respect to the
V,u
(1) number of edges in the graphs, (2) the maximum walk
AG = ( ay
G
)=
{I, if(v,ujEE,
the diagonaI matnx 0f graph length considered and (3) the number of classes [1]. The
0 , otherwise
.

,u
unlabeled nodes of the graphs are predicted by comparing its
G(V,E) is betweeness measure with that of maximum betweeness
measure. The technique proposed in [1] has been implemented

D=
rdeg(V2)deg(v2)
o
0 0

0
o
o
on the CORA database. Then different experiments have been
performed using this database. All of these experiments
showed that [1] is more efficient and can accurately classify the
o
o
0
0
deg(v2) deg(v2) j,J
0
o
unlabeled nodes of the graphs and outperforms the existing
techniques available in the literature such as [2] and [3]. Their
The length of a path is the number of edges on it, and the main achievement is to handle the graphs having large number
distance between v and u is the length of the shortest path of nodes and edges as compare to [2] and [3] techniques
connecting them in G. The distance from a vertex to itself is In [4], Kashima et al. have proposed a new method that can
zero: the path from a vertex to itself is an empty edge sequence. handle the classification problem of graphs that have extremely
A graph is connected if there exist paths between all pairs of large no of nodes and edges. Their graph classification method
vertices. If there are vertices that cannot be reached from others, is based on kernel method. The details of kernel methods can
the graph is disconnected. The minimum number of edges that be found in [5]. The method proposed in [4] efficiently
would need to be removed from G in order to make it computes the inner product of two graphs to make a feature
disconnected is the edge connectivity of the graph. A cycle is a space for classifying the graphs. This technique takes an
simple path that begins and ends at the same vertex. A graph unknown graph as input and classifies the unknown graph into
that contains no cycle is acyclic and is also called a forest. A an appropriate class. Their proposed method calculates the
connected forest is called a tree [21]. similarity of two graphs based on nodes of the graphs and
S
A sub graph G =(S, Es) of G=(V, E) is composed of a set labels of the edges in the graphs. In [4] graphs are classified
of vertices s � y and a set of edges Es� E such that {v, u} � into same group if their similarities are identical. The technique
Es implies u, v E S; the graph G is a super graph of d. A proposed in [4] has been implemented for the prediction of
connected acyclic sub graph that includes all vertices is called a properties of chemical compound using the mutag and PTC
spanning tree of the graph. A spanning tree has necessarily dataset. Then different experiments have been performed using
exactly n-l edges. If the edges are assigned weights, the these datasets. All of these experiments showed that [4] is not
spanning tree with smallest total weight is called the minimum as efficient as [6] for mutag dataset but for PTC dataset it is
spanning tree. more efficient then existing techniques available in the
Note that there may exist several minimum spanning trees literature such as [6] and [7].
that may even be edge disjoint [22]. In [8], Dhillon et al. have presented an efficient and fast
Two graphs Gi = (Vi> Ei) and Gj = (Y;, E) are isomorphic if technique for graph clustering. This technique can handles
there exists a bi-jective (one to one) mappingfVi�Y; (called graph having large number of nodes and very large number of
an isomorphism) such that {u, v}E Ei; if and only if (j(v), itw) } edges. Their graph clustering technique is based on multilevel
E Ei• A bipartite graph i s a graph where the vertex set V can be methods using weighted kernel K-means objective function as
split in two sets A and B such that all edges lie between those refinement algorithms .The details of weighted Kernel k-means
two sets: if {u, v}E Ei, either vE A and wEBor vEBand W objective function for multilevel methods can be found in [9].
The technique proposed in [8] does not restricts the size of the
E A [23]. A complete graph is a graph where every pair of
cluster be nearly equal as compared to existing graph clustering
distinct vertices is adjacent. A complete graph on n vertices is
techniques available in literature. Furthermore, the graph
denoted by Kn (or sometimes by K(n) ) and The complete
clustering objective functions proposed in [8] can be
graph Kn of order n is a simple graph with n vertices in which
specialized for all phases of the algorithm according to
every vertex is adjacent to every other is called clique.
situation. The technique proposed in [8] has been implemented
III. LITERATURE REVIEW on the IMDB Movie dataset. The dataset has 1.2 million nodes
and 7.6 million edges. Furthermore different experiments have
This section summarizes the different proposed graph
been performed using this dataset. The proposed techniques
mining algorithms with their major research contributions and
compute 5000 cluster and 5000 eigenvectors [8] which is
limitations.
impractical for the algorithm in [9] due to requirements of main

89
memory up to 25 GB. All of these experiments showed that [8] In [14], Le et al. have proposed a new method for
is more efficient not only in memory consumption but also in clustering of bi-partite graph. This technique is called Coring
running time compared to the existing techniques available technique. The proposed technique can handle the issues of
such as [9]. Their main achievement is to handle the graphs partitioning a large graph into small sub graphs. The nodes of
having large number of nodes and edges which is impractical the clustered sub graphs are strongly interconnected within
to be handled in existing graph clustering techniques [9]. graph and weakly connected to the nodes of other graphs. Their
In [10], Dias and Ochi have presented enhancement in the method is called coring method that can handle both weighted
basic Genetic Algorithms (GAs). Their proposed technique can and unweighted graphs. The technique in [14] can computes
efficiently handle the issues of graph partitioning in large graph clusters that have a highly dense core region and encircled by
databases. The [10] proposed different procedure as lower density region. The proposed method in [14] works in
evolutionary steps for the improvement in the performance of following steps Step 1: In this step, the coring method
the basic GA. The proposed modifications in [10] to the basic computes the density variation sequence .The method
GA algorithms do not alter the global acting of the basic iteratively computes the minimum density D and set of nodes
technique for GA. Therefore these modifications are having minimum density M. The output of this step is sequence
implemented as fittings to the Basic GA. The proposed of D,s and M,s. Setp2: Following step 1, the coring method
procedures in [10] modify the local search and other identify the core nodes . To identify the method calculates the
diversification procedures [10]. The proposed procedures in rate of decrease/increase in value of minimum density. If the
[10] are implemented in 7 different versions. The performance rate of increase/decrease in the D value is greater than the
of the proposed algorithms was evaluated for different no of threshold and the sequence of M is also in some order then the
nodes in graph. The results established that the proposed nodes are identified as core nodes. Step 3: In this step, the
algorithms produces high quality clusters while maintaining the coring method partition the graph nodes into clusters. The set
same running time as compared to existing GA in the literature. of core nodes is the output to the next step. Step 4: it is the final
The main contribution of the proposed procedures has good step of this technique the core groups are expended into full
performance when the no of nodes are high as 500 nodes. clusters. The core nodes are the center of the clusters and the
In [11], Zhao et al have proposed a new technique for lower density nodes are encircles these core nodes. The
mining closed free tree in large graphs. Their technique is technique proposed in [14] has been implemented on the
called CFFTree (Closed Frequent Free Tree). This technique microarray dataset containing 62 samples including 40 tumor
efficiently mine frequent closed free tree in large graph and 22 normal colon tissues. Each sample consists of 2000
database whose nodes are labeled. The technique proposed in gene expressions database. The [14] successfully cluster the
[11] can handle the issues of mining frequent free trees in large tumor tissues and normal tissues in the database further the
graph database which is NP complete the details of NP method was evaluated using image of size 200x300 and the [14]
problem is found in [12]. A tree t with no designated root is efficiently cluster the core region from the image. The
called a free tree and a free tree t is closed if no super tree of t proposed method was also evaluated for introducing noise into
that has the same frequency of t [11] exists. The authors the image. The method successfully clusters the core region.
suggested that closed free trees are very few in graph but can The main strength of the proposed work is that this method can
maintain the same useful information as free trees. Furthermore, efficiently be used for noisy data.
they established that the computational time of closed frequent In [15], Chen et at. have proposed a graph model that can
free trees mining algorithm is polynomial and closed free trees efficiently handle the many to many correspondences problem
are more efficient. The [11] proposed efficient pruning among concepts in ontologies. Their proposed technique used
methods such as safe labeling pruning , safe positioning weighted bi-partite graph to model ontologies. The similarity
pruning, auto-morphisim-based pruning and canonical measure is computed for the all the edges using similarity
mapping-based pruning the details of these methods can be measure techniques such as in [16]. The proposed technique,
found in [11] to prune free trees that cannot generate closed assigns the similarity degree as weights of the edges in the
free tree in order to tune the mining process of closed free trees. graph. In the proposed technique, edges of the bi-partite graph
The technique proposed in [11] has been implemented on the having weight greater than the threshold are maintained other
AIDS antiviral screen chemical compound from Development edges are purged. The [15] uses graph partitioning technique
Therapeutics program in NCIINIH. Different experiments have [15] to co-cluster the vertex of the graph as concept cluster for
been performed by using this database. All of these two ontologies. The concept cluster produced by [15] in
experiments proved that [11] is more efficient and can previous step contains all common concepts from ontologies.
accurately computes free trees compared to [13]. While the In next step the concept cluster is used to set up mappings
proposed technique is the only technique developed for mining among ontologies. The contribution of the proposed techniques
closed frequent free tree in time the paper was written. The is that many-to-many mapping can be establish among
main contribution of their proposed technique is working on ontologies.
the novel concept of closed frequent free trees mining and In [16], Barber has proposed a new graph clustering
designing an algorithm for mining closed trees from graph mechanism for representing graph in the form of matrix. Their
databases. proposed technique extends the incidence matrix (showing
joining vertex of graph as matrix) to clique matrix. The clique

90
matrix shows that which nodes of the graph can fonn a clique. In [19] T.Ozaki et al, have proposed a new method for sub
The clique matrix can be efficiently used for graph clustering. graph mining in graph-structured database. Their method is
The proposed technique executes in the following steps: (1) in called HSG. The algorithm proposed in [19] is based frequent
first step, it calculates the maximal clique. (2) In this step, the hyper clique patterns; which tries to find the dependencies
clustering is performed by [16] as it identified the matrix with among graph in the large. The method proposed in [19]
smallest no of columns. The size of the clique is controlled by efficiently mine correlation in structured database. The authors
using threshold parameter that controls how large the clique proposed efficient pruning methods based on h-confidence
should be. Their technique is successfully applied to find the measures and depth-first and breadth-depth search methods,
large well-connected group in social network and cluster gene the details of these methods can be found in [19]. The
expression that exists in large population. The main technique proposed in [19] has been implemented on the PTE
contribution of the proposed work is the clique matrix notation and DTP_CM datasets. The [19] efficiently mine frequent
for graphs. hyperclique patterns in these datasets in reasonable time. The
In [17], Kraus et at. have proposed a new algorithm for main contribution of the proposed work is that [19] introduces
handling the graph clustering. Their algorithm is called semi a new concept of hyperclique to mine correlation in graph
supervised divisive hierarchical Graph clustering algorithm. databases and proposed an algorithm to mine frequent hyper
Their proposed technique can effectively handle the problem of clique patterns in large graph databases.
clustering with having no knowledge of the structure of In [20] Fatta et ai, proposed a new method for sub graph
underlying dataset. The authors proposed a hierarchical mining in large graph database. The method is called
algorithm that incorporates background knowledge into the distributed algorithm. Their algorithm is based on distributed
graph. The technique in [17] is used with weighted undirected peer to peer communication framework. The [20] can handle to
graph. The Euclidian distance between two adjacent nodes is very high workload in distributed manner. The distributed
calculated. To calculate the Euclidian distance fonnula is given algorithm proposed in [20] efficiently mines sub graph in
below: molecular compounds, the molecular compounds have very
large trees and very large no of sub graph. The [20] first
n
partitioned the search space dynamically to partition a large
d(p, q) == d(q, p) == �(ql-ql)2+(q2-P2i+....+{q,,-p.,i == L (q;-p;i tree. In the second step the [20] distributes the portioned tree in
i==l peer-to-peer communication framework and in the last step the
distributed algorithm uses load balancing and receiver initiated
and the ratio is computed by dividing the distance with average methods [20] to execute the sub graph mining process in
Euclidian distance of all the nodes in a graph. The must link distributed environment. To further test the effectiveness of the
indicates that two data item must be placed in same groups, and proposed method. The proposed technique has been
can-not links - two data item cannot be placed in same group, implemented on the DPT dataset. Then different experiments
are identified. Links with less weight are removed to control have been perfonned using this dataset. All of these
the chaining effect of the nodes on the clusters. To propagate experiments showed that [1] is more efficient and can
background knowledge in adjacent nodes the probability of the accurately mind sub graph in highly distributed and
visiting nodes with some threshold steps are calculated for two heterogeneous environment. The method proposed in [20] also
nodes and neighborhood similarity is measured for the nodes. has been tested for fault tolerant and the results showed that the
The proposed algorithm increases the weight of the edge if two proposed method has handled the situation very efficiently.
nodes are similar else the weight of the edge is decrease. The main contribution of the proposed work is that it works in
Afterward, nodes with small neighborhood are removed for highly distributed and heterogeneous environment.
creating clusters. Nodes having similar neighborhood values
are cluster in same group. The main contribution of the IV. CRITICAL EVALUATION
proposed work is the including of background knowledge in In this section we comment about the techniques, critically.
the clustering process. The critical evaluation is based on the observation of the
In [18], Schenker et at. have proposed a graph model for following metrics: parameters, technique, method,
classification of web documents. The proposed method is implementation, features, comparison and efficiency. The
based on k-NN [18] that successfully classifies unknown details are shown in Table 1. According to the comparison in
documents to its respective classes automatically. The Table 1, the work in [8] seems to be more efficient in
experiments on [18] is conducted which reveals that the graph computation time and memory usage during the clustering
based model for document classification computation time is process than [1] and [4] for classification. The model in [10] is
parallel to other vector based k-NN model. The experiments capable of handling larger nodes than [16] and [18] for
showed that for small nodes up to 30 the classification time of clustering in a large graph in efficiency and features. The
the proposed technique is as efficient as vector based k-NN results generated from the model [14] is however more
techniques but the technique in [18] out perfonned vector accurate than those in [1] and [8] for feature support. Thus the
based k-NN methods for large no of nodes both in perfonnance above discussion reveals that [14] may be more accurate for
and accuracy. noisy data and [8] may be more efficient for larger graphs.

91
TABLE I. COMPARISON OF RECENT WORKS ON GRAPH MINING

Paper Technique/Method Implementation Features Efficiency Comparison

Callut el al.[I] D-Walks CORA Capable of handling large graphs I .4 seconds per graph yes
Kashima el Multi-level Kernel k-means Mutag, PTC Reduced chaining effect; Computes yes
al[4] similarity both on label and edges
Dhillon el al [8] Multi-level Kernel k-means IMDB Movie Memory efficient 25 minutes for 1.2 yes
Efficient in running time million nodes and 7.6
million edges
Dias and Genetic Algorithm C++ Tracked the performance of GA for 98 % for 500 nodes yes
Ochi[IO] different type of graph
Zhao el al.[II] CFFfree C++,VS More efficient for graph with large no 10 to 1.5 free tree and yes
of nodes closed
Leel al.[14] Coring Method MicroArray Efficiently clustered core region in yes
dataset, image noisy data
Chen el al.[15] Bi-partite graph co-clustering yes
Barber[16] Clique matrix D1MACS Clique matrix notation for graphs. no
Clustering based on clique matrix
notations
Kraus el al.[17] SSHGCA MicroArray Including of background knowledge in yes
dataset clustering process
Schenker el K-NN Yahoo News More efficient and accurate for large Yes
al.[18] C++ size graph
T. Ozaki el HSG PTE, DW_CM Mine correlation in graphs No
al[19] Java
Fatta et al. [20] Distributed Algorithm PTE, DW_CM Efticient; Distributed; Heterogeneous No
Java

[8) Dhillon, Y. Guan and B. Kulis, "A Fast Kernel-based Multilevel

V. CONCLUSION AND fuTURE WORK Algorithm for Graph Clustering", Proceedings of The 11th ACM
SIGKDD, Chicago, IL, Aug. 21 - 24, 2005
In this study, we have presented the summary information [9) G. Karypis and V. Kumar, "A fast and high quality multilevel scheme
of the different graph mining techniques. These graph mining for partitioning irregular graphs". SIAM J. Sci. Comput.,20(\):359-392,
techniques are based on the classification, clustering, decision 1999
[10) C. R.Dias, and L. S.Ochi, "Efficient Evolutionary Algorithms for the
tree approaches, which are the data mining fundamentals. In
Clustering Problem in Directed Graphs", Proceedings of the 2003 IEEE
addition, we also have highlighted the research contributions Congress on Evolutionary Computation, v.l, pp. 983-988, 2003
and found out some limitations in different research works. [II] P. Zhao and 1. X. Yu, "Mining Closed Frequent Free Trees in Graph
Consequently, this work also depicts the critical evaluation in Databases", Proceeding of Database Systems for Advance Application
2007, pp. 91-102, 2007
which comparison and contrast have been taken out to show
[12) Yun Chi, Yirong Yang, and Richard R. Muntz. , "Indexing and mining
the similarities and differences among different author's works. free trees", In Proceedings of ICDM03, 2003.
The spatiality of this work is that it reveals the literature review [13] Wache, H., Vogele, T., Visser, U., Stuckenschmidt, "Ontology-Based
of different graph mining techniques and provides a vast Integration of Information - A Survey of Existing Approaches", In
Proceedings of IJCAI-OI Workshop on Ontologies and Information
amount of information under a single paper. In our future work,
Sharing, 108-117, 200I.
we have planned to propose a new classification method based [14] T. V. Le, C. A. Kulikowaski and I. B. Muchnik, "Coring Method for
on graph mining technique, provide its implementation and Clustering a Graph", In proceedings of IEEE 2008, 2008
compare its results with the different existing classification [15] Y. Chen and F. Fonseca , "A Bipartite Graph Co-Clustering Approach to
Ontology Mapping",2004
based graph mining algorithms.
[16) D. Barber. Clique Matrices for Statistical Graph Decomposition and
Parame- nite Matrices. In D. A. McAllester and P. Myllymaki, editors,
REFERENCES
AUAI Press, pp 26-33, 2008.
[I] J. Callut, K. Fran90isse, M. Saerens and P. Dupont, "Semi-supervised [17) J. M. Kraus, G. Palm and H. A. Kestler, "On the robustness of semi
Classification from Discriminative Random Walks", Lecture Notes in supervised hierarchical graph clustering in functional genomics", 2007
Artificial Intelligence No. 5211, Springer, 2008., pp. 162-177, [18) A. Schenker, M. Last, H. Bonke and A. Kandel "Classification of Web
[2] S.Macskassy and F. Provost, "Classification in networked data: A toolkit Documents Using a Graph Mode",Proceedings of the Seventh
and a univariate case study", J. Mach. Learn. Res., 8, pp935-983,2007 International Conference on Document Analysis and Recognition, 2003
[3] M.Newman, "A measure of betweenness centrality based on random [19) T.Ozaki and T.Ohkawa , "Mining Correlated Subgraphsin Graph
walks", Social networks 27,pp39-54, 2005 Databases", PAKDD 2008,pp 272-283, 2008
[4] H. Kashima and A. Inokuchi, "Kernels for graph classification", ICDM [20) G.D. Fatat and M.R. Berthold "High Performance Subgraph Mining in
Workshop on Active Mining 2002, 2002. Molecular Compounds",HPCC 2005, pp 866-877 ,2005
[5] M.Swell ,"Kernel Methods",2009 [21] S.E Schaeffer, "Graph Clustering", Computer Science Review 2007, pp
[6] J. Han.,X. Yan and P.S. Yu, "Mining and searching graphs and 27-64, 2007
structures", in the proceedings of 12th ACM Conference on Knowledge [22) H. Motoda, "What Can We Do with Graph-Structured Data A Data
Discovery and Data Mining (SIGKDD'2006),2006 Mining Perspective", Springe 2006, pp 1-2,2006
[7] S. Kramer and L. D. Raedt., "Feature construction with version spaces [23) N. S. Ketkar, L.B.Holder and OJ. Cook," Empirical Comparison of
for biochemical application", In Proc. of the 18th ICML,200I Graph Classification Algorithms",IEEE,2009

View publication stats

Gitlab Ci/Cd: An Overview
No ratings yet
Gitlab Ci/Cd: An Overview
32 pages
Graph Mining Tools
No ratings yet
Graph Mining Tools
3 pages
Pattern Mining Current Challenges and Op
No ratings yet
Pattern Mining Current Challenges and Op
16 pages
Graph Pattern Mining, Search and OLAP
No ratings yet
Graph Pattern Mining, Search and OLAP
14 pages
An Introduction To Graph Data: IBM T. J. Watson Research Center Hawthorne, NY 10532
No ratings yet
An Introduction To Graph Data: IBM T. J. Watson Research Center Hawthorne, NY 10532
11 pages
4 IJAEST Vol No.4 Issue No.2 Classification of Approaches and Challenges of Frequent Subgraphs Mining in Biological Networks 014 017
No ratings yet
4 IJAEST Vol No.4 Issue No.2 Classification of Approaches and Challenges of Frequent Subgraphs Mining in Biological Networks 014 017
4 pages
Data Mining Graphs and Networks
No ratings yet
Data Mining Graphs and Networks
5 pages
Scalable Maximal Subgraph Mining With Backbone-Preserving Graph Convolutions
No ratings yet
Scalable Maximal Subgraph Mining With Backbone-Preserving Graph Convolutions
22 pages
Online Visualization of Bibliography Using Visualization Techniques
No ratings yet
Online Visualization of Bibliography Using Visualization Techniques
43 pages
Data Mining-Graph Mining
No ratings yet
Data Mining-Graph Mining
9 pages
Paper Graph Mining
No ratings yet
Paper Graph Mining
8 pages
A Comparative Study of Frequent Subgraph Mining Algorithms
No ratings yet
A Comparative Study of Frequent Subgraph Mining Algorithms
17 pages
Graph Data Mining: Slides Are Modified From Jiawei Han & Micheline Kamber
No ratings yet
Graph Data Mining: Slides Are Modified From Jiawei Han & Micheline Kamber
37 pages
A Graph Mining Approach For Ranking and Discovering The Interesting Frequent Subgraph Patterns
No ratings yet
A Graph Mining Approach For Ranking and Discovering The Interesting Frequent Subgraph Patterns
17 pages
Modeling Relational Data As Graphs For Mining
No ratings yet
Modeling Relational Data As Graphs For Mining
6 pages
Introduction T o Web Mining
No ratings yet
Introduction T o Web Mining
12 pages
GraphMining 04 FrequentSubgraph
No ratings yet
GraphMining 04 FrequentSubgraph
61 pages
Graph Mining: Anuraj Mohan 13MZ01, CSED
No ratings yet
Graph Mining: Anuraj Mohan 13MZ01, CSED
50 pages
CA10 GraphMining
No ratings yet
CA10 GraphMining
59 pages
Continuous Subgraph Pattern Search Over Certain and Uncertain Graph Streams
No ratings yet
Continuous Subgraph Pattern Search Over Certain and Uncertain Graph Streams
18 pages
Unit 4
No ratings yet
Unit 4
78 pages
Graph-Based Clustering and Data Visualization Algorithms (PDFDrive)
No ratings yet
Graph-Based Clustering and Data Visualization Algorithms (PDFDrive)
120 pages
Sangma2022 Article HierarchicalClusteringForMulti
No ratings yet
Sangma2022 Article HierarchicalClusteringForMulti
26 pages
11 Graph Pattern Mining
No ratings yet
11 Graph Pattern Mining
71 pages
Grami-2014-Elseidy
No ratings yet
Grami-2014-Elseidy
12 pages
Mining Frequent Subgraph Patterns From Uncertain Graph Data
No ratings yet
Mining Frequent Subgraph Patterns From Uncertain Graph Data
16 pages
Contrast Subgraph Mining From Coherent Cores
No ratings yet
Contrast Subgraph Mining From Coherent Cores
10 pages
BC2017
No ratings yet
BC2017
28 pages
Upadhyay 2018 Ijca 916573
No ratings yet
Upadhyay 2018 Ijca 916573
9 pages
Graph Mining Handout
No ratings yet
Graph Mining Handout
7 pages
An Introduction To Data Mining Technique: August 2014
No ratings yet
An Introduction To Data Mining Technique: August 2014
6 pages
DM 5th Unit
No ratings yet
DM 5th Unit
54 pages
Clustering Techniquesin Data Mining
No ratings yet
Clustering Techniquesin Data Mining
7 pages
Big Data in The Machine Learning Techniq
No ratings yet
Big Data in The Machine Learning Techniq
8 pages
Uncovering Overlap Community Structure
No ratings yet
Uncovering Overlap Community Structure
11 pages
A Novel Methodology For Discrimination Prevention in Data Mining
No ratings yet
A Novel Methodology For Discrimination Prevention in Data Mining
21 pages
Data Mining and Knowledge Discovery For Big Data - Methodologies, Challenge and Opportunities (Chu 2013-10-09)
No ratings yet
Data Mining and Knowledge Discovery For Big Data - Methodologies, Challenge and Opportunities (Chu 2013-10-09)
310 pages
Strategies and Algorithms For Clustering Large Datasets: A Review
No ratings yet
Strategies and Algorithms For Clustering Large Datasets: A Review
20 pages
By Kanchan Jadhav Guided by Prof. R.N. Phursule Computer Engg Dept. Jspm's Imperial College of Engineering & Research
No ratings yet
By Kanchan Jadhav Guided by Prof. R.N. Phursule Computer Engg Dept. Jspm's Imperial College of Engineering & Research
20 pages
A06-A Survey of Clustering Techniques
No ratings yet
A06-A Survey of Clustering Techniques
5 pages
Information Sciences: Chunyao Song, Tingjian Ge, Yao Ge, Haowen Zhang, Xiaojie Yuan
No ratings yet
Information Sciences: Chunyao Song, Tingjian Ge, Yao Ge, Haowen Zhang, Xiaojie Yuan
24 pages
Community Detection Using Statistically Significant Subgraph Mining
No ratings yet
Community Detection Using Statistically Significant Subgraph Mining
10 pages
1st Slides
No ratings yet
1st Slides
60 pages
AReviewof Clustering Algorithms
No ratings yet
AReviewof Clustering Algorithms
8 pages
Co So Du Lieu Do Thi
No ratings yet
Co So Du Lieu Do Thi
46 pages
MARGIN Maximal Frequent Subgraph Mining
No ratings yet
MARGIN Maximal Frequent Subgraph Mining
6 pages
Critical Graphs For Minimum Vertex Cover
No ratings yet
Critical Graphs For Minimum Vertex Cover
26 pages
Clustering Techniques in Data Mining
No ratings yet
Clustering Techniques in Data Mining
7 pages
Surveyofclusteringmethods
No ratings yet
Surveyofclusteringmethods
29 pages
1.1 Web Mining
No ratings yet
1.1 Web Mining
16 pages
Menendez Llorente
No ratings yet
Menendez Llorente
22 pages
A Review of Self Optimal Clustering Technique and Data Mining Approach
No ratings yet
A Review of Self Optimal Clustering Technique and Data Mining Approach
6 pages
Web Mining and Web Usage Mining Techniques: Bulletin de La Société Des Sciences de Liège, Vol. 85, 2016, P. 321 - 328
No ratings yet
Web Mining and Web Usage Mining Techniques: Bulletin de La Société Des Sciences de Liège, Vol. 85, 2016, P. 321 - 328
8 pages
Web Mining Report
100% (2)
Web Mining Report
46 pages
Book Modern Trends in Fuzzy Graphs Full
No ratings yet
Book Modern Trends in Fuzzy Graphs Full
324 pages
Support Computation For Mining Frequent Subgraphs in A Single Graph
No ratings yet
Support Computation For Mining Frequent Subgraphs in A Single Graph
6 pages
Community Detection: Statistical Inference Models: Anupama Chowdhary Satya Prakash Sharma
No ratings yet
Community Detection: Statistical Inference Models: Anupama Chowdhary Satya Prakash Sharma
6 pages
DM Laqs
No ratings yet
DM Laqs
14 pages
STAR Ext CGF Sub
No ratings yet
STAR Ext CGF Sub
26 pages
Graph Data Modeling and Analytics with Neo4j: Definitive Reference for Developers and Engineers
From Everand
Graph Data Modeling and Analytics with Neo4j: Definitive Reference for Developers and Engineers
Richard Johnson
No ratings yet
Mesh Generation: Advances and Applications in Computer Vision Mesh Generation
From Everand
Mesh Generation: Advances and Applications in Computer Vision Mesh Generation
Fouad Sabry
No ratings yet
Vector Graphics Algo
No ratings yet
Vector Graphics Algo
24 pages
KELOMPOK 5 - An Overview of Business Intelligence, Analytics, and Data Science
No ratings yet
KELOMPOK 5 - An Overview of Business Intelligence, Analytics, and Data Science
15 pages
100 HRS New - syllabus-ITT
No ratings yet
100 HRS New - syllabus-ITT
11 pages
Tech Achievements With Photos (IT Batch 2026)
No ratings yet
Tech Achievements With Photos (IT Batch 2026)
23 pages
Bigdata Notes
No ratings yet
Bigdata Notes
136 pages
Class 4-MATHS Asssignment - Holiday HW-Answer Key
No ratings yet
Class 4-MATHS Asssignment - Holiday HW-Answer Key
14 pages
Brit J Educational Tech - 2023 - Giannakos - The Role of Learning Theory in Multimodal Learning Analytics
No ratings yet
Brit J Educational Tech - 2023 - Giannakos - The Role of Learning Theory in Multimodal Learning Analytics
22 pages
MES Manual R-19
No ratings yet
MES Manual R-19
28 pages
Unit-1-Android-And-Its-Tools MAD
No ratings yet
Unit-1-Android-And-Its-Tools MAD
10 pages
HART® Transmitter Calibration
No ratings yet
HART® Transmitter Calibration
16 pages
Path, Path Products and Regular Expressions - G9
No ratings yet
Path, Path Products and Regular Expressions - G9
37 pages
UOVision Glory LTE Cellular Trail Camera User Manual - Manuals+ PDF
No ratings yet
UOVision Glory LTE Cellular Trail Camera User Manual - Manuals+ PDF
47 pages
A Guide To UX Design and Development: Developer's Journey Through The UX Process 1st Edition Tom Green All Chapters Instant Download
100% (5)
A Guide To UX Design and Development: Developer's Journey Through The UX Process 1st Edition Tom Green All Chapters Instant Download
66 pages
Brochure SRT 4930 - en
No ratings yet
Brochure SRT 4930 - en
2 pages
2022 Icas TC Ar V Imp
No ratings yet
2022 Icas TC Ar V Imp
534 pages
Analyzing Malicious Documents Cheat Sheet
No ratings yet
Analyzing Malicious Documents Cheat Sheet
7 pages
A Survey On E-Commerce Recommendation Systems Using Artificial Intelligence and Current Trends For Personalization To Improve Customer Experience
No ratings yet
A Survey On E-Commerce Recommendation Systems Using Artificial Intelligence and Current Trends For Personalization To Improve Customer Experience
5 pages
Algorithms Lectures
No ratings yet
Algorithms Lectures
28 pages
A Workbook in Lexical Semantics
No ratings yet
A Workbook in Lexical Semantics
35 pages
Songwriting Project
No ratings yet
Songwriting Project
7 pages
Tux Paint 06
No ratings yet
Tux Paint 06
6 pages
Aw E-book - คู่มือการใช้งาน SOLIDWORKS
No ratings yet
Aw E-book - คู่มือการใช้งาน SOLIDWORKS
4 pages
Javell: Address: 23 A East Avenue, Linstead P.O., Jamaica Email: Telephone: (876) 484-8766 1876-416-8765
No ratings yet
Javell: Address: 23 A East Avenue, Linstead P.O., Jamaica Email: Telephone: (876) 484-8766 1876-416-8765
3 pages
C Co Ob Ba As S C 311 Analyzer: Experience The Benefits of Standardizing With Solutions
No ratings yet
C Co Ob Ba As S C 311 Analyzer: Experience The Benefits of Standardizing With Solutions
2 pages
Aveva™ - Engineering - Commands - 2024 09 26 13 33 05
No ratings yet
Aveva™ - Engineering - Commands - 2024 09 26 13 33 05
5 pages
Control Structure C
No ratings yet
Control Structure C
12 pages
eSEC01 NetSec
No ratings yet
eSEC01 NetSec
24 pages
Leica Aibot: Line 1 Line 2 (Optional)
No ratings yet
Leica Aibot: Line 1 Line 2 (Optional)
2 pages
Implementing Cisco Service Provider Next-Generation Core Network Services
No ratings yet
Implementing Cisco Service Provider Next-Generation Core Network Services
9 pages

Graph Mining: A Survey of Graph Mining Techniques: August 2012

Uploaded by

Graph Mining: A Survey of Graph Mining Techniques: August 2012

Uploaded by

See discussions, stats, and author profiles for this publication at: https://www.researchgate.

Graph mining: A survey of graph mining techniques

Conference Paper · August 2012

Saif ur Rehman Khan Asmat Ullah

The user has requested enhancement of the downloaded file.

Saif Ur Rehman Asmat Ullah Khan Simon Fong

978-1-4673-2430-4112/$31.00 ©2012 IEEE 88

Paper Technique/Method Implementation Features Efficiency Comparison

[8) Dhillon, Y. Guan and B. Kulis, "A Fast Kernel-based Multilevel

View publication stats

You might also like