KDD Tutorial Part2 Network Embedding and GCN
Peng Cui
Tsinghua University
http://www.cytoscape.org/
https://www.future-science.com/doi/10.4155/fmc.13.202
Network
Applications
Network
GCN
Embedding
Network Embedding
[Figure: embed the network G = (V, E) into a vector space as G = (V); generate the network back from the vectors]
• Easy to parallelize
• Can apply classical ML methods
[Figure: Transitivity (nodes A, C)]
Transform network nodes into vectors that are fit for
off-the-shelf machine learning models.
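As a rough illustration of turning nodes into vectors, the sketch below uses a plain spectral embedding (the top eigenvectors of the adjacency matrix). This technique is swapped in for illustration only and is not a method from these slides; the toy graph and the function name `spectral_embedding` are assumptions.

```python
import numpy as np

def spectral_embedding(adj: np.ndarray, dim: int) -> np.ndarray:
    """Embed each node as a row of the top-`dim` eigenvectors of a symmetric adjacency matrix."""
    eigvals, eigvecs = np.linalg.eigh(adj)     # eigenvalues in ascending order
    top = np.argsort(eigvals)[::-1][:dim]      # indices of the largest eigenvalues
    return eigvecs[:, top]

# Hypothetical toy graph: two triangles {0,1,2} and {3,4,5} joined by the edge (2, 3).
edges = [(0, 1), (0, 2), (1, 2), (3, 4), (3, 5), (4, 5), (2, 3)]
adj = np.zeros((6, 6))
for u, v in edges:
    adj[u, v] = adj[v, u] = 1.0

emb = spectral_embedding(adj, dim=2)           # one 2-d vector per node
side = np.sign(emb[:, 1])                      # sign of the second coordinate
```

On this toy graph the sign of the second coordinate separates the two triangles, so a standard classifier or clustering method could work directly on the rows of `emb`.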
F. Scarselli, et al. The graph neural network model. IEEE TNN, 2009.
T. N. Kipf and M. Welling. Semi-supervised classification with graph convolutional networks. ICLR, 2017.
Community Dynamic
[Figure: a single eigen-decomposition (Λ, X) is shifted to obtain Embedding1, Embedding2, Embedding3, Embedding4, … efficiently]
• Preserving arbitrary-order proximity
• Low marginal cost
• Accurate and efficient
Z. Zhang, et al. Arbitrary-Order Proximity Preserved Network Embedding. KDD, 2018.
Preserving Arbitrary-Order Proximity
• High-order proximity: a polynomial function of the adjacency matrix
$S = f(A) = w_1 A^1 + w_2 A^2 + \cdots + w_q A^q$
• $q$: order; $w_1, \ldots, w_q$: weights, assumed to be non-negative
• $A$: could be replaced by other variations (such as the Laplacian matrix)
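The polynomial proximity can be computed directly for a small graph. The following is a minimal sketch; the function name `high_order_proximity`, the path graph, and the weights are illustrative assumptions.

```python
import numpy as np

def high_order_proximity(adj: np.ndarray, weights) -> np.ndarray:
    """Return S = w_1 A^1 + w_2 A^2 + ... + w_q A^q for the given weights."""
    power = np.eye(adj.shape[0])
    prox = np.zeros_like(adj, dtype=float)
    for w in weights:
        power = power @ adj        # A^i for i = 1, 2, ..., q
        prox += w * power
    return prox

# Hypothetical path graph 0 - 1 - 2.
adj = np.array([[0, 1, 0],
                [1, 0, 1],
                [0, 1, 0]], dtype=float)

S = high_order_proximity(adj, weights=[1.0, 0.5])   # order q = 2
```

Nodes 0 and 2 are not adjacent, yet `S[0, 2]` is 0.5 because the second-order term counts the two-hop path through node 1.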
[Figure: eigen-decomposing S directly is time-consuming; eigen-decomposing A into (Λ, X) and taking (ℱ(Λ), X) for S is efficient. Chart labels: +100%, +200%]
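The idea of reusing one eigen-decomposition of A for any polynomial S = f(A) can be checked numerically. This is a sketch assuming a symmetric A; the helper names, the random graph, and the square-root scaling of the embedding are illustrative assumptions, not the exact procedure of the cited paper.

```python
import numpy as np

def poly(x, weights):
    """Evaluate f(x) = w_1 x + w_2 x^2 + ... + w_q x^q elementwise."""
    return sum(w * x ** (i + 1) for i, w in enumerate(weights))

def embed_from_eig(eigvals, eigvecs, weights, dim):
    """Top-`dim` embedding of S = f(A), reusing an eigen-decomposition of A."""
    f_vals = poly(eigvals, weights)                   # eigenvalues of S
    top = np.argsort(np.abs(f_vals))[::-1][:dim]      # largest in magnitude
    return eigvecs[:, top] * np.sqrt(np.abs(f_vals[top]))

# Hypothetical random undirected graph on 8 nodes.
rng = np.random.default_rng(0)
upper = np.triu(rng.random((8, 8)) < 0.4, k=1).astype(float)
adj = upper + upper.T

eigvals, eigvecs = np.linalg.eigh(adj)                # computed once

for weights in ([1.0, 0.1], [1.0, 0.5, 0.25]):
    direct = sum(w * np.linalg.matrix_power(adj, i + 1)
                 for i, w in enumerate(weights))
    via_eig = (eigvecs * poly(eigvals, weights)) @ eigvecs.T
    assert np.allclose(direct, via_eig)               # same S, no new decomposition

emb = embed_from_eig(eigvals, eigvecs, [1.0, 0.1], dim=3)
```

Changing the order or the weights only re-evaluates `poly` on the stored eigenvalues, which is why the marginal cost of another embedding is low.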
Hyper-network embedding
[Figure: Networks vs. Hyper-Networks; labels: e1, e2, e3, e4, A1, A2, U1, U2, L1, L2]
• A hyper-network is a network in which an edge can include any number
of nodes
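One common way to store such a structure is a node-by-hyperedge incidence matrix. The sketch below is illustrative; the function name `incidence_matrix` and the example hyperedges are assumptions.

```python
import numpy as np

def incidence_matrix(num_nodes: int, hyperedges) -> np.ndarray:
    """Return H with H[v, e] = 1 if node v belongs to hyperedge e, else 0."""
    H = np.zeros((num_nodes, len(hyperedges)), dtype=int)
    for e, nodes in enumerate(hyperedges):
        H[list(nodes), e] = 1
    return H

# Hypothetical hyperedges of different sizes over 6 nodes.
hyperedges = [{0, 1, 2}, {2, 3}, {0, 3, 4, 5}]
H = incidence_matrix(6, hyperedges)

edge_sizes = H.sum(axis=0)     # number of nodes in each hyperedge
node_degrees = H.sum(axis=1)   # number of hyperedges containing each node
```

An ordinary network is the special case in which every column of `H` sums to exactly 2.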
[Figure: example hyper-networks; labels: Person, paper, venue, Drug, Author 1, Author 2, Author 3, …, ~14%]
[Figure: matrix-shape diagram with dimensions N and d; labels: X, =]
Robust GCN
• Adversarial attacks: small perturbations in graph structures and node attributes
• These pose great challenges for applying GCNs to node classification
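To see how a small structural perturbation propagates, the sketch below compares one step of GCN-style aggregation before and after adding a single edge. The propagation matrix follows Kipf and Welling's normalization; the toy graph, the features, and the inserted edge are illustrative assumptions, and this is not the defense proposed in the Robust GCN paper.

```python
import numpy as np

def normalized_adj(adj: np.ndarray) -> np.ndarray:
    """Return D^{-1/2} (A + I) D^{-1/2}, with D the degree matrix of A + I."""
    a_tilde = adj + np.eye(adj.shape[0])
    d_inv_sqrt = 1.0 / np.sqrt(a_tilde.sum(axis=1))
    return a_tilde * d_inv_sqrt[:, None] * d_inv_sqrt[None, :]

# Hypothetical graph: two triangles {0,1,2} and {3,4,5} joined by the edge (2, 3).
edges = [(0, 1), (0, 2), (1, 2), (3, 4), (3, 5), (4, 5), (2, 3)]
adj = np.zeros((6, 6))
for u, v in edges:
    adj[u, v] = adj[v, u] = 1.0

# One-hot features marking which triangle a node belongs to.
feats = np.array([[1, 0]] * 3 + [[0, 1]] * 3, dtype=float)

clean = normalized_adj(adj) @ feats

attacked_adj = adj.copy()
attacked_adj[0, 4] = attacked_adj[4, 0] = 1.0      # a single inserted edge
attacked = normalized_adj(attacked_adj) @ feats

shift = np.abs(attacked - clean).sum(axis=1)       # per-node change
```

Before the perturbation, node 0 aggregates nothing from the second triangle; after it, `attacked[0, 1]` is positive, and the degree renormalization changes the aggregated features of the other nodes as well.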
Dingyuan Zhu, Ziwei Zhang, Peng Cui, Wenwu Zhu. Robust Graph Convolutional Networks Against Adversarial Attacks. KDD, 2019.
Disentangled GCN
• A real-world graph is typically formed due to many latent factors.
[Figure: Social Circles]
Existing GNNs/GCNs:
A holistic approach that takes in the whole neighborhood to produce a single node representation.
We suggest:
Disentangling the latent factors (by segmenting the heterogeneous parts and learning multiple factor-specific representations for a node).
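A simplified sketch of learning factor-specific representations is given below, loosely following the neighborhood-routing idea of the cited paper: each neighbor is softly assigned to one of K channels, and each channel is updated from the neighbors assigned to it. The function names, shapes, random untrained weights, and the omission of bias and nonlinearity are assumptions made for illustration.

```python
import numpy as np

def l2_normalize(x, axis=-1, eps=1e-12):
    """Scale vectors along `axis` to (approximately) unit length."""
    return x / (np.linalg.norm(x, axis=axis, keepdims=True) + eps)

def disentangle_node(x_node, x_neighbors, channel_weights, num_iters=3):
    """Return K factor-specific vectors for one node.

    x_node: (d_in,), x_neighbors: (m, d_in), channel_weights: (K, d_in, d_out).
    """
    # Project the node and its neighbors into K channels, one per latent factor.
    z_node = l2_normalize(np.einsum("d,kdo->ko", x_node, channel_weights))
    z_nbrs = l2_normalize(np.einsum("md,kdo->mko", x_neighbors, channel_weights))

    c = z_node.copy()
    for _ in range(num_iters):
        # Softly assign each neighbor to channels by similarity to the current estimate.
        logits = np.einsum("mko,ko->mk", z_nbrs, c)
        p = np.exp(logits - logits.max(axis=1, keepdims=True))
        p /= p.sum(axis=1, keepdims=True)
        # Update each channel from the neighbors assigned to it.
        c = l2_normalize(z_node + np.einsum("mk,mko->ko", p, z_nbrs))
    return c

# Hypothetical sizes: 4 input features, 5 neighbors, K = 3 channels of width 2.
rng = np.random.default_rng(0)
x_node = rng.normal(size=4)
x_neighbors = rng.normal(size=(5, 4))
channel_weights = rng.normal(size=(3, 4, 2))

factors = disentangle_node(x_node, x_neighbors, channel_weights)
```

The result holds one vector per channel rather than a single holistic vector, which is the contrast the slide draws with existing GNNs/GCNs.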
Jianxin Ma, Peng Cui, Kun Kuang, Xin Wang, Wenwu Zhu. Disentangled Graph Convolutional Networks. ICML, 2019.
Some interesting questions for GCN…
[Figure: Case 1, Case 2]
$= \mathrm{softmax}(S^K H^{(0)} W)$
High-order proximity
Wu, Felix, et al. Simplifying graph convolutional networks. ICML, 2019.
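The simplified model softmax(S^K H^(0) W) can be sketched in a few lines. Here S is assumed to be the symmetrically normalized adjacency matrix with self-loops, as in the cited paper; the toy graph, features, and random untrained weights are illustrative assumptions.

```python
import numpy as np

def normalized_adj(adj: np.ndarray) -> np.ndarray:
    """Return S = D^{-1/2} (A + I) D^{-1/2}, with D the degree matrix of A + I."""
    a_tilde = adj + np.eye(adj.shape[0])
    d_inv_sqrt = 1.0 / np.sqrt(a_tilde.sum(axis=1))
    return a_tilde * d_inv_sqrt[:, None] * d_inv_sqrt[None, :]

def sgc_predict(adj, h0, w, k):
    """Return class probabilities softmax(S^K H0 W), one row per node."""
    s = normalized_adj(adj)
    h = h0
    for _ in range(k):                 # K propagation steps, no nonlinearity in between
        h = s @ h
    logits = h @ w
    exp = np.exp(logits - logits.max(axis=1, keepdims=True))
    return exp / exp.sum(axis=1, keepdims=True)

# Hypothetical 4-node cycle with 2 input features and 3 classes.
adj = np.array([[0, 1, 0, 1],
                [1, 0, 1, 0],
                [0, 1, 0, 1],
                [1, 0, 1, 0]], dtype=float)
rng = np.random.default_rng(0)
h0 = rng.normal(size=(4, 2))
w = rng.normal(size=(2, 3))

probs = sgc_predict(adj, h0, w, k=2)
```

Because S^K H^(0) involves no learned parameters, it can be precomputed once, leaving only a linear classifier W to train.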
[Figure labels: Node, Topology, Features]
Peng Cui, Xiao Wang, Jian Pei, Wenwu Zhu. A Survey on Network Embedding. IEEE TKDE, 2019.
Ziwei Zhang, Peng Cui, Wenwu Zhu. Deep Learning on Graphs: A Survey. arXiv, 2019.
Thanks!
Peng Cui
[email protected]
http://pengcui.thumedialab.com