Tutorial3

The document provides an overview of Graph Attention Networks (GAT), detailing their structure, advantages, and implementation. It discusses the graph attention layer, message passing, and the computational efficiency of GAT, highlighting its ability to assign different levels of importance to nodes. It also includes implementation details using PyTorch Geometric and the steps to create a GCNConv layer.


Graph Attention Networks (GAT)

Antonio Longa¹,²
¹ MobS Lab, Fondazione Bruno Kessler, Trento, Italy
² SML Lab, University of Trento, Italy
TABLE OF CONTENTS

01 Recap
02 Introduction
03 Graph attention layer (GAT)
04 Pros of GAT
05 Message passing implementation
06 Implement our GCNConv
07 GAT implementation
01 Recap
PROBLEMS:

■ Different sizes
■ NOT invariant to node ordering

The same graph under two node orderings, 𝙂 = 𝙂’, can have different adjacency matrices: Adj(𝙂) ≠ Adj(𝙂’)
COMPUTATION GRAPH
The neighbourhood of a node defines its computation graph.

[Figure: INPUT GRAPH and the corresponding COMPUTATION GRAPH]


[Figure: the neighbour features X_A, X_B, X_E are processed by neural networks and combined with an ordering-invariant aggregation, e.g. sum or average]
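A minimal sketch (not from the slides) of why sum and mean aggregations are ordering invariant: shuffling the neighbour features does not change the aggregated result.

import torch

# hypothetical neighbour features X_A, X_B, X_E (3 neighbours, 4 features each)
x = torch.tensor([[1., 0., 2., 1.],
                  [0., 3., 1., 1.],
                  [2., 1., 0., 5.]])

perm = torch.randperm(x.size(0))   # a random reordering of the neighbours
x_shuffled = x[perm]

# sum and mean give the same result regardless of the neighbour ordering
assert torch.allclose(x.sum(dim=0), x_shuffled.sum(dim=0))
assert torch.allclose(x.mean(dim=0), x_shuffled.mean(dim=0))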
02 Introduction

How important are the features of node “c” to node “i”?

Can we learn such importance automatically?

YES, with GAT


03 Graph Attention Networks (GAT)
Petar Veličković, Senior Research Scientist at DeepMind
03 Graph Attention layer

INPUT: a set of node features

OUTPUT: a new set of node features
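In the notation of the GAT paper (Veličković et al., 2018), the layer maps N input feature vectors of size F to N output feature vectors of size F':

\mathbf{h} = \{\vec{h}_1, \ldots, \vec{h}_N\}, \ \vec{h}_i \in \mathbb{R}^{F} \;\longrightarrow\; \mathbf{h}' = \{\vec{h}'_1, \ldots, \vec{h}'_N\}, \ \vec{h}'_i \in \mathbb{R}^{F'}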


1) Apply a parameterized linear transformation to every node
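In the GAT paper this is a shared weight matrix applied to every node:

W\vec{h}_i, \qquad W \in \mathbb{R}^{F' \times F}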


2) Self-attention

Specify the importance of node j’s features to node i.
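In the GAT paper this importance is an unnormalized attention coefficient computed for every edge (i, j):

e_{ij} = a(W\vec{h}_i, W\vec{h}_j), \qquad a : \mathbb{R}^{F'} \times \mathbb{R}^{F'} \rightarrow \mathbb{R}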


3) Normalization
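The coefficients are normalized with a softmax across each neighbourhood, so that for every node i they sum to 1:

\alpha_{ij} = \mathrm{softmax}_j(e_{ij}) = \frac{\exp(e_{ij})}{\sum_{k \in \mathcal{N}_i} \exp(e_{ik})}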
4) Attention mechanism

The attention mechanism a is a single-layer feedforward neural network, with a LeakyReLU non-linearity: max(0.2x, x).
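In the GAT paper the mechanism is parameterized by a weight vector \vec{a} \in \mathbb{R}^{2F'} applied to the concatenation of the two transformed feature vectors, followed by the LeakyReLU above:

e_{ij} = \mathrm{LeakyReLU}\left( \vec{a}^{\,T} [\, W\vec{h}_i \,\Vert\, W\vec{h}_j \,] \right)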
5) Use it :)
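The normalized coefficients weight the transformed features of the neighbours and produce the output features of the layer (σ is a non-linearity):

\vec{h}'_i = \sigma\left( \sum_{j \in \mathcal{N}_i} \alpha_{ij} W \vec{h}_j \right)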
6) Multi-head attention

The outputs of the attention heads are combined by:
● Concatenation
● Average, on the final (prediction) layer of the network
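With K independent attention heads, the GAT paper combines the head outputs by concatenation in hidden layers and by averaging in the final layer:

\vec{h}'_i = \Big\Vert_{k=1}^{K} \sigma\left( \sum_{j \in \mathcal{N}_i} \alpha^{k}_{ij} W^{k} \vec{h}_j \right) \qquad \text{(concatenation)}

\vec{h}'_i = \sigma\left( \frac{1}{K} \sum_{k=1}^{K} \sum_{j \in \mathcal{N}_i} \alpha^{k}_{ij} W^{k} \vec{h}_j \right) \qquad \text{(average)}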
04 Pros of GAT
● Computationally efficient:
  - self-attention layers can be parallelized across edges
  - output features can be parallelized across nodes

● Allows assigning different importance to nodes of the same neighborhood

● It is applied in a shared manner to all edges of the graph, so the entire graph is not required

● Works in both:
  - transductive learning (Cora, Citeseer, Pubmed)
  - inductive learning (PPI)
05 Message passing implementation

The feature representation of node i at the k-th layer is computed from:
● the feature representation of node i at the (k-1)-th layer
● the feature representation of node j at the (k-1)-th layer, for every j in the neighbourhood of i
● [optionally] the features of the edge (i, j)

These inputs are combined by:
● a differentiable message function (e.g. an MLP)
● a differentiable, ordering-invariant aggregation function (e.g. sum, average) over the neighbourhood of i
● a differentiable update function (e.g. an MLP)
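In the notation used by the PyTorch Geometric documentation (γ is the update function, ϕ the message function, □ the ordering-invariant aggregation), this corresponds to:

\mathbf{x}_i^{(k)} = \gamma^{(k)}\left( \mathbf{x}_i^{(k-1)},\; \square_{j \in \mathcal{N}(i)}\, \phi^{(k)}\left( \mathbf{x}_i^{(k-1)}, \mathbf{x}_j^{(k-1)}, \mathbf{e}_{j,i} \right) \right)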
05 Message passing implementation
PyTorch Geometric provides the MessagePassing base class.

● message(): the differentiable message function ϕΘ (e.g. an MLP)
● aggregation: the ordering-invariant aggregation (sum, mean, max)
● update(): the differentiable update function γΘ (e.g. an MLP)
PARAMETERS
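As a hedged sketch (the parameter names below come from the PyTorch Geometric API, not from the slide), the MessagePassing constructor accepts, among others, the aggregation scheme and the message-flow direction:

from torch_geometric.nn import MessagePassing

class MyLayer(MessagePassing):
    def __init__(self):
        # aggr: aggregation scheme, e.g. "add", "mean" or "max"
        # flow: message direction, "source_to_target" (default) or "target_to_source"
        super().__init__(aggr="add", flow="source_to_target")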
METHODS

● aggregate(): aggregates the messages from the neighbors (sum, mean, max)
● message(): constructs the messages from node j to node i, in analogy to ϕΘ
● propagate(): propagates the messages
● update(): updates the node embeddings, in analogy to γΘ
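A minimal skeleton (a sketch, not the code shown on the slides) of how these methods fit together in a custom layer:

from torch_geometric.nn import MessagePassing

class MyConv(MessagePassing):
    def __init__(self):
        super().__init__(aggr='mean')             # ordering-invariant aggregation

    def forward(self, x, edge_index):
        # x: [N, F] node features, edge_index: [2, E] edge list
        return self.propagate(edge_index, x=x)    # calls message -> aggregate -> update

    def message(self, x_j):
        # x_j: features of the source node of every edge (the ϕΘ part)
        return x_j

    def update(self, aggr_out):
        # aggr_out: aggregated messages per node (the γΘ part)
        return aggr_out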
HOW TO USE IT?

● Layer name: GCNConv inherits from MessagePassing
● Initialize the class: call super(), specifying your aggregation (add, max, mean)
● Forward and propagate
● Compute the message
06 Implement our GCNConv
Simple example

In steps:
1. Add self-loops
2. Apply a linear transformation to the node feature matrix
3. Compute the normalization coefficients
4. Normalize the node features
5. Sum up the neighboring node features

Steps 1-3 are done in the forward() method, step 4 in the message() method, and step 5 is handled internally by the aggregation.
06 Implement our GCNConv
GCNConv inherits from MessagePassing and implements the five steps above (a code sketch follows below).
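A hedged sketch of the full layer, closely following the GCNConv example in the PyTorch Geometric documentation (exact signatures can differ between PyG versions):

import torch
from torch_geometric.nn import MessagePassing
from torch_geometric.utils import add_self_loops, degree

class GCNConv(MessagePassing):
    def __init__(self, in_channels, out_channels):
        super().__init__(aggr='add')                  # step 5: sum aggregation
        self.lin = torch.nn.Linear(in_channels, out_channels, bias=False)

    def forward(self, x, edge_index):
        # x: [N, in_channels], edge_index: [2, E]

        # Step 1: add self-loops to the adjacency matrix
        edge_index, _ = add_self_loops(edge_index, num_nodes=x.size(0))

        # Step 2: linearly transform the node feature matrix
        x = self.lin(x)

        # Step 3: compute the normalization coefficients 1/sqrt(deg(i) * deg(j))
        row, col = edge_index
        deg = degree(col, x.size(0), dtype=x.dtype)
        deg_inv_sqrt = deg.pow(-0.5)
        deg_inv_sqrt[deg_inv_sqrt == float('inf')] = 0
        norm = deg_inv_sqrt[row] * deg_inv_sqrt[col]

        # Steps 4-5: propagate() calls message(), aggregates and updates
        return self.propagate(edge_index, x=x, norm=norm)

    def message(self, x_j, norm):
        # Step 4: normalize the neighbouring node features
        return norm.view(-1, 1) * x_j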


07 GAT implementation

Jupyter Notebook
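A hedged usage sketch of PyTorch Geometric's built-in GATConv layer, which implements the attention layer described above, including multi-head attention (the class name and layer sizes below are illustrative, not from the notebook):

import torch
import torch.nn.functional as F
from torch_geometric.nn import GATConv

class GAT(torch.nn.Module):
    def __init__(self, in_channels, hidden_channels, out_channels, heads=8):
        super().__init__()
        # hidden layer: multi-head attention, head outputs are concatenated
        self.conv1 = GATConv(in_channels, hidden_channels, heads=heads)
        # prediction layer: head outputs are averaged (concat=False)
        self.conv2 = GATConv(hidden_channels * heads, out_channels, heads=1, concat=False)

    def forward(self, x, edge_index):
        x = F.elu(self.conv1(x, edge_index))
        return self.conv2(x, edge_index)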
