Retrofitting Temporal Graph Neural Networks with Transformer

Huang, Qiang; Yan, Xiao; Wang, Xin; Rao, Susie Xi; Han, Zhichao; Fu, Fangcheng; Zhang, Wentao; Jiang, Jiawei

arXiv:2409.05477 (cs)

[Submitted on 9 Sep 2024 (v1), last revised 18 Sep 2024 (this version, v3)]

Abstract: Temporal graph neural networks (TGNNs) outperform regular GNNs by incorporating time information into graph-based operations. However, TGNNs adopt specialized models (e.g., TGN, TGAT, and APAN) and require tailored training frameworks (e.g., TGL and ETC). In this paper, we propose TF-TGN, which uses a Transformer decoder as the backbone model for TGNNs so that it can reuse the Transformer codebase for efficient training. In particular, the Transformer has achieved tremendous success in language modeling, and thus the community has developed high-performance kernels (e.g., flash-attention and memory-efficient attention) and efficient distributed training schemes (e.g., PyTorch FSDP, DeepSpeed, and Megatron-LM). We observe that TGNN resembles language modeling, i.e., the message aggregation operation between chronologically occurring nodes and their temporal neighbors in TGNNs can be structured as sequence modeling. Besides this similarity, we also incorporate a series of algorithm designs, including suffix infilling, temporal graph attention with self-loop, and causal masking self-attention, to make TF-TGN work. During training, existing systems are slow in transforming the graph topology and conducting graph sampling. As such, we propose methods to parallelize the CSR format conversion and graph sampling. We also adapt the Transformer codebase to train TF-TGN efficiently with multiple GPUs. We experiment with 9 graphs and compare with 2 state-of-the-art TGNN training frameworks. The results show that TF-TGN can accelerate training by over 2.20× while providing comparable or even superior accuracy to existing SOTA TGNNs. TF-TGN is available at this https URL.
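
As a rough illustration of the data flow the abstract describes, the sketch below converts a timestamped edge list to CSR, samples a node's most recent temporal neighbors as a chronological sequence, and builds a causal mask over that sequence. It is a minimal single-threaded sketch, not TF-TGN's implementation: the function names (`build_temporal_csr`, `sample_recent_neighbors`, `causal_mask`), the most-recent-k sampling policy, and the extra sequence position for the target node are all assumptions made for illustration, and the parallelization the paper proposes is not shown.

```python
from bisect import bisect_left
from itertools import accumulate


def build_temporal_csr(num_nodes, edges):
    """Build a CSR layout from (src, dst, timestamp) edges.

    Each node's neighbors are stored contiguously, sorted by timestamp.
    """
    # Count the out-degree of every node, then prefix-sum into row offsets.
    degree = [0] * num_nodes
    for src, _, _ in edges:
        degree[src] += 1
    indptr = [0] + list(accumulate(degree))

    # Visiting edges in time order fills every row chronologically.
    indices = [0] * len(edges)
    times = [0.0] * len(edges)
    cursor = indptr[:-1].copy()
    for src, dst, ts in sorted(edges, key=lambda e: e[2]):
        indices[cursor[src]] = dst
        times[cursor[src]] = ts
        cursor[src] += 1
    return indptr, indices, times


def sample_recent_neighbors(csr, node, t, k):
    """Return up to k most recent (neighbor, timestamp) pairs strictly before t, oldest first."""
    indptr, indices, times = csr
    start, end = indptr[node], indptr[node + 1]
    # Binary search inside the node's row for the first edge at or after t.
    cut = bisect_left(times, t, start, end)
    lo = max(start, cut - k)
    return list(zip(indices[lo:cut], times[lo:cut]))


def causal_mask(length):
    """Lower-triangular mask: position i may attend to positions 0..i."""
    return [[j <= i for j in range(length)] for i in range(length)]


# Toy graph (made-up data): node 0 interacts with nodes 1, 2, 1, 3 over time.
edges = [(0, 1, 1.0), (0, 2, 3.0), (1, 2, 2.0), (0, 3, 5.0), (0, 1, 4.0)]
csr = build_temporal_csr(4, edges)
sequence = sample_recent_neighbors(csr, node=0, t=5.0, k=2)
# Assumption: one extra position for the target node itself (a self-loop token).
mask = causal_mask(len(sequence) + 1)
```

Because each CSR row is sorted by time, a binary search finds the cut-off for any query timestamp, and the sampled neighbors already arrive in the chronological order that a causally masked decoder expects.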

Comments:	Conference under review
Subjects:	Machine Learning (cs.LG)
Cite as:	arXiv:2409.05477 [cs.LG]
	(or arXiv:2409.05477v3 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2409.05477

Submission history

From: Qiang Huang
[v1] Mon, 9 Sep 2024 10:11:25 UTC (209 KB)
[v2] Tue, 10 Sep 2024 07:54:18 UTC (220 KB)
[v3] Wed, 18 Sep 2024 09:15:10 UTC (220 KB)
