A Provable Framework of Learning Graph Embeddings via Summarization

Authors

  • Houquan Zhou Institute of Computing Technology, Chinese Academy of Science
  • Shenghua Liu Institute of Computing Technology, CAS, China
  • Danai Koutra U Michigan
  • Huawei Shen Institute of Computing Technology, Chinese Academy of Sciences
  • Xueqi Cheng Institute of Computing Technology, Chinese Academy of Sciences

DOI:

https://doi.org/10.1609/aaai.v37i4.25621

Keywords:

DMKM: Graph Mining, Social Network Analysis & Community Mining, DMKM: Data Visualization & Summarization, ML: Graph-based Machine Learning, ML: Representation Learning

Abstract

Given a large graph, can we learn its node embeddings from a smaller summary graph? What is the relationship between embeddings learned from original graphs and their summary graphs? Graph representation learning plays an important role in many graph mining applications, but learning em-beddings of large-scale graphs remains a challenge. Recent works try to alleviate it via graph summarization, which typ-ically includes the three steps: reducing the graph size by combining nodes and edges into supernodes and superedges,learning the supernode embedding on the summary graph and then restoring the embeddings of the original nodes. How-ever, the justification behind those steps is still unknown. In this work, we propose GELSUMM, a well-formulated graph embedding learning framework based on graph sum-marization, in which we show the theoretical ground of learn-ing from summary graphs and the restoration with the three well-known graph embedding approaches in a closed form.Through extensive experiments on real-world datasets, we demonstrate that our methods can learn graph embeddings with matching or better performance on downstream tasks.This work provides theoretical analysis for learning node em-beddings via summarization and helps explain and under-stand the mechanism of the existing works.

Downloads

Published

2023-06-26

How to Cite

Zhou, H., Liu, S., Koutra, D., Shen, H., & Cheng, X. (2023). A Provable Framework of Learning Graph Embeddings via Summarization. Proceedings of the AAAI Conference on Artificial Intelligence, 37(4), 4946-4953. https://doi.org/10.1609/aaai.v37i4.25621

Issue

Section

AAAI Technical Track on Data Mining and Knowledge Management