Implicit degree bias in the link prediction task

Aiyappa, Rachith; Wang, Xin; Kim, Munjung; Seckin, Ozgur Can; Yoon, Jisung; Ahn, Yong-Yeol; Kojaku, Sadamori

Computer Science > Social and Information Networks

arXiv:2405.14985 (cs)

[Submitted on 23 May 2024 (v1), last revised 29 May 2024 (this version, v2)]

Title:Implicit degree bias in the link prediction task

Authors:Rachith Aiyappa, Xin Wang, Munjung Kim, Ozgur Can Seckin, Jisung Yoon, Yong-Yeol Ahn, Sadamori Kojaku

View PDF HTML (experimental)

Abstract:Link prediction -- a task of distinguishing actual hidden edges from random unconnected node pairs -- is one of the quintessential tasks in graph machine learning. Despite being widely accepted as a universal benchmark and a downstream task for representation learning, the validity of the link prediction benchmark itself has been rarely questioned. Here, we show that the common edge sampling procedure in the link prediction task has an implicit bias toward high-degree nodes and produces a highly skewed evaluation that favors methods overly dependent on node degree, to the extent that a ``null'' link prediction method based solely on node degree can yield nearly optimal performance. We propose a degree-corrected link prediction task that offers a more reasonable assessment that aligns better with the performance in the recommendation task. Finally, we demonstrate that the degree-corrected benchmark can more effectively train graph machine-learning models by reducing overfitting to node degrees and facilitating the learning of relevant structures in graphs.

Comments:	13 pages, 3 figures
Subjects:	Social and Information Networks (cs.SI); Physics and Society (physics.soc-ph)
Cite as:	arXiv:2405.14985 [cs.SI]
	(or arXiv:2405.14985v2 [cs.SI] for this version)
	https://doi.org/10.48550/arXiv.2405.14985

Submission history

From: Sadamori Kojaku [view email]
[v1] Thu, 23 May 2024 18:38:42 UTC (4,401 KB)
[v2] Wed, 29 May 2024 08:39:50 UTC (4,393 KB)

Computer Science > Social and Information Networks

Title:Implicit degree bias in the link prediction task

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Social and Information Networks

Title:Implicit degree bias in the link prediction task

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators