Network Representation Learning: Consolidation and Renewed Bearing

Gurukar, Saket; Vijayan, Priyesh; Srinivasan, Aakash; Bajaj, Goonmeet; Cai, Chen; Keymanesh, Moniba; Kumar, Saravana; Maneriker, Pranav; Mitra, Anasua; Patel, Vedang; Ravindran, Balaraman; Parthasarathy, Srinivasan

Computer Science > Machine Learning

arXiv:1905.00987 (cs)

[Submitted on 2 May 2019 (v1), last revised 15 Jun 2019 (this version, v2)]

Title:Network Representation Learning: Consolidation and Renewed Bearing

Authors:Saket Gurukar, Priyesh Vijayan, Aakash Srinivasan, Goonmeet Bajaj, Chen Cai, Moniba Keymanesh, Saravana Kumar, Pranav Maneriker, Anasua Mitra, Vedang Patel, Balaraman Ravindran, Srinivasan Parthasarathy

View PDF

Abstract:Graphs are a natural abstraction for many problems where nodes represent entities and edges represent a relationship across entities. An important area of research that has emerged over the last decade is the use of graphs as a vehicle for non-linear dimensionality reduction in a manner akin to previous efforts based on manifold learning with uses for downstream database processing, machine learning and visualization. In this systematic yet comprehensive experimental survey, we benchmark several popular network representation learning methods operating on two key tasks: link prediction and node classification. We examine the performance of 12 unsupervised embedding methods on 15 datasets. To the best of our knowledge, the scale of our study -- both in terms of the number of methods and number of datasets -- is the largest to date.
Our results reveal several key insights about work-to-date in this space. First, we find that certain baseline methods (task-specific heuristics, as well as classic manifold methods) that have often been dismissed or are not considered by previous efforts can compete on certain types of datasets if they are tuned appropriately. Second, we find that recent methods based on matrix factorization offer a small but relatively consistent advantage over alternative methods (e.g., random-walk based methods) from a qualitative standpoint. Specifically, we find that MNMF, a community preserving embedding method, is the most competitive method for the link prediction task. While NetMF is the most competitive baseline for node classification. Third, no single method completely outperforms other embedding methods on both node classification and link prediction tasks. We also present several drill-down analysis that reveals settings under which certain algorithms perform well (e.g., the role of neighborhood context on performance) -- guiding the end-user.

Subjects:	Machine Learning (cs.LG); Social and Information Networks (cs.SI); Machine Learning (stat.ML)
Cite as:	arXiv:1905.00987 [cs.LG]
	(or arXiv:1905.00987v2 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.1905.00987

Submission history

From: Saket Gurukar [view email]
[v1] Thu, 2 May 2019 22:42:11 UTC (1,510 KB)
[v2] Sat, 15 Jun 2019 20:10:46 UTC (2,416 KB)

Computer Science > Machine Learning

Title:Network Representation Learning: Consolidation and Renewed Bearing

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Network Representation Learning: Consolidation and Renewed Bearing

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators