Scalable Probabilistic Matrix Factorization with Graph-Based Priors

Strahl, Jonathan; Peltonen, Jaakko; Mamitsuka, Hiroshi; Kaski, Samuel

doi:10.1609/aaai.v34i04.6043

Computer Science > Machine Learning

arXiv:1908.09393 (cs)

[Submitted on 25 Aug 2019 (v1), last revised 11 Sep 2019 (this version, v2)]

Title:Scalable Probabilistic Matrix Factorization with Graph-Based Priors

Authors:Jonathan Strahl, Jaakko Peltonen, Hiroshi Mamitsuka, Samuel Kaski

View PDF

Abstract:In matrix factorization, available graph side-information may not be well suited for the matrix completion problem, having edges that disagree with the latent-feature relations learnt from the incomplete data matrix. We show that removing these $\textit{contested}$ edges improves prediction accuracy and scalability. We identify the contested edges through a highly-efficient graphical lasso approximation. The identification and removal of contested edges adds no computational complexity to state-of-the-art graph-regularized matrix factorization, remaining linear with respect to the number of non-zeros. Computational load even decreases proportional to the number of edges removed. Formulating a probabilistic generative model and using expectation maximization to extend graph-regularised alternating least squares (GRALS) guarantees convergence. Rich simulated experiments illustrate the desired properties of the resulting algorithm. On real data experiments we demonstrate improved prediction accuracy with fewer graph edges (empirical evidence that graph side-information is often inaccurate). A 300 thousand dimensional graph with three million edges (Yahoo music side-information) can be analyzed in under ten minutes on a standard laptop computer demonstrating the efficiency of our graph update.

Comments:	Under review
Subjects:	Machine Learning (cs.LG); Machine Learning (stat.ML)
ACM classes:	I.2.1; I.2.6; I.5.4
Cite as:	arXiv:1908.09393 [cs.LG]
	(or arXiv:1908.09393v2 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.1908.09393
Journal reference:	AAAI 2020
Related DOI:	https://doi.org/10.1609/aaai.v34i04.6043

Submission history

From: Jonathan Strahl [view email]
[v1] Sun, 25 Aug 2019 21:21:18 UTC (1,091 KB)
[v2] Wed, 11 Sep 2019 12:26:43 UTC (1,280 KB)

Computer Science > Machine Learning

Title:Scalable Probabilistic Matrix Factorization with Graph-Based Priors

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Scalable Probabilistic Matrix Factorization with Graph-Based Priors

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators