High-performance sampling of generic Determinantal Point Processes

Poulson, Jack

doi:10.1098/rsta.2019.0059

arXiv:1905.00165 (math)

[Submitted on 1 May 2019 (v1), last revised 17 Aug 2019 (this version, v2)]

Abstract: Determinantal Point Processes (DPPs) were introduced by Macchi as a model for repulsive (fermionic) particle distributions. Their recent popularization, however, is largely due to their usefulness for encouraging diversity in the final stage of a recommender system.
The standard sampling scheme for finite DPPs is a spectral decomposition followed by an equivalent of a randomly diagonally-pivoted Cholesky factorization of an orthogonal projection, which is only applicable to Hermitian kernels and has an expensive setup cost. Researchers have begun to connect DPP sampling to $LDL^H$ factorizations as a means of avoiding the initial spectral decomposition, but existing approaches have only outperformed the spectral decomposition approach in special circumstances, where the number of kept modes is a small percentage of the ground set size.
This article proves that trivial modifications of $LU$ and $LDL^H$ factorizations yield efficient direct sampling schemes for non-Hermitian and Hermitian DPP kernels, respectively. Further, it is experimentally shown that even dynamically-scheduled, shared-memory parallelizations of high-performance dense and sparse-direct factorizations can be trivially modified to yield DPP sampling schemes with essentially identical performance.
The software developed as part of this research, Catamari, this https URL, is released under the Mozilla Public License v2.0. It contains header-only, C++14 plus OpenMP 4.0 implementations of dense and sparse-direct, Hermitian and non-Hermitian DPP samplers.
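
The "trivial modification" of an $LU$ factorization described above can be illustrated with a short sketch. This is not Catamari's API and not the author's implementation: the function name `sample_dpp` is hypothetical, and the sketch assumes a small, dense, real-valued marginal kernel (entry $K_{jj}$ is the probability that index $j$ appears in the sample) stored as a list of rows. The assumed modification is that each pivot is read as a conditional inclusion probability, a Bernoulli draw decides whether to keep the index, and 1 is subtracted from the pivot when the index is rejected; the rest is an ordinary unpivoted, right-looking elimination step.

```python
import random


def sample_dpp(kernel, rng=None):
    """Draw one sample from a finite DPP given a dense marginal kernel.

    Hypothetical helper for illustration only: `kernel` is a list of rows
    of real numbers, and `rng` is an optional random.Random instance.
    """
    rng = rng or random.Random()
    n = len(kernel)
    a = [list(row) for row in kernel]  # work on a copy of the kernel
    sample = []
    for j in range(n):
        # After eliminating indices 0..j-1, the pivot a[j][j] is assumed
        # to be the probability of keeping j given the earlier decisions.
        if rng.random() < a[j][j]:
            sample.append(j)
        else:
            # Rejecting j: condition on its absence by shifting the pivot.
            a[j][j] -= 1.0
        pivot = a[j][j]
        # Ordinary unpivoted LU step on the trailing submatrix.
        for i in range(j + 1, n):
            a[i][j] /= pivot
        for i in range(j + 1, n):
            multiplier = a[i][j]
            for k in range(j + 1, n):
                a[i][k] -= multiplier * a[j][k]
    return sample
```

For example, with the rank-one projection kernel `[[0.5, 0.5], [0.5, 0.5]]`, every call returns exactly one index, whichever way the first draw goes. A Hermitian kernel passes through the same loop; an $LDL^H$ variant would additionally exploit symmetry to halve the work, which this sketch does not attempt.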

Comments:	25 pages, 11 figures. Submitted to the Royal Society's Philosophical Transactions A
Subjects:	Numerical Analysis (math.NA)
Cite as:	arXiv:1905.00165 [math.NA]
	(or arXiv:1905.00165v2 [math.NA] for this version)
	https://doi.org/10.48550/arXiv.1905.00165
Related DOI:	https://doi.org/10.1098/rsta.2019.0059

Submission history

From: Jack Poulson
[v1] Wed, 1 May 2019 02:37:43 UTC (598 KB)
[v2] Sat, 17 Aug 2019 17:55:28 UTC (607 KB)
