Uniform error bound for PCA matrix denoising

Tong, Xin T.; Wang, Wanjie; Wang, Yuguan

Mathematics > Statistics Theory

arXiv:2306.12690 (math)

[Submitted on 22 Jun 2023 (v1), last revised 28 Aug 2024 (this version, v3)]

Title:Uniform error bound for PCA matrix denoising

Authors:Xin T. Tong, Wanjie Wang, Yuguan Wang

View PDF HTML (experimental)

Abstract:Principal component analysis (PCA) is a simple and popular tool for processing high-dimensional data. We investigate its effectiveness for matrix denoising.
We consider the clean data are generated from a low-dimensional subspace, but masked by independent high-dimensional sub-Gaussian noises with standard deviation $\sigma$. Under the low-rank assumption on the clean data with a mild spectral gap assumption, we prove that the distance between each pair of PCA-denoised data point and the clean data point is uniformly bounded by $O(\sigma \log n)$. To illustrate the spectral gap assumption, we show it can be satisfied when the clean data are independently generated with a non-degenerate covariance matrix. We then provide a general lower bound for the error of the denoised data matrix, which indicates PCA denoising gives a uniform error bound that is rate-optimal. Furthermore, we examine how the error bound impacts downstream applications such as clustering and manifold learning. Numerical results validate our theoretical findings and reveal the importance of the uniform error.

Comments:	33 pages, 2 figures
Subjects:	Statistics Theory (math.ST); Methodology (stat.ME)
MSC classes:	62H25(primary), 62H30, 62R30
Cite as:	arXiv:2306.12690 [math.ST]
	(or arXiv:2306.12690v3 [math.ST] for this version)
	https://doi.org/10.48550/arXiv.2306.12690

Submission history

From: Wanjie Wang [view email]
[v1] Thu, 22 Jun 2023 06:26:36 UTC (460 KB)
[v2] Mon, 11 Mar 2024 08:15:37 UTC (460 KB)
[v3] Wed, 28 Aug 2024 09:10:29 UTC (468 KB)

Mathematics > Statistics Theory

Title:Uniform error bound for PCA matrix denoising

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Mathematics > Statistics Theory

Title:Uniform error bound for PCA matrix denoising

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators