Self-Expressive Decompositions for Matrix Approximation and Clustering

Dyer, Eva L.; Goldstein, Tom A.; Patel, Raajen; Kording, Konrad P.; Baraniuk, Richard G.

Computer Science > Information Theory

arXiv:1505.00824 (cs)

[Submitted on 4 May 2015]

Title:Self-Expressive Decompositions for Matrix Approximation and Clustering

Authors:Eva L. Dyer, Tom A. Goldstein, Raajen Patel, Konrad P. Kording, Richard G. Baraniuk

View PDF

Abstract:Data-aware methods for dimensionality reduction and matrix decomposition aim to find low-dimensional structure in a collection of data. Classical approaches discover such structure by learning a basis that can efficiently express the collection. Recently, "self expression", the idea of using a small subset of data vectors to represent the full collection, has been developed as an alternative to learning. Here, we introduce a scalable method for computing sparse SElf-Expressive Decompositions (SEED). SEED is a greedy method that constructs a basis by sequentially selecting incoherent vectors from the dataset. After forming a basis from a subset of vectors in the dataset, SEED then computes a sparse representation of the dataset with respect to this basis. We develop sufficient conditions under which SEED exactly represents low rank matrices and vectors sampled from a unions of independent subspaces. We show how SEED can be used in applications ranging from matrix approximation and denoising to clustering, and apply it to numerous real-world datasets. Our results demonstrate that SEED is an attractive low-complexity alternative to other sparse matrix factorization approaches such as sparse PCA and self-expressive methods for clustering.

Comments:	11 pages, 7 figures
Subjects:	Information Theory (cs.IT); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Machine Learning (stat.ML)
Cite as:	arXiv:1505.00824 [cs.IT]
	(or arXiv:1505.00824v1 [cs.IT] for this version)
	https://doi.org/10.48550/arXiv.1505.00824

Submission history

From: Eva Dyer [view email]
[v1] Mon, 4 May 2015 21:56:54 UTC (2,290 KB)

Computer Science > Information Theory

Title:Self-Expressive Decompositions for Matrix Approximation and Clustering

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Information Theory

Title:Self-Expressive Decompositions for Matrix Approximation and Clustering

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators