Large-scale subspace clustering using sketching and validation

Traganitis, Panagiotis A.; Slavakis, Konstantinos; Giannakis, Georgios B.

arXiv:1510.01628 (cs)

[Submitted on 6 Oct 2015]

Abstract: The nowadays massive amounts of generated and communicated data present major challenges in their processing. While capable of successfully classifying nonlinearly separable objects in various settings, subspace clustering (SC) methods incur prohibitively high computational complexity when processing large-scale data. Inspired by the random sampling and consensus (RANSAC) approach to robust regression, the present paper introduces a randomized scheme for SC, termed sketching and validation (SkeVa-)SC, tailored for large-scale data. At the heart of SkeVa-SC lies a randomized scheme for approximating the underlying probability density function of the observed data by kernel smoothing arguments. Sparsity in data representations is also exploited to reduce the computational burden of SC, while achieving high clustering accuracy. Performance analysis as well as extensive numerical tests on synthetic and real data corroborate the potential of SkeVa-SC and its competitive performance relative to state-of-the-art scalable SC approaches.

Keywords: Subspace clustering, big data, kernel smoothing, randomization, sketching, validation, sparsity.
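The abstract names two ingredients: RANSAC-style random draws ("sketching") and a kernel-smoothing estimate of the data density ("validation"). The Python sketch below illustrates only that generic pattern and is not the authors' SkeVa-SC algorithm. The function names, the Gaussian kernel, the fixed `bandwidth`, and the rule of keeping the sketch with the highest mean kernel score on a disjoint validation draw are all assumptions made for illustration; the abstract does not specify any of them.

```python
import numpy as np


def gaussian_kernel_score(sketch, validation, bandwidth):
    """Mean Gaussian-kernel value between validation points and sketch points.

    Hypothetical scoring rule: an unnormalized kernel density estimate built
    from `sketch`, averaged over the `validation` points.
    """
    # Pairwise squared Euclidean distances, shape (n_validation, n_sketch).
    diffs = validation[:, None, :] - sketch[None, :, :]
    sq_dists = np.sum(diffs ** 2, axis=2)
    # The normalizing constant is dropped; it does not change the ranking.
    return float(np.mean(np.exp(-sq_dists / (2.0 * bandwidth ** 2))))


def sketch_and_validate(data, sketch_size, validation_size, n_trials,
                        bandwidth, seed=0):
    """Draw random sketches and keep the one that scores best on validation.

    Returns the row indices of the best sketch and its score. A clustering
    routine would then be run on that small sketch instead of on all of `data`.
    """
    rng = np.random.default_rng(seed)
    n_points = data.shape[0]
    best_score, best_idx = -np.inf, None
    for _ in range(n_trials):
        # Sample sketch and validation rows jointly so that they are disjoint.
        idx = rng.choice(n_points, size=sketch_size + validation_size,
                         replace=False)
        sketch_idx, val_idx = idx[:sketch_size], idx[sketch_size:]
        score = gaussian_kernel_score(data[sketch_idx], data[val_idx],
                                      bandwidth)
        if score > best_score:
            best_score, best_idx = score, sketch_idx
    return best_idx, best_score


if __name__ == "__main__":
    # Synthetic data: noisy points on two lines through the origin in R^3.
    rng = np.random.default_rng(42)
    basis_a = np.array([1.0, 0.0, 0.0])
    basis_b = np.array([0.0, 1.0, 1.0]) / np.sqrt(2.0)
    coeffs = rng.normal(size=(500, 1))
    points = np.vstack([coeffs[:250] * basis_a, coeffs[250:] * basis_b])
    points += 0.01 * rng.normal(size=points.shape)
    idx, score = sketch_and_validate(points, sketch_size=40,
                                     validation_size=60, n_trials=20,
                                     bandwidth=0.5)
    print(len(idx), score)
```

Each trial costs on the order of `sketch_size * validation_size` kernel evaluations, independent of the total number of points, which is the kind of saving a sketch-based scheme aims for on large data sets.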

Subjects:	Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (stat.ML)
Cite as:	arXiv:1510.01628 [cs.LG]
	(or arXiv:1510.01628v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.1510.01628

Submission history

From: Panagiotis Traganitis
[v1] Tue, 6 Oct 2015 15:34:32 UTC (997 KB)
