Multi-Layer Convolutional Sparse Modeling: Pursuit and Dictionary Learning

Sulam, Jeremias; Papyan, Vardan; Romano, Yaniv; Elad, Michael

doi:10.1109/TSP.2018.2846226


arXiv:1708.08705 (cs)

[Submitted on 29 Aug 2017 (v1), last revised 30 Jun 2018 (this version, v2)]


Abstract: The recently proposed Multi-Layer Convolutional Sparse Coding (ML-CSC) model, consisting of a cascade of convolutional sparse layers, provides a new interpretation of Convolutional Neural Networks (CNNs). Under this framework, the computation of the forward pass in a CNN is equivalent to a pursuit algorithm aiming to estimate the nested sparse representation vectors, or feature maps, from a given input signal. Despite having served as a pivotal connection between CNNs and sparse modeling, a deeper understanding of the ML-CSC is still lacking: there are no pursuit algorithms that can serve this model exactly, nor are there conditions to guarantee a non-empty model. While one can easily obtain signals that approximately satisfy the ML-CSC constraints, it remains unclear how to simply sample from the model and, more importantly, how one can train the convolutional filters from real data.
In this work, we propose a sound pursuit algorithm for the ML-CSC model by adopting a projection approach. We provide new and improved bounds on the stability of the solution of such a pursuit, and we analyze different practical alternatives for implementing it. We show that the training of the filters is essential to allow for non-trivial signals in the model, and we derive an online algorithm to learn the dictionaries from real data, effectively resulting in cascaded sparse convolutional layers. Last, but not least, we demonstrate the applicability of the ML-CSC model to several applications in an unsupervised setting, providing competitive results. Our work represents a bridge between matrix factorization, sparse dictionary learning and sparse auto-encoders, and we analyze these connections in detail.
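
The abstract's statement that a CNN forward pass is equivalent to a pursuit algorithm can be illustrated with a small sketch. It assumes the pursuit in question is layer-wise non-negative soft thresholding, and it uses small dense matrices in place of convolutional dictionaries. The names `D1`, `D2`, `b1`, `b2` and all helper functions are hypothetical and chosen for illustration only. This is not the projection-based pursuit proposed in the paper.

```python
# Sketch only: dense matrices stand in for convolutional dictionaries,
# and every name below is hypothetical, not taken from the paper.

def matvec_T(D, x):
    """Compute D^T x for a matrix D stored as a list of rows."""
    n_rows, n_cols = len(D), len(D[0])
    return [sum(D[i][j] * x[i] for i in range(n_rows)) for j in range(n_cols)]

def relu(v):
    return [max(0.0, t) for t in v]

def soft_threshold_nonneg(v, b):
    """Non-negative soft thresholding: shrink each entry by b[j], zero the rest."""
    return [t - bj if t > bj else 0.0 for t, bj in zip(v, b)]

def layered_thresholding(x, dictionaries, thresholds):
    """Estimate the nested representations gamma_1, gamma_2, ... layer by layer."""
    gammas, current = [], x
    for D, b in zip(dictionaries, thresholds):
        current = soft_threshold_nonneg(matvec_T(D, current), b)
        gammas.append(current)
    return gammas

def cnn_forward(x, dictionaries, thresholds):
    """Forward pass with weights D^T, biases -b, and ReLU activations."""
    acts, current = [], x
    for D, b in zip(dictionaries, thresholds):
        pre = matvec_T(D, current)
        current = relu([p - bj for p, bj in zip(pre, b)])
        acts.append(current)
    return acts

# Toy two-layer model: x ~ D1 @ gamma1 and gamma1 ~ D2 @ gamma2.
D1 = [[1.0, 0.0, 0.5],
      [0.0, 1.0, -0.5]]   # 2 x 3
D2 = [[1.0, -1.0],
      [0.5, 0.5],
      [0.0, 2.0]]         # 3 x 2
b1 = [0.1, 0.2, 0.3]
b2 = [0.05, 0.4]
x = [1.0, -0.5]

gammas = layered_thresholding(x, [D1, D2], [b1, b2])
acts = cnn_forward(x, [D1, D2], [b1, b2])
print(gammas == acts)
```

Under these assumptions the two functions produce the same feature maps at every layer, because a ReLU applied after subtracting a bias is the same operation as non-negative soft thresholding with that bias as the threshold.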

Subjects:	Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Machine Learning (stat.ML)
Cite as:	arXiv:1708.08705 [cs.CV]
	(or arXiv:1708.08705v2 [cs.CV] for this version)
 	https://doi.org/10.48550/arXiv.1708.08705
Journal reference:	IEEE Transactions on Signal Processing, vol. 66, no. 15, pp. 4090-4104, Aug. 1, 2018
Related DOI:	https://doi.org/10.1109/TSP.2018.2846226

Submission history

From: Jeremias Sulam
[v1] Tue, 29 Aug 2017 11:43:40 UTC (1,306 KB)
[v2] Sat, 30 Jun 2018 19:46:15 UTC (1,162 KB)
