Adaptive Learning of Tensor Network Structures

Hashemizadeh, Meraj; Liu, Michelle; Miller, Jacob; Rabusseau, Guillaume

Computer Science > Machine Learning

arXiv:2008.05437 (cs)

[Submitted on 12 Aug 2020 (v1), last revised 22 Jun 2021 (this version, v2)]

Title:Adaptive Learning of Tensor Network Structures

Authors:Meraj Hashemizadeh, Michelle Liu, Jacob Miller, Guillaume Rabusseau

View PDF

Abstract:Tensor Networks (TN) offer a powerful framework to efficiently represent very high-dimensional objects. TN have recently shown their potential for machine learning applications and offer a unifying view of common tensor decomposition models such as Tucker, tensor train (TT) and tensor ring (TR). However, identifying the best tensor network structure from data for a given task is challenging. In this work, we leverage the TN formalism to develop a generic and efficient adaptive algorithm to jointly learn the structure and the parameters of a TN from data. Our method is based on a simple greedy approach starting from a rank one tensor and successively identifying the most promising tensor network edges for small rank increments. Our algorithm can adaptively identify TN structures with small number of parameters that effectively optimize any differentiable objective function. Experiments on tensor decomposition, tensor completion and model compression tasks demonstrate the effectiveness of the proposed algorithm. In particular, our method outperforms the state-of-the-art evolutionary topology search [Li and Sun, 2020] for tensor decomposition of images (while being orders of magnitude faster) and finds efficient tensor network structures to compress neural networks outperforming popular TT based approaches [Novikov et al., 2015].

Subjects:	Machine Learning (cs.LG); Machine Learning (stat.ML)
Cite as:	arXiv:2008.05437 [cs.LG]
	(or arXiv:2008.05437v2 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2008.05437

Submission history

From: Meraj Hashemizadeh [view email]
[v1] Wed, 12 Aug 2020 16:41:56 UTC (2,590 KB)
[v2] Tue, 22 Jun 2021 18:46:43 UTC (1,857 KB)

Computer Science > Machine Learning

Title:Adaptive Learning of Tensor Network Structures

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Adaptive Learning of Tensor Network Structures

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators