Parallel Coordinate Descent Newton Method for Efficient $\ell_1$-Regularized Minimization

Bian, An; Li, Xiong; Liu, Yuncai; Yang, Ming-Hsuan

Computer Science > Machine Learning

arXiv:1306.4080 (cs)

[Submitted on 18 Jun 2013 (v1), last revised 7 Dec 2017 (this version, v4)]

Title:Parallel Coordinate Descent Newton Method for Efficient $\ell_1$-Regularized Minimization

Authors:An Bian, Xiong Li, Yuncai Liu, Ming-Hsuan Yang

View PDF

Abstract:The recent years have witnessed advances in parallel algorithms for large scale optimization problems. Notwithstanding demonstrated success, existing algorithms that parallelize over features are usually limited by divergence issues under high parallelism or require data preprocessing to alleviate these problems. In this work, we propose a Parallel Coordinate Descent Newton algorithm using multidimensional approximate Newton steps (PCDN), where the off-diagonal elements of the Hessian are set to zero to enable parallelization. It randomly partitions the feature set into $b$ bundles/subsets with size of $P$, and sequentially processes each bundle by first computing the descent directions for each feature in parallel and then conducting $P$-dimensional line search to obtain the step size. We show that: (1) PCDN is guaranteed to converge globally despite increasing parallelism; (2) PCDN converges to the specified accuracy $\epsilon$ within the limited iteration number of $T_\epsilon$, and $T_\epsilon$ decreases with increasing parallelism (bundle size $P$). Using the implementation technique of maintaining intermediate quantities, we minimize the data transfer and synchronization cost of the $P$-dimensional line search. For concreteness, the proposed PCDN algorithm is applied to $\ell_1$-regularized logistic regression and $\ell_2$-loss SVM. Experimental evaluations on six benchmark datasets show that the proposed PCDN algorithm exploits parallelism well and outperforms the state-of-the-art methods in speed without losing accuracy.

Subjects:	Machine Learning (cs.LG); Numerical Analysis (math.NA)
Cite as:	arXiv:1306.4080 [cs.LG]
	(or arXiv:1306.4080v4 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.1306.4080

Submission history

From: An Bian [view email]
[v1] Tue, 18 Jun 2013 07:03:16 UTC (2,965 KB)
[v2] Fri, 27 Dec 2013 08:41:37 UTC (308 KB)
[v3] Tue, 18 Mar 2014 14:55:49 UTC (486 KB)
[v4] Thu, 7 Dec 2017 09:16:27 UTC (1,557 KB)

Computer Science > Machine Learning

Title:Parallel Coordinate Descent Newton Method for Efficient $\ell_1$-Regularized Minimization

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Parallel Coordinate Descent Newton Method for Efficient $\ell_1$-Regularized Minimization

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators