Non-strongly-convex smooth stochastic approximation with convergence rate O(1/n)

Bach, Francis; Moulines, Eric

Computer Science > Machine Learning

arXiv:1306.2119 (cs)

[Submitted on 10 Jun 2013]

Title:Non-strongly-convex smooth stochastic approximation with convergence rate O(1/n)

Authors:Francis Bach (INRIA Paris - Rocquencourt, LIENS), Eric Moulines (LTCI)

View PDF

Abstract:We consider the stochastic approximation problem where a convex function has to be minimized, given only the knowledge of unbiased estimates of its gradients at certain points, a framework which includes machine learning methods based on the minimization of the empirical risk. We focus on problems without strong convexity, for which all previously known algorithms achieve a convergence rate for function values of O(1/n^{1/2}). We consider and analyze two algorithms that achieve a rate of O(1/n) for classical supervised learning problems. For least-squares regression, we show that averaged stochastic gradient descent with constant step-size achieves the desired rate. For logistic regression, this is achieved by a simple novel stochastic gradient algorithm that (a) constructs successive local quadratic approximations of the loss functions, while (b) preserving the same running time complexity as stochastic gradient descent. For these algorithms, we provide a non-asymptotic analysis of the generalization error (in expectation, and also in high probability for least-squares), and run extensive experiments on standard machine learning benchmarks showing that they often outperform existing approaches.

Subjects:	Machine Learning (cs.LG); Optimization and Control (math.OC); Machine Learning (stat.ML)
Cite as:	arXiv:1306.2119 [cs.LG]
	(or arXiv:1306.2119v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.1306.2119

Submission history

From: Francis Bach [view email] [via CCSD proxy]
[v1] Mon, 10 Jun 2013 07:31:10 UTC (130 KB)

Full-text links:

Access Paper:

view license

Current browse context:

cs.LG

< prev | next >

new | recent | 2013-06

Change to browse by:

cs
math
math.OC
stat
stat.ML

References & Citations

DBLP - CS Bibliography

listing | bibtex

Francis Bach
Francis R. Bach
Eric Moulines

export BibTeX citation

Computer Science > Machine Learning

Title:Non-strongly-convex smooth stochastic approximation with convergence rate O(1/n)

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Non-strongly-convex smooth stochastic approximation with convergence rate O(1/n)

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators