Block stochastic gradient iteration for convex and nonconvex optimization

Xu, Yangyang; Yin, Wotao


arXiv:1408.2597 (math)

[Submitted on 12 Aug 2014 (v1), last revised 2 Mar 2015 (this version, v3)]


Abstract: The stochastic gradient (SG) method can minimize an objective function composed of a large number of differentiable functions, or solve a stochastic optimization problem, to moderate accuracy. The block coordinate descent/update (BCD) method, on the other hand, handles problems with multiple blocks of variables by updating them one at a time; when the blocks of variables are easier to update individually than together, BCD has a lower per-iteration cost. This paper introduces a method that combines the features of SG and BCD for problems with many components in the objective and with multiple (blocks of) variables.
Specifically, a block stochastic gradient (BSG) method is proposed for solving both convex and nonconvex programs. At each iteration, BSG approximates the gradient of the differentiable part of the objective by randomly sampling a small set of data or sampling a few functions from the sum term in the objective, and then, using those samples, it updates all the blocks of variables in either a deterministic or a randomly shuffled order. Its convergence is established for both the convex and nonconvex cases, in different senses. In the convex case, the proposed method has the same order of convergence rate as the SG method. In the nonconvex case, its convergence is established in terms of the expected violation of a first-order optimality condition. The proposed method was numerically tested on problems including stochastic least squares and logistic regression, which are convex, as well as low-rank tensor recovery and bilinear logistic regression, which are nonconvex.
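
The iteration described in the abstract can be illustrated on a small stochastic least-squares problem. The following is a minimal sketch under stated assumptions, not the authors' implementation: the function name `bsg_least_squares`, its parameters, the diminishing step size `step0 / sqrt(k)`, and the absence of a regularization term are all illustrative choices that the abstract does not specify. Each outer iteration draws one mini-batch and then sweeps over the blocks, taking a partial-gradient step on each block at the current point, which already contains the blocks updated earlier in the sweep.

```python
import numpy as np

def bsg_least_squares(A, b, blocks, n_iters=2000, batch_size=10,
                      step0=0.1, shuffle=True, seed=0):
    """Sketch of a block stochastic gradient loop for min_x mean((A x - b)^2) / 2.

    `blocks` is a list of integer index arrays that partition the coordinates.
    All names and the step-size rule are assumptions made for illustration.
    """
    rng = np.random.default_rng(seed)
    n_samples, n_features = A.shape
    x = np.zeros(n_features)
    for k in range(1, n_iters + 1):
        # Draw one mini-batch and reuse it for every block in this sweep.
        idx = rng.choice(n_samples, size=batch_size, replace=False)
        A_s, b_s = A[idx], b[idx]
        step = step0 / np.sqrt(k)  # assumed diminishing step size
        # Visit the blocks in a shuffled or a fixed (deterministic) order.
        order = rng.permutation(len(blocks)) if shuffle else range(len(blocks))
        for i in order:
            blk = blocks[i]
            # Sampled partial gradient w.r.t. this block, evaluated at the
            # current x (blocks updated earlier in the sweep are included).
            residual = A_s @ x - b_s
            grad_blk = A_s[:, blk].T @ residual / batch_size
            x[blk] -= step * grad_blk
    return x

# Usage on synthetic, noiseless data with two blocks of three coordinates.
demo_rng = np.random.default_rng(1)
A_demo = demo_rng.standard_normal((200, 6))
x_true = demo_rng.standard_normal(6)
b_demo = A_demo @ x_true
demo_blocks = [np.arange(0, 3), np.arange(3, 6)]
x_hat = bsg_least_squares(A_demo, b_demo, demo_blocks)
print("recovery error:", np.linalg.norm(x_hat - x_true))
```

Setting `shuffle=False` gives the deterministic block order mentioned in the abstract. Handling a nonsmooth term in the objective would require replacing the plain gradient step on each block with a different update, which this sketch does not attempt.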

Subjects:	Optimization and Control (math.OC); Machine Learning (cs.LG); Numerical Analysis (math.NA); Machine Learning (stat.ML)
MSC classes:	90C06
Cite as:	arXiv:1408.2597 [math.OC]
	(or arXiv:1408.2597v3 [math.OC] for this version)
	https://doi.org/10.48550/arXiv.1408.2597

Submission history

From: Yangyang Xu
[v1] Tue, 12 Aug 2014 01:21:42 UTC (203 KB)
[v2] Tue, 26 Aug 2014 13:14:26 UTC (217 KB)
[v3] Mon, 2 Mar 2015 04:02:54 UTC (252 KB)
