Dual Free Adaptive Minibatch SDCA for Empirical Risk Minimization

In this paper we develop an adaptive dual free Stochastic Dual Coordinate Ascent (adfSDCA) algorithm for regularized empirical risk minimization problems. This is motivated by the recent work on dual free SDCA of Shalev-Shwartz [1]. The novelty of our approach is that the coordinates to update at ea...

Full description

Bibliographic Details
Main Authors:	Xi He, Rachael Tappenden, Martin Takáč
Format:	Article
Language:	English
Published:	Frontiers Media S.A. 2018-07-01
Series:	Frontiers in Applied Mathematics and Statistics
Subjects:	SDCA importance sampling non-uniform sampling mini-batch adaptive
Online Access:	https://www.frontiersin.org/article/10.3389/fams.2018.00033/full

id	doaj-77814a7c16e249a789dc03c3d28af204
record_format	Article
spelling	doaj-77814a7c16e249a789dc03c3d28af2042020-11-25T02:08:31ZengFrontiers Media S.A.Frontiers in Applied Mathematics and Statistics2297-46872018-07-01410.3389/fams.2018.00033348053Dual Free Adaptive Minibatch SDCA for Empirical Risk MinimizationXi He0Rachael Tappenden1Martin Takáč2Industrial and Systems Engineering, Lehigh UniversityBethlehem, PA, United StatesSchool of Mathematics and Statistics, University of Canterbury, Christchurch, New ZealandIndustrial and Systems Engineering, Lehigh UniversityBethlehem, PA, United StatesIn this paper we develop an adaptive dual free Stochastic Dual Coordinate Ascent (adfSDCA) algorithm for regularized empirical risk minimization problems. This is motivated by the recent work on dual free SDCA of Shalev-Shwartz [1]. The novelty of our approach is that the coordinates to update at each iteration are selected non-uniformly from an adaptive probability distribution, and this extends the previously mentioned work which only allowed for a uniform selection of “dual” coordinates from a fixed probability distribution. We describe an efficient iterative procedure for generating the non-uniform samples, where the scheme selects the coordinate with the greatest potential to decrease the sub-optimality of the current iterate. We also propose a heuristic variant of adfSDCA that is more aggressive than the standard approach. Furthermore, in order to utilize multi-core machines we consider a mini-batch adfSDCA algorithm and develop complexity results that guarantee the algorithm's convergence. The work is concluded with several numerical experiments to demonstrate the practical benefits of the proposed approach.https://www.frontiersin.org/article/10.3389/fams.2018.00033/fullSDCAimportance samplingnon-uniform samplingmini-batchadaptive
collection	DOAJ
language	English
format	Article
sources	DOAJ
author	Xi He Rachael Tappenden Martin Takáč
spellingShingle	Xi He Rachael Tappenden Martin Takáč Dual Free Adaptive Minibatch SDCA for Empirical Risk Minimization Frontiers in Applied Mathematics and Statistics SDCA importance sampling non-uniform sampling mini-batch adaptive
author_facet	Xi He Rachael Tappenden Martin Takáč
author_sort	Xi He
title	Dual Free Adaptive Minibatch SDCA for Empirical Risk Minimization
title_short	Dual Free Adaptive Minibatch SDCA for Empirical Risk Minimization
title_full	Dual Free Adaptive Minibatch SDCA for Empirical Risk Minimization
title_fullStr	Dual Free Adaptive Minibatch SDCA for Empirical Risk Minimization
title_full_unstemmed	Dual Free Adaptive Minibatch SDCA for Empirical Risk Minimization
title_sort	dual free adaptive minibatch sdca for empirical risk minimization
publisher	Frontiers Media S.A.
series	Frontiers in Applied Mathematics and Statistics
issn	2297-4687
publishDate	2018-07-01
description	In this paper we develop an adaptive dual free Stochastic Dual Coordinate Ascent (adfSDCA) algorithm for regularized empirical risk minimization problems. This is motivated by the recent work on dual free SDCA of Shalev-Shwartz [1]. The novelty of our approach is that the coordinates to update at each iteration are selected non-uniformly from an adaptive probability distribution, and this extends the previously mentioned work which only allowed for a uniform selection of “dual” coordinates from a fixed probability distribution. We describe an efficient iterative procedure for generating the non-uniform samples, where the scheme selects the coordinate with the greatest potential to decrease the sub-optimality of the current iterate. We also propose a heuristic variant of adfSDCA that is more aggressive than the standard approach. Furthermore, in order to utilize multi-core machines we consider a mini-batch adfSDCA algorithm and develop complexity results that guarantee the algorithm's convergence. The work is concluded with several numerical experiments to demonstrate the practical benefits of the proposed approach.
topic	SDCA importance sampling non-uniform sampling mini-batch adaptive
url	https://www.frontiersin.org/article/10.3389/fams.2018.00033/full
work_keys_str_mv	AT xihe dualfreeadaptiveminibatchsdcaforempiricalriskminimization AT rachaeltappenden dualfreeadaptiveminibatchsdcaforempiricalriskminimization AT martintakac dualfreeadaptiveminibatchsdcaforempiricalriskminimization
_version_	1724926818593538048

Dual Free Adaptive Minibatch SDCA for Empirical Risk Minimization

Similar Items