A Novel Convex Clustering Method for High-Dimensional Data Using Semiproximal ADMM

Clustering is an important ingredient of unsupervised learning; classical clustering methods include K-means clustering and hierarchical clustering. These methods may suffer from instability because of their tendency prone to sink into the local optimal solutions of the nonconvex optimization model....

Full description

Bibliographic Details
Main Authors: Huangyue Chen, Lingchen Kong, Yan Li
Format: Article
Language:English
Published: Hindawi Limited 2020-01-01
Series:Mathematical Problems in Engineering
Online Access:http://dx.doi.org/10.1155/2020/9216351
id doaj-3b1ab093039642ec9bf97654586f2e18
record_format Article
spelling doaj-3b1ab093039642ec9bf97654586f2e182020-11-25T03:47:53ZengHindawi LimitedMathematical Problems in Engineering1024-123X1563-51472020-01-01202010.1155/2020/92163519216351A Novel Convex Clustering Method for High-Dimensional Data Using Semiproximal ADMMHuangyue Chen0Lingchen Kong1Yan Li2Department of Applied Mathematics, Beijing Jiaotong University, Beijing 100044, ChinaDepartment of Applied Mathematics, Beijing Jiaotong University, Beijing 100044, ChinaSchool of Insurance and Economics, University of International Business and Economics, Beijing 100029, ChinaClustering is an important ingredient of unsupervised learning; classical clustering methods include K-means clustering and hierarchical clustering. These methods may suffer from instability because of their tendency prone to sink into the local optimal solutions of the nonconvex optimization model. In this paper, we propose a new convex clustering method for high-dimensional data based on the sparse group lasso penalty, which can simultaneously group observations and eliminate noninformative features. In this method, the number of clusters can be learned from the data instead of being given in advance as a parameter. We theoretically prove that the proposed method has desirable statistical properties, including a finite sample error bound and feature screening consistency. Furthermore, the semiproximal alternating direction method of multipliers is designed to solve the sparse group lasso convex clustering model, and its convergence analysis is established without any conditions. Finally, the effectiveness of the proposed method is thoroughly demonstrated through simulated experiments and real applications.http://dx.doi.org/10.1155/2020/9216351
collection DOAJ
language English
format Article
sources DOAJ
author Huangyue Chen
Lingchen Kong
Yan Li
spellingShingle Huangyue Chen
Lingchen Kong
Yan Li
A Novel Convex Clustering Method for High-Dimensional Data Using Semiproximal ADMM
Mathematical Problems in Engineering
author_facet Huangyue Chen
Lingchen Kong
Yan Li
author_sort Huangyue Chen
title A Novel Convex Clustering Method for High-Dimensional Data Using Semiproximal ADMM
title_short A Novel Convex Clustering Method for High-Dimensional Data Using Semiproximal ADMM
title_full A Novel Convex Clustering Method for High-Dimensional Data Using Semiproximal ADMM
title_fullStr A Novel Convex Clustering Method for High-Dimensional Data Using Semiproximal ADMM
title_full_unstemmed A Novel Convex Clustering Method for High-Dimensional Data Using Semiproximal ADMM
title_sort novel convex clustering method for high-dimensional data using semiproximal admm
publisher Hindawi Limited
series Mathematical Problems in Engineering
issn 1024-123X
1563-5147
publishDate 2020-01-01
description Clustering is an important ingredient of unsupervised learning; classical clustering methods include K-means clustering and hierarchical clustering. These methods may suffer from instability because of their tendency prone to sink into the local optimal solutions of the nonconvex optimization model. In this paper, we propose a new convex clustering method for high-dimensional data based on the sparse group lasso penalty, which can simultaneously group observations and eliminate noninformative features. In this method, the number of clusters can be learned from the data instead of being given in advance as a parameter. We theoretically prove that the proposed method has desirable statistical properties, including a finite sample error bound and feature screening consistency. Furthermore, the semiproximal alternating direction method of multipliers is designed to solve the sparse group lasso convex clustering model, and its convergence analysis is established without any conditions. Finally, the effectiveness of the proposed method is thoroughly demonstrated through simulated experiments and real applications.
url http://dx.doi.org/10.1155/2020/9216351
work_keys_str_mv AT huangyuechen anovelconvexclusteringmethodforhighdimensionaldatausingsemiproximaladmm
AT lingchenkong anovelconvexclusteringmethodforhighdimensionaldatausingsemiproximaladmm
AT yanli anovelconvexclusteringmethodforhighdimensionaldatausingsemiproximaladmm
AT huangyuechen novelconvexclusteringmethodforhighdimensionaldatausingsemiproximaladmm
AT lingchenkong novelconvexclusteringmethodforhighdimensionaldatausingsemiproximaladmm
AT yanli novelconvexclusteringmethodforhighdimensionaldatausingsemiproximaladmm
_version_ 1715116300115640320