A Novel Convex Clustering Method for High-Dimensional Data Using Semiproximal ADMM
Clustering is an important ingredient of unsupervised learning; classical clustering methods include K-means clustering and hierarchical clustering. These methods may suffer from instability because of their tendency prone to sink into the local optimal solutions of the nonconvex optimization model....
Main Authors: | , , |
---|---|
Format: | Article |
Language: | English |
Published: |
Hindawi Limited
2020-01-01
|
Series: | Mathematical Problems in Engineering |
Online Access: | http://dx.doi.org/10.1155/2020/9216351 |
id |
doaj-3b1ab093039642ec9bf97654586f2e18 |
---|---|
record_format |
Article |
spelling |
doaj-3b1ab093039642ec9bf97654586f2e182020-11-25T03:47:53ZengHindawi LimitedMathematical Problems in Engineering1024-123X1563-51472020-01-01202010.1155/2020/92163519216351A Novel Convex Clustering Method for High-Dimensional Data Using Semiproximal ADMMHuangyue Chen0Lingchen Kong1Yan Li2Department of Applied Mathematics, Beijing Jiaotong University, Beijing 100044, ChinaDepartment of Applied Mathematics, Beijing Jiaotong University, Beijing 100044, ChinaSchool of Insurance and Economics, University of International Business and Economics, Beijing 100029, ChinaClustering is an important ingredient of unsupervised learning; classical clustering methods include K-means clustering and hierarchical clustering. These methods may suffer from instability because of their tendency prone to sink into the local optimal solutions of the nonconvex optimization model. In this paper, we propose a new convex clustering method for high-dimensional data based on the sparse group lasso penalty, which can simultaneously group observations and eliminate noninformative features. In this method, the number of clusters can be learned from the data instead of being given in advance as a parameter. We theoretically prove that the proposed method has desirable statistical properties, including a finite sample error bound and feature screening consistency. Furthermore, the semiproximal alternating direction method of multipliers is designed to solve the sparse group lasso convex clustering model, and its convergence analysis is established without any conditions. Finally, the effectiveness of the proposed method is thoroughly demonstrated through simulated experiments and real applications.http://dx.doi.org/10.1155/2020/9216351 |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Huangyue Chen Lingchen Kong Yan Li |
spellingShingle |
Huangyue Chen Lingchen Kong Yan Li A Novel Convex Clustering Method for High-Dimensional Data Using Semiproximal ADMM Mathematical Problems in Engineering |
author_facet |
Huangyue Chen Lingchen Kong Yan Li |
author_sort |
Huangyue Chen |
title |
A Novel Convex Clustering Method for High-Dimensional Data Using Semiproximal ADMM |
title_short |
A Novel Convex Clustering Method for High-Dimensional Data Using Semiproximal ADMM |
title_full |
A Novel Convex Clustering Method for High-Dimensional Data Using Semiproximal ADMM |
title_fullStr |
A Novel Convex Clustering Method for High-Dimensional Data Using Semiproximal ADMM |
title_full_unstemmed |
A Novel Convex Clustering Method for High-Dimensional Data Using Semiproximal ADMM |
title_sort |
novel convex clustering method for high-dimensional data using semiproximal admm |
publisher |
Hindawi Limited |
series |
Mathematical Problems in Engineering |
issn |
1024-123X 1563-5147 |
publishDate |
2020-01-01 |
description |
Clustering is an important ingredient of unsupervised learning; classical clustering methods include K-means clustering and hierarchical clustering. These methods may suffer from instability because of their tendency prone to sink into the local optimal solutions of the nonconvex optimization model. In this paper, we propose a new convex clustering method for high-dimensional data based on the sparse group lasso penalty, which can simultaneously group observations and eliminate noninformative features. In this method, the number of clusters can be learned from the data instead of being given in advance as a parameter. We theoretically prove that the proposed method has desirable statistical properties, including a finite sample error bound and feature screening consistency. Furthermore, the semiproximal alternating direction method of multipliers is designed to solve the sparse group lasso convex clustering model, and its convergence analysis is established without any conditions. Finally, the effectiveness of the proposed method is thoroughly demonstrated through simulated experiments and real applications. |
url |
http://dx.doi.org/10.1155/2020/9216351 |
work_keys_str_mv |
AT huangyuechen anovelconvexclusteringmethodforhighdimensionaldatausingsemiproximaladmm AT lingchenkong anovelconvexclusteringmethodforhighdimensionaldatausingsemiproximaladmm AT yanli anovelconvexclusteringmethodforhighdimensionaldatausingsemiproximaladmm AT huangyuechen novelconvexclusteringmethodforhighdimensionaldatausingsemiproximaladmm AT lingchenkong novelconvexclusteringmethodforhighdimensionaldatausingsemiproximaladmm AT yanli novelconvexclusteringmethodforhighdimensionaldatausingsemiproximaladmm |
_version_ |
1715116300115640320 |