Improved Initialization of the EM Algorithm for Mixture Model Parameter Estimation

A commonly used tool for estimating the parameters of a mixture model is the Expectation−Maximization (EM) algorithm, which is an iterative procedure that can serve as a maximum-likelihood estimator. The EM algorithm has well-documented drawbacks, such as the need for good initial values a...

Full description

Bibliographic Details
Main Authors: Branislav Panić, Jernej Klemenc, Marko Nagode
Format: Article
Language:English
Published: MDPI AG 2020-03-01
Series:Mathematics
Subjects:
Online Access:https://www.mdpi.com/2227-7390/8/3/373
id doaj-a859a121c3a44bc08a7302a6fb8d1810
record_format Article
spelling doaj-a859a121c3a44bc08a7302a6fb8d18102020-11-25T03:32:29ZengMDPI AGMathematics2227-73902020-03-018337310.3390/math8030373math8030373Improved Initialization of the EM Algorithm for Mixture Model Parameter EstimationBranislav Panić0Jernej Klemenc1Marko Nagode2Faculty of Mechanical Engineering, University of Ljubljana, Aškerčeva ulica 6, 1000 Ljubljana, SloveniaFaculty of Mechanical Engineering, University of Ljubljana, Aškerčeva ulica 6, 1000 Ljubljana, SloveniaFaculty of Mechanical Engineering, University of Ljubljana, Aškerčeva ulica 6, 1000 Ljubljana, SloveniaA commonly used tool for estimating the parameters of a mixture model is the Expectation&#8722;Maximization (EM) algorithm, which is an iterative procedure that can serve as a maximum-likelihood estimator. The EM algorithm has well-documented drawbacks, such as the need for good initial values and the possibility of being trapped in local optima. Nevertheless, because of its appealing properties, EM plays an important role in estimating the parameters of mixture models. To overcome these initialization problems with EM, in this paper, we propose the Rough-Enhanced-Bayes mixture estimation (REBMIX) algorithm as a more effective initialization algorithm. Three different strategies are derived for dealing with the unknown number of components in the mixture model. These strategies are thoroughly tested on artificial datasets, density&#8722;estimation datasets and image&#8722;segmentation problems and compared with state-of-the-art initialization methods for the EM. Our proposal shows promising results in terms of clustering and density-estimation performance as well as in terms of computational efficiency. All the improvements are implemented in the <b>rebmix</b> R package.https://www.mdpi.com/2227-7390/8/3/373mixture modelparameter estimationem algorithmrebmix algorithmdensity estimationclusteringimage segmentation
collection DOAJ
language English
format Article
sources DOAJ
author Branislav Panić
Jernej Klemenc
Marko Nagode
spellingShingle Branislav Panić
Jernej Klemenc
Marko Nagode
Improved Initialization of the EM Algorithm for Mixture Model Parameter Estimation
Mathematics
mixture model
parameter estimation
em algorithm
rebmix algorithm
density estimation
clustering
image segmentation
author_facet Branislav Panić
Jernej Klemenc
Marko Nagode
author_sort Branislav Panić
title Improved Initialization of the EM Algorithm for Mixture Model Parameter Estimation
title_short Improved Initialization of the EM Algorithm for Mixture Model Parameter Estimation
title_full Improved Initialization of the EM Algorithm for Mixture Model Parameter Estimation
title_fullStr Improved Initialization of the EM Algorithm for Mixture Model Parameter Estimation
title_full_unstemmed Improved Initialization of the EM Algorithm for Mixture Model Parameter Estimation
title_sort improved initialization of the em algorithm for mixture model parameter estimation
publisher MDPI AG
series Mathematics
issn 2227-7390
publishDate 2020-03-01
description A commonly used tool for estimating the parameters of a mixture model is the Expectation&#8722;Maximization (EM) algorithm, which is an iterative procedure that can serve as a maximum-likelihood estimator. The EM algorithm has well-documented drawbacks, such as the need for good initial values and the possibility of being trapped in local optima. Nevertheless, because of its appealing properties, EM plays an important role in estimating the parameters of mixture models. To overcome these initialization problems with EM, in this paper, we propose the Rough-Enhanced-Bayes mixture estimation (REBMIX) algorithm as a more effective initialization algorithm. Three different strategies are derived for dealing with the unknown number of components in the mixture model. These strategies are thoroughly tested on artificial datasets, density&#8722;estimation datasets and image&#8722;segmentation problems and compared with state-of-the-art initialization methods for the EM. Our proposal shows promising results in terms of clustering and density-estimation performance as well as in terms of computational efficiency. All the improvements are implemented in the <b>rebmix</b> R package.
topic mixture model
parameter estimation
em algorithm
rebmix algorithm
density estimation
clustering
image segmentation
url https://www.mdpi.com/2227-7390/8/3/373
work_keys_str_mv AT branislavpanic improvedinitializationoftheemalgorithmformixturemodelparameterestimation
AT jernejklemenc improvedinitializationoftheemalgorithmformixturemodelparameterestimation
AT markonagode improvedinitializationoftheemalgorithmformixturemodelparameterestimation
_version_ 1724567882067607552