Modeling in Forestry Using Mixture Models Fitted to Grouped and Ungrouped Data
The creation and maintenance of complex forest structures has become an important forestry objective. Complex forest structures, often expressed in multimodal shapes of tree size/diameter (DBH) distributions, are challenging to model. Mixture probability density functions of two- or three-component...
Main Authors: | , |
---|---|
Format: | Article |
Language: | English |
Published: |
MDPI AG
2021-09-01
|
Series: | Forests |
Subjects: | |
Online Access: | https://www.mdpi.com/1999-4907/12/9/1196 |
id |
doaj-bf2a827c511f4f14bf24aa93a73c8d5a |
---|---|
record_format |
Article |
spelling |
doaj-bf2a827c511f4f14bf24aa93a73c8d5a2021-09-26T00:10:33ZengMDPI AGForests1999-49072021-09-01121196119610.3390/f12091196Modeling in Forestry Using Mixture Models Fitted to Grouped and Ungrouped DataEric K. Zenner0Mahdi Teimouri1Department of Ecosystem Science and Management, The Pennsylvania State University, 305 Forest Resources Building, University Park, PA 16802, USADepartment of Statistics, Faculty of Science and Engineering, Gonbad Kavous University, No. 163, Basirat Blvd, Gonbad Kavous 4971799151, IranThe creation and maintenance of complex forest structures has become an important forestry objective. Complex forest structures, often expressed in multimodal shapes of tree size/diameter (DBH) distributions, are challenging to model. Mixture probability density functions of two- or three-component gamma, log-normal, and Weibull mixture models offer a solution and can additionally provide insights into forest dynamics. Model parameters can be efficiently estimated with the maximum likelihood (ML) approach using iterative methods such as the Newton-Raphson (NR) algorithm. However, the NR algorithm is sensitive to the choice of initial values and does not always converge. As an alternative, we explored the use of the iterative expectation-maximization (EM) algorithm for estimating parameters of the aforementioned mixture models because it always converges to ML estimators. Since forestry data frequently occur both in grouped (classified) and ungrouped (raw) forms, the EM algorithm was applied to explore the goodness-of-fit of the gamma, log-normal, and Weibull mixture distributions in three sample plots that exhibited irregular, multimodal, highly skewed, and heavy-tailed DBH distributions where some size classes were empty. The EM-based goodness-of-fit was further compared against a nonparametric kernel-based density estimation (NK) model and the recently popularized gamma-shaped mixture (GSM) models using the ungrouped data. In this example application, the EM algorithm provided well-fitting two- or three-component mixture models for all three model families. The number of components of the best-fitting models differed among the three sample plots (but not among model families) and the mixture models of the log-normal and gamma families provided a better fit than the Weibull distribution for grouped and ungrouped data. For ungrouped data, both log-normal and gamma mixture distributions outperformed the GSM model and, with the exception of the multimodal diameter distribution, also the NK model. The EM algorithm appears to be a promising tool for modeling complex forest structures.https://www.mdpi.com/1999-4907/12/9/1196complexitydiameter (DBH) distributionestimation-maximization (EM) algorithmforest structuregammalog-normal |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Eric K. Zenner Mahdi Teimouri |
spellingShingle |
Eric K. Zenner Mahdi Teimouri Modeling in Forestry Using Mixture Models Fitted to Grouped and Ungrouped Data Forests complexity diameter (DBH) distribution estimation-maximization (EM) algorithm forest structure gamma log-normal |
author_facet |
Eric K. Zenner Mahdi Teimouri |
author_sort |
Eric K. Zenner |
title |
Modeling in Forestry Using Mixture Models Fitted to Grouped and Ungrouped Data |
title_short |
Modeling in Forestry Using Mixture Models Fitted to Grouped and Ungrouped Data |
title_full |
Modeling in Forestry Using Mixture Models Fitted to Grouped and Ungrouped Data |
title_fullStr |
Modeling in Forestry Using Mixture Models Fitted to Grouped and Ungrouped Data |
title_full_unstemmed |
Modeling in Forestry Using Mixture Models Fitted to Grouped and Ungrouped Data |
title_sort |
modeling in forestry using mixture models fitted to grouped and ungrouped data |
publisher |
MDPI AG |
series |
Forests |
issn |
1999-4907 |
publishDate |
2021-09-01 |
description |
The creation and maintenance of complex forest structures has become an important forestry objective. Complex forest structures, often expressed in multimodal shapes of tree size/diameter (DBH) distributions, are challenging to model. Mixture probability density functions of two- or three-component gamma, log-normal, and Weibull mixture models offer a solution and can additionally provide insights into forest dynamics. Model parameters can be efficiently estimated with the maximum likelihood (ML) approach using iterative methods such as the Newton-Raphson (NR) algorithm. However, the NR algorithm is sensitive to the choice of initial values and does not always converge. As an alternative, we explored the use of the iterative expectation-maximization (EM) algorithm for estimating parameters of the aforementioned mixture models because it always converges to ML estimators. Since forestry data frequently occur both in grouped (classified) and ungrouped (raw) forms, the EM algorithm was applied to explore the goodness-of-fit of the gamma, log-normal, and Weibull mixture distributions in three sample plots that exhibited irregular, multimodal, highly skewed, and heavy-tailed DBH distributions where some size classes were empty. The EM-based goodness-of-fit was further compared against a nonparametric kernel-based density estimation (NK) model and the recently popularized gamma-shaped mixture (GSM) models using the ungrouped data. In this example application, the EM algorithm provided well-fitting two- or three-component mixture models for all three model families. The number of components of the best-fitting models differed among the three sample plots (but not among model families) and the mixture models of the log-normal and gamma families provided a better fit than the Weibull distribution for grouped and ungrouped data. For ungrouped data, both log-normal and gamma mixture distributions outperformed the GSM model and, with the exception of the multimodal diameter distribution, also the NK model. The EM algorithm appears to be a promising tool for modeling complex forest structures. |
topic |
complexity diameter (DBH) distribution estimation-maximization (EM) algorithm forest structure gamma log-normal |
url |
https://www.mdpi.com/1999-4907/12/9/1196 |
work_keys_str_mv |
AT erickzenner modelinginforestryusingmixturemodelsfittedtogroupedandungroupeddata AT mahditeimouri modelinginforestryusingmixturemodelsfittedtogroupedandungroupeddata |
_version_ |
1717366883811852288 |