Modeling in Forestry Using Mixture Models Fitted to Grouped and Ungrouped Data

The creation and maintenance of complex forest structures has become an important forestry objective. Complex forest structures, often expressed in multimodal shapes of tree size/diameter (DBH) distributions, are challenging to model. Mixture probability density functions of two- or three-component...

Full description

Bibliographic Details
Main Authors: Eric K. Zenner, Mahdi Teimouri
Format: Article
Language:English
Published: MDPI AG 2021-09-01
Series:Forests
Subjects:
Online Access:https://www.mdpi.com/1999-4907/12/9/1196
id doaj-bf2a827c511f4f14bf24aa93a73c8d5a
record_format Article
spelling doaj-bf2a827c511f4f14bf24aa93a73c8d5a2021-09-26T00:10:33ZengMDPI AGForests1999-49072021-09-01121196119610.3390/f12091196Modeling in Forestry Using Mixture Models Fitted to Grouped and Ungrouped DataEric K. Zenner0Mahdi Teimouri1Department of Ecosystem Science and Management, The Pennsylvania State University, 305 Forest Resources Building, University Park, PA 16802, USADepartment of Statistics, Faculty of Science and Engineering, Gonbad Kavous University, No. 163, Basirat Blvd, Gonbad Kavous 4971799151, IranThe creation and maintenance of complex forest structures has become an important forestry objective. Complex forest structures, often expressed in multimodal shapes of tree size/diameter (DBH) distributions, are challenging to model. Mixture probability density functions of two- or three-component gamma, log-normal, and Weibull mixture models offer a solution and can additionally provide insights into forest dynamics. Model parameters can be efficiently estimated with the maximum likelihood (ML) approach using iterative methods such as the Newton-Raphson (NR) algorithm. However, the NR algorithm is sensitive to the choice of initial values and does not always converge. As an alternative, we explored the use of the iterative expectation-maximization (EM) algorithm for estimating parameters of the aforementioned mixture models because it always converges to ML estimators. Since forestry data frequently occur both in grouped (classified) and ungrouped (raw) forms, the EM algorithm was applied to explore the goodness-of-fit of the gamma, log-normal, and Weibull mixture distributions in three sample plots that exhibited irregular, multimodal, highly skewed, and heavy-tailed DBH distributions where some size classes were empty. The EM-based goodness-of-fit was further compared against a nonparametric kernel-based density estimation (NK) model and the recently popularized gamma-shaped mixture (GSM) models using the ungrouped data. In this example application, the EM algorithm provided well-fitting two- or three-component mixture models for all three model families. The number of components of the best-fitting models differed among the three sample plots (but not among model families) and the mixture models of the log-normal and gamma families provided a better fit than the Weibull distribution for grouped and ungrouped data. For ungrouped data, both log-normal and gamma mixture distributions outperformed the GSM model and, with the exception of the multimodal diameter distribution, also the NK model. The EM algorithm appears to be a promising tool for modeling complex forest structures.https://www.mdpi.com/1999-4907/12/9/1196complexitydiameter (DBH) distributionestimation-maximization (EM) algorithmforest structuregammalog-normal
collection DOAJ
language English
format Article
sources DOAJ
author Eric K. Zenner
Mahdi Teimouri
spellingShingle Eric K. Zenner
Mahdi Teimouri
Modeling in Forestry Using Mixture Models Fitted to Grouped and Ungrouped Data
Forests
complexity
diameter (DBH) distribution
estimation-maximization (EM) algorithm
forest structure
gamma
log-normal
author_facet Eric K. Zenner
Mahdi Teimouri
author_sort Eric K. Zenner
title Modeling in Forestry Using Mixture Models Fitted to Grouped and Ungrouped Data
title_short Modeling in Forestry Using Mixture Models Fitted to Grouped and Ungrouped Data
title_full Modeling in Forestry Using Mixture Models Fitted to Grouped and Ungrouped Data
title_fullStr Modeling in Forestry Using Mixture Models Fitted to Grouped and Ungrouped Data
title_full_unstemmed Modeling in Forestry Using Mixture Models Fitted to Grouped and Ungrouped Data
title_sort modeling in forestry using mixture models fitted to grouped and ungrouped data
publisher MDPI AG
series Forests
issn 1999-4907
publishDate 2021-09-01
description The creation and maintenance of complex forest structures has become an important forestry objective. Complex forest structures, often expressed in multimodal shapes of tree size/diameter (DBH) distributions, are challenging to model. Mixture probability density functions of two- or three-component gamma, log-normal, and Weibull mixture models offer a solution and can additionally provide insights into forest dynamics. Model parameters can be efficiently estimated with the maximum likelihood (ML) approach using iterative methods such as the Newton-Raphson (NR) algorithm. However, the NR algorithm is sensitive to the choice of initial values and does not always converge. As an alternative, we explored the use of the iterative expectation-maximization (EM) algorithm for estimating parameters of the aforementioned mixture models because it always converges to ML estimators. Since forestry data frequently occur both in grouped (classified) and ungrouped (raw) forms, the EM algorithm was applied to explore the goodness-of-fit of the gamma, log-normal, and Weibull mixture distributions in three sample plots that exhibited irregular, multimodal, highly skewed, and heavy-tailed DBH distributions where some size classes were empty. The EM-based goodness-of-fit was further compared against a nonparametric kernel-based density estimation (NK) model and the recently popularized gamma-shaped mixture (GSM) models using the ungrouped data. In this example application, the EM algorithm provided well-fitting two- or three-component mixture models for all three model families. The number of components of the best-fitting models differed among the three sample plots (but not among model families) and the mixture models of the log-normal and gamma families provided a better fit than the Weibull distribution for grouped and ungrouped data. For ungrouped data, both log-normal and gamma mixture distributions outperformed the GSM model and, with the exception of the multimodal diameter distribution, also the NK model. The EM algorithm appears to be a promising tool for modeling complex forest structures.
topic complexity
diameter (DBH) distribution
estimation-maximization (EM) algorithm
forest structure
gamma
log-normal
url https://www.mdpi.com/1999-4907/12/9/1196
work_keys_str_mv AT erickzenner modelinginforestryusingmixturemodelsfittedtogroupedandungroupeddata
AT mahditeimouri modelinginforestryusingmixturemodelsfittedtogroupedandungroupeddata
_version_ 1717366883811852288