Model combination by decomposition and aggregation

Thesis (Ph. D.)--Massachusetts Institute of Technology, Dept. of Nuclear Engineering, 2004. === Includes bibliographical references (p. 265-282). === This thesis focuses on a general problem in statistical modeling, namely model combination. It proposes a novel feature-based model combination method...


Bibliographic Details
Main Author: Xu, Mingyang, 1974-
Other Authors: Michael W. Golay.
Format: Others
Language: English
Published: Massachusetts Institute of Technology 2006
Subjects: Nuclear Engineering.
Online Access: http://hdl.handle.net/1721.1/33641
id ndltd-MIT-oai-dspace.mit.edu-1721.1-33641
record_format oai_dc
spelling ndltd-MIT-oai-dspace.mit.edu-1721.1-33641 2019-05-02T15:58:21Z Model combination by decomposition and aggregation Xu, Mingyang, 1974- Michael W. Golay. Massachusetts Institute of Technology. Dept. of Nuclear Engineering. Massachusetts Institute of Technology. Dept. of Nuclear Engineering. Nuclear Engineering. Thesis (Ph. D.)--Massachusetts Institute of Technology, Dept. of Nuclear Engineering, 2004. Includes bibliographical references (p. 265-282). by Mingyang Xu. Ph.D. 2006-07-31T15:19:25Z 2006-07-31T15:19:25Z 2004 2004 Thesis http://hdl.handle.net/1721.1/33641 64394023 eng M.I.T. theses are protected by copyright. They may be viewed from this source for any purpose, but reproduction or distribution in any format is prohibited without written permission. See provided URL for inquiries about permission. http://dspace.mit.edu/handle/1721.1/7582 282 p. 15496883 bytes 15510291 bytes application/pdf application/pdf application/pdf Massachusetts Institute of Technology
collection NDLTD
language English
format Others
sources NDLTD
topic Nuclear Engineering.
description Thesis (Ph. D.)--Massachusetts Institute of Technology, Dept. of Nuclear Engineering, 2004. === Includes bibliographical references (p. 265-282). === This thesis focuses on a general problem in statistical modeling, namely model combination. It proposes a novel feature-based model combination method to improve model accuracy and reduce model uncertainty. In this method, a set of candidate models is first decomposed into a group of components, or features, and then components are selected and aggregated into a composite model based on data. Implementing this method raises several central challenges: candidate model choice, component selection, data noise modeling, model uncertainty reduction, and model locality. New methods are put forward to address each of them. For choosing candidate models, four criteria are proposed (accuracy, diversity, independence, and completeness), quantitative measures are designed for each criterion, and an overall preference score is generated for each model in the pool. Principal component analysis (PCA) and independent component analysis (ICA) are applied to decompose candidate models into components, and multiple linear regression is employed to aggregate the components into a composite model. To reduce model structure uncertainty, a new concept of fuzzy variable selection is introduced to carry out component selection; it combines the interpretability of classical variable selection with the stability of shrinkage estimators. To deal with parameter estimation uncertainty, the exponential power distribution is proposed to model unknown non-Gaussian noise, and a parametric weighted least-squares method is devised to estimate parameters in the presence of non-Gaussian noise. These two methods work together to reduce model uncertainty, including both model structure uncertainty and parameter uncertainty. To handle model locality, i.e., the fact that candidate models do not work equally well over different regions, an adaptive fuzzy mixture of local ICA models is developed. It splits the entire input space into domains, builds a local ICA model within each sub-region, and then combines the local models into a mixture model. Many different experiments are carried out to demonstrate the performance of this novel method. Our simulation study and comparison show that this new method meets our goals and outperforms existing methods in most situations. === by Mingyang Xu. === Ph.D.
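The decompose-then-aggregate idea in the abstract can be illustrated with a minimal sketch. It is not the thesis implementation: it covers only the PCA decomposition and the multiple linear regression step, omits ICA, fuzzy variable selection, the exponential power noise model, and the local mixture, and every name and the synthetic data below are assumptions made for illustration.

```python
import numpy as np


def combine_by_pca_regression(preds, y, n_components):
    """Decompose candidate-model predictions with PCA, then aggregate
    the retained components with ordinary least squares.

    preds: (n_samples, n_models) array, one column per candidate model.
    y:     (n_samples,) observed response.
    Returns a function mapping new candidate predictions to the
    composite prediction.
    """
    mean = preds.mean(axis=0)
    centered = preds - mean
    # Rows of vt are the principal directions of the candidate predictions.
    _, _, vt = np.linalg.svd(centered, full_matrices=False)
    basis = vt[:n_components].T
    scores = centered @ basis
    # Multiple linear regression of y on the component scores (with intercept).
    design = np.column_stack([np.ones(len(y)), scores])
    coef, *_ = np.linalg.lstsq(design, y, rcond=None)

    def predict(new_preds):
        s = (new_preds - mean) @ basis
        return np.column_stack([np.ones(len(s)), s]) @ coef

    return predict


# Hypothetical data: a noisy sine response and three imperfect candidate models.
rng = np.random.default_rng(0)
x = np.linspace(0.0, 3.0, 200)
y = np.sin(x) + rng.normal(scale=0.05, size=x.size)
preds = np.column_stack([
    x - x**3 / 6.0,                                   # truncated Taylor series
    0.9 * np.sin(x) + 0.1,                            # biased model
    np.sin(x) + rng.normal(scale=0.2, size=x.size),   # noisy model
])

predict = combine_by_pca_regression(preds, y, n_components=2)
combined = predict(preds)
mse_combined = float(np.mean((combined - y) ** 2))
mse_average = float(np.mean((preds.mean(axis=1) - y) ** 2))
print(f"composite MSE: {mse_combined:.5f}, simple-average MSE: {mse_average:.5f}")
```

Here the number of retained components is fixed by hand; the abstract describes a data-driven, fuzzy selection of components instead, which this sketch does not attempt.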
author2 Michael W. Golay.
author Xu, Mingyang, 1974-
title Model combination by decomposition and aggregation
publisher Massachusetts Institute of Technology
publishDate 2006
url http://hdl.handle.net/1721.1/33641