Segmentation of High Dimensional Time-Series Data Using Mixture of Sparse Principal Component Regression Model with Information Complexity

This paper presents a new and novel hybrid modeling method for the segmentation of high dimensional time-series data using the mixture of the sparse principal components regression (<i>MIX-SPCR</i>) model with information complexity (<inline-formula><math display="inline&qu...

Full description

Bibliographic Details
Main Authors: Yaojin Sun, Hamparsum Bozdogan
Format: Article
Language:English
Published: MDPI AG 2020-10-01
Series:Entropy
Subjects:
Online Access:https://www.mdpi.com/1099-4300/22/10/1170
Description
Summary:This paper presents a new and novel hybrid modeling method for the segmentation of high dimensional time-series data using the mixture of the sparse principal components regression (<i>MIX-SPCR</i>) model with information complexity (<inline-formula><math display="inline"><semantics><mi mathvariant="sans-serif">ICOMP</mi></semantics></math></inline-formula>) criterion as the fitness function. Our approach encompasses dimension reduction in high dimensional time-series data and, at the same time, determines the number of component clusters (i.e., number of segments across time-series data) and selects the best subset of predictors. A large-scale Monte Carlo simulation is performed to show the capability of the <i>MIX-SPCR</i> model to identify the correct structure of the time-series data successfully. <i>MIX-SPCR</i> model is also applied to a high dimensional Standard & Poor’s 500 (S&P 500) index data to uncover the time-series’s hidden structure and identify the structure change points. The approach presented in this paper determines both the relationships among the predictor variables and how various predictor variables contribute to the explanatory power of the response variable through the sparsity settings cluster wise.
ISSN:1099-4300