Forecasting TAIEX with Media Information: Integrating Time Series and Support Vector Regression

碩士 === 國立雲林科技大學 === 財務金融系 === 107 === Due to the rapid development of semiconductor technology and computer science, led to the Internet innovation; at the same time, the message also extremely fast rate of flow, and produce large amounts of data. Therefore creating data mining, text mining and AI,...

Full description

Bibliographic Details
Main Authors: CHENG, TSO-YU, 鄭作宇
Other Authors: CHANG, TZU-PU
Format: Others
Language:zh-TW
Published: 2019
Online Access:http://ndltd.ncl.edu.tw/handle/yg83rs
id ndltd-TW-107YUNT0304007
record_format oai_dc
spelling ndltd-TW-107YUNT03040072019-10-16T03:39:43Z http://ndltd.ncl.edu.tw/handle/yg83rs Forecasting TAIEX with Media Information: Integrating Time Series and Support Vector Regression 以媒體消息預測台灣加權指數:整合時間序列與支援向量迴歸 CHENG, TSO-YU 鄭作宇 碩士 國立雲林科技大學 財務金融系 107 Due to the rapid development of semiconductor technology and computer science, led to the Internet innovation; at the same time, the message also extremely fast rate of flow, and produce large amounts of data. Therefore creating data mining, text mining and AI, etc. boom. The rapid flow of information, news prompted most investors to make investment decisions, and therefore influence the news should not be underestimated. This research uses R language as a tool, through its powerful calculus ability and numerous kits, it can process a huge amount of data and conduct text mining. Combined with the "ANTUSD" developed by NTU and Academia Sinica, we can calculate the sentiment score of financial news and standardize the sentiment score of each monthly financial news to become the only qualitative variable in this study. Meanwhile, it is also a direct emotional indicator. After controlling for other emotional variables and macroeconomic variables, empirical evidence can be used to improve the prediction accuracy both inside and outside the sample. In addition to adding financial news as a qualitative variable, this article predicts with a combined model. Because ARIMA has a very good ability for linear prediction, and SVR has an advantage in dealing with non-linearity, combining the advantages of both can effectively improve the prediction ability. In the empirical results, the prediction error (RMSE) in the sample is reduced by about 20%, so the combined model can effectively improve the predictive ability. This paper found that there are two: First, after the addition of qualitative variables (News Article), effectively enhance the ability to predict. Second, the combined model can effectively decrease the prediction error. If the parameters are adjusted, the predictive ability can be further improved. CHANG, TZU-PU 張子溥 2019 學位論文 ; thesis 52 zh-TW
collection NDLTD
language zh-TW
format Others
sources NDLTD
description 碩士 === 國立雲林科技大學 === 財務金融系 === 107 === Due to the rapid development of semiconductor technology and computer science, led to the Internet innovation; at the same time, the message also extremely fast rate of flow, and produce large amounts of data. Therefore creating data mining, text mining and AI, etc. boom. The rapid flow of information, news prompted most investors to make investment decisions, and therefore influence the news should not be underestimated. This research uses R language as a tool, through its powerful calculus ability and numerous kits, it can process a huge amount of data and conduct text mining. Combined with the "ANTUSD" developed by NTU and Academia Sinica, we can calculate the sentiment score of financial news and standardize the sentiment score of each monthly financial news to become the only qualitative variable in this study. Meanwhile, it is also a direct emotional indicator. After controlling for other emotional variables and macroeconomic variables, empirical evidence can be used to improve the prediction accuracy both inside and outside the sample. In addition to adding financial news as a qualitative variable, this article predicts with a combined model. Because ARIMA has a very good ability for linear prediction, and SVR has an advantage in dealing with non-linearity, combining the advantages of both can effectively improve the prediction ability. In the empirical results, the prediction error (RMSE) in the sample is reduced by about 20%, so the combined model can effectively improve the predictive ability. This paper found that there are two: First, after the addition of qualitative variables (News Article), effectively enhance the ability to predict. Second, the combined model can effectively decrease the prediction error. If the parameters are adjusted, the predictive ability can be further improved.
author2 CHANG, TZU-PU
author_facet CHANG, TZU-PU
CHENG, TSO-YU
鄭作宇
author CHENG, TSO-YU
鄭作宇
spellingShingle CHENG, TSO-YU
鄭作宇
Forecasting TAIEX with Media Information: Integrating Time Series and Support Vector Regression
author_sort CHENG, TSO-YU
title Forecasting TAIEX with Media Information: Integrating Time Series and Support Vector Regression
title_short Forecasting TAIEX with Media Information: Integrating Time Series and Support Vector Regression
title_full Forecasting TAIEX with Media Information: Integrating Time Series and Support Vector Regression
title_fullStr Forecasting TAIEX with Media Information: Integrating Time Series and Support Vector Regression
title_full_unstemmed Forecasting TAIEX with Media Information: Integrating Time Series and Support Vector Regression
title_sort forecasting taiex with media information: integrating time series and support vector regression
publishDate 2019
url http://ndltd.ncl.edu.tw/handle/yg83rs
work_keys_str_mv AT chengtsoyu forecastingtaiexwithmediainformationintegratingtimeseriesandsupportvectorregression
AT zhèngzuòyǔ forecastingtaiexwithmediainformationintegratingtimeseriesandsupportvectorregression
AT chengtsoyu yǐméitǐxiāoxīyùcètáiwānjiāquánzhǐshùzhěnghéshíjiānxùlièyǔzhīyuánxiàngliànghuíguī
AT zhèngzuòyǔ yǐméitǐxiāoxīyùcètáiwānjiāquánzhǐshùzhěnghéshíjiānxùlièyǔzhīyuánxiàngliànghuíguī
_version_ 1719269000303607808