Forecasting TAIEX with Media Information: Integrating Time Series and Support Vector Regression
碩士 === 國立雲林科技大學 === 財務金融系 === 107 === Due to the rapid development of semiconductor technology and computer science, led to the Internet innovation; at the same time, the message also extremely fast rate of flow, and produce large amounts of data. Therefore creating data mining, text mining and AI,...
Main Authors: | , |
---|---|
Other Authors: | |
Format: | Others |
Language: | zh-TW |
Published: |
2019
|
Online Access: | http://ndltd.ncl.edu.tw/handle/yg83rs |
id |
ndltd-TW-107YUNT0304007 |
---|---|
record_format |
oai_dc |
spelling |
ndltd-TW-107YUNT03040072019-10-16T03:39:43Z http://ndltd.ncl.edu.tw/handle/yg83rs Forecasting TAIEX with Media Information: Integrating Time Series and Support Vector Regression 以媒體消息預測台灣加權指數:整合時間序列與支援向量迴歸 CHENG, TSO-YU 鄭作宇 碩士 國立雲林科技大學 財務金融系 107 Due to the rapid development of semiconductor technology and computer science, led to the Internet innovation; at the same time, the message also extremely fast rate of flow, and produce large amounts of data. Therefore creating data mining, text mining and AI, etc. boom. The rapid flow of information, news prompted most investors to make investment decisions, and therefore influence the news should not be underestimated. This research uses R language as a tool, through its powerful calculus ability and numerous kits, it can process a huge amount of data and conduct text mining. Combined with the "ANTUSD" developed by NTU and Academia Sinica, we can calculate the sentiment score of financial news and standardize the sentiment score of each monthly financial news to become the only qualitative variable in this study. Meanwhile, it is also a direct emotional indicator. After controlling for other emotional variables and macroeconomic variables, empirical evidence can be used to improve the prediction accuracy both inside and outside the sample. In addition to adding financial news as a qualitative variable, this article predicts with a combined model. Because ARIMA has a very good ability for linear prediction, and SVR has an advantage in dealing with non-linearity, combining the advantages of both can effectively improve the prediction ability. In the empirical results, the prediction error (RMSE) in the sample is reduced by about 20%, so the combined model can effectively improve the predictive ability. This paper found that there are two: First, after the addition of qualitative variables (News Article), effectively enhance the ability to predict. Second, the combined model can effectively decrease the prediction error. If the parameters are adjusted, the predictive ability can be further improved. CHANG, TZU-PU 張子溥 2019 學位論文 ; thesis 52 zh-TW |
collection |
NDLTD |
language |
zh-TW |
format |
Others
|
sources |
NDLTD |
description |
碩士 === 國立雲林科技大學 === 財務金融系 === 107 === Due to the rapid development of semiconductor technology and computer science, led to the Internet innovation; at the same time, the message also extremely fast rate of flow, and produce large amounts of data. Therefore creating data mining, text mining and AI, etc. boom. The rapid flow of information, news prompted most investors to make investment decisions, and therefore influence the news should not be underestimated.
This research uses R language as a tool, through its powerful calculus ability and numerous kits, it can process a huge amount of data and conduct text mining. Combined with the "ANTUSD" developed by NTU and Academia Sinica, we can calculate the sentiment score of financial news and standardize the sentiment score of each monthly financial news to become the only qualitative variable in this study. Meanwhile, it is also a direct emotional indicator. After controlling for other emotional variables and macroeconomic variables, empirical evidence can be used to improve the prediction accuracy both inside and outside the sample.
In addition to adding financial news as a qualitative variable, this article predicts with a combined model. Because ARIMA has a very good ability for linear prediction, and SVR has an advantage in dealing with non-linearity, combining the advantages of both can effectively improve the prediction ability. In the empirical results, the prediction error (RMSE) in the sample is reduced by about 20%, so the combined model can effectively improve the predictive ability.
This paper found that there are two: First, after the addition of qualitative variables (News Article), effectively enhance the ability to predict. Second, the combined model can effectively decrease the prediction error. If the parameters are adjusted, the predictive ability can be further improved.
|
author2 |
CHANG, TZU-PU |
author_facet |
CHANG, TZU-PU CHENG, TSO-YU 鄭作宇 |
author |
CHENG, TSO-YU 鄭作宇 |
spellingShingle |
CHENG, TSO-YU 鄭作宇 Forecasting TAIEX with Media Information: Integrating Time Series and Support Vector Regression |
author_sort |
CHENG, TSO-YU |
title |
Forecasting TAIEX with Media Information: Integrating Time Series and Support Vector Regression |
title_short |
Forecasting TAIEX with Media Information: Integrating Time Series and Support Vector Regression |
title_full |
Forecasting TAIEX with Media Information: Integrating Time Series and Support Vector Regression |
title_fullStr |
Forecasting TAIEX with Media Information: Integrating Time Series and Support Vector Regression |
title_full_unstemmed |
Forecasting TAIEX with Media Information: Integrating Time Series and Support Vector Regression |
title_sort |
forecasting taiex with media information: integrating time series and support vector regression |
publishDate |
2019 |
url |
http://ndltd.ncl.edu.tw/handle/yg83rs |
work_keys_str_mv |
AT chengtsoyu forecastingtaiexwithmediainformationintegratingtimeseriesandsupportvectorregression AT zhèngzuòyǔ forecastingtaiexwithmediainformationintegratingtimeseriesandsupportvectorregression AT chengtsoyu yǐméitǐxiāoxīyùcètáiwānjiāquánzhǐshùzhěnghéshíjiānxùlièyǔzhīyuánxiàngliànghuíguī AT zhèngzuòyǔ yǐméitǐxiāoxīyùcètáiwānjiāquánzhǐshùzhěnghéshíjiānxùlièyǔzhīyuánxiàngliànghuíguī |
_version_ |
1719269000303607808 |