An Empirical Analysis of Risk Alert with the Text Mining of Internet Financial News

碩士 === 國立交通大學 === 管理學院財務金融學程 === 104 === With the increasing development of Chinese text analysis tools and machine learning methodologies, more researches analyze Madridan financial statements and financial news provided from internet media to predict the performance of stocks returns for risk mana...

Full description

Bibliographic Details
Main Authors: Yeh, Hung-Ching, 葉鴻青
Other Authors: Dai, Tian-Shyr
Format: Others
Language:zh-TW
Published: 2016
Online Access:http://ndltd.ncl.edu.tw/handle/50713728929196548640
id ndltd-TW-104NCTU5303006
record_format oai_dc
spelling ndltd-TW-104NCTU53030062017-09-06T04:21:58Z http://ndltd.ncl.edu.tw/handle/50713728929196548640 An Empirical Analysis of Risk Alert with the Text Mining of Internet Financial News 網路財務新聞的文字分析於風險預警應用之實證研究 Yeh, Hung-Ching 葉鴻青 碩士 國立交通大學 管理學院財務金融學程 104 With the increasing development of Chinese text analysis tools and machine learning methodologies, more researches analyze Madridan financial statements and financial news provided from internet media to predict the performance of stocks returns for risk management purpose. This study analyzes the excess return of the stocks with the nature language method and supervised machine learning. The Chinese financial mood dictionary provided by professor Chuan-Ju Wang makes the financial meanings of the news more accurate. By using technique of the Chinese word segmentation like Jieba, we can build a matrix with the excess return and the feature TF-IDF weightings. There are two types of models. The first type of models categorizes excess returns into positive/negative. These methods include the Logit regression model, Naïve bayes classifier and Support vector classifier. The second type of models estimates quantitatives of excess returns. These models include the linear regression model and Support vector machine. According to our empirical study, the average accuracies of 50-fold cross validation in the first type of models are about above 60%, and the AUC indicates the model has at least middle discremanatory power. The mean square errors and the coefficients of determination are passable. The predict excess return is worthy of our reference. It appears that my proposed five models can process Chineses financial news to predict the excess return of the stock. It confirms the hypothesis that Taiwan stock market does not meet the strong form of efficient market. Dai, Tian-Shyr 戴天時 2016 學位論文 ; thesis 53 zh-TW
collection NDLTD
language zh-TW
format Others
sources NDLTD
description 碩士 === 國立交通大學 === 管理學院財務金融學程 === 104 === With the increasing development of Chinese text analysis tools and machine learning methodologies, more researches analyze Madridan financial statements and financial news provided from internet media to predict the performance of stocks returns for risk management purpose. This study analyzes the excess return of the stocks with the nature language method and supervised machine learning. The Chinese financial mood dictionary provided by professor Chuan-Ju Wang makes the financial meanings of the news more accurate. By using technique of the Chinese word segmentation like Jieba, we can build a matrix with the excess return and the feature TF-IDF weightings. There are two types of models. The first type of models categorizes excess returns into positive/negative. These methods include the Logit regression model, Naïve bayes classifier and Support vector classifier. The second type of models estimates quantitatives of excess returns. These models include the linear regression model and Support vector machine. According to our empirical study, the average accuracies of 50-fold cross validation in the first type of models are about above 60%, and the AUC indicates the model has at least middle discremanatory power. The mean square errors and the coefficients of determination are passable. The predict excess return is worthy of our reference. It appears that my proposed five models can process Chineses financial news to predict the excess return of the stock. It confirms the hypothesis that Taiwan stock market does not meet the strong form of efficient market.
author2 Dai, Tian-Shyr
author_facet Dai, Tian-Shyr
Yeh, Hung-Ching
葉鴻青
author Yeh, Hung-Ching
葉鴻青
spellingShingle Yeh, Hung-Ching
葉鴻青
An Empirical Analysis of Risk Alert with the Text Mining of Internet Financial News
author_sort Yeh, Hung-Ching
title An Empirical Analysis of Risk Alert with the Text Mining of Internet Financial News
title_short An Empirical Analysis of Risk Alert with the Text Mining of Internet Financial News
title_full An Empirical Analysis of Risk Alert with the Text Mining of Internet Financial News
title_fullStr An Empirical Analysis of Risk Alert with the Text Mining of Internet Financial News
title_full_unstemmed An Empirical Analysis of Risk Alert with the Text Mining of Internet Financial News
title_sort empirical analysis of risk alert with the text mining of internet financial news
publishDate 2016
url http://ndltd.ncl.edu.tw/handle/50713728929196548640
work_keys_str_mv AT yehhungching anempiricalanalysisofriskalertwiththetextminingofinternetfinancialnews
AT yèhóngqīng anempiricalanalysisofriskalertwiththetextminingofinternetfinancialnews
AT yehhungching wǎnglùcáiwùxīnwéndewénzìfēnxīyúfēngxiǎnyùjǐngyīngyòngzhīshízhèngyánjiū
AT yèhóngqīng wǎnglùcáiwùxīnwéndewénzìfēnxīyúfēngxiǎnyùjǐngyīngyòngzhīshízhèngyánjiū
AT yehhungching empiricalanalysisofriskalertwiththetextminingofinternetfinancialnews
AT yèhóngqīng empiricalanalysisofriskalertwiththetextminingofinternetfinancialnews
_version_ 1718527148922241024