Risk-related Sentence Detection in Financial Reports
碩士 === 國立政治大學 === 資訊科學學系 === 106 === The main purpose of this paper is to evaluate the risk of financial report of listed companies in sentence-level. Most of past sentiment analysis studies focused on word-level risk detection. However, most financial keywords are highly context-sensitive, which ma...
Main Authors: | , |
---|---|
Other Authors: | |
Format: | Others |
Language: | zh-TW |
Published: |
2017
|
Online Access: | http://ndltd.ncl.edu.tw/handle/483qk2 |
id |
ndltd-TW-106NCCU5394006 |
---|---|
record_format |
oai_dc |
spelling |
ndltd-TW-106NCCU53940062019-05-15T23:46:36Z http://ndltd.ncl.edu.tw/handle/483qk2 Risk-related Sentence Detection in Financial Reports 財報文字分析之句子風險程度偵測研究 Liu, Yu-Wen 柳育彣 碩士 國立政治大學 資訊科學學系 106 The main purpose of this paper is to evaluate the risk of financial report of listed companies in sentence-level. Most of past sentiment analysis studies focused on word-level risk detection. However, most financial keywords are highly context-sensitive, which may likely yield biased results. Therefore, to advance the understanding of financial textual information, this thesis broadens the analysis from word-level to sentence level. We use two sentence-level models, fastText and Siamese-CBOW, to learn sentence embedding and attempt to facilitate the financial risk detection. In our experiment, we use the 10-K corpus and a financial sentiment dataset which were labeled by financial professionals to train our financial risk classifier. Moreover, we adopt the Bag-of-Word model as a baseline and use accuracy, precision, recall and F1-score to evaluate the performance of financial risk prediction. The experimental results show that the embedding models could lead better performance than the Bag-of-word model. In addition, this paper proposes a web-based financial risk detection system which is constructed based on fastText and Siamese CBOW model called RiskFinder. There are total 40,708 financial reports inside the system and each risk-related sentence is highlighted based on different sentence embedding models. Besides, our system also provides metadata and a visualization of financial time-series data for the corresponding company according to release day of financial report. This system considerably facilitates case studies in the field of finance and can be of great help in capturing valuable insight within large amounts of textual information. Tsai, Ming-Feng Wang, Chuan-Ju 蔡銘峰 王釧茹 2017 學位論文 ; thesis 35 zh-TW |
collection |
NDLTD |
language |
zh-TW |
format |
Others
|
sources |
NDLTD |
description |
碩士 === 國立政治大學 === 資訊科學學系 === 106 === The main purpose of this paper is to evaluate the risk of financial report of listed companies in sentence-level. Most of past sentiment analysis studies focused on word-level risk detection. However, most financial keywords are highly context-sensitive, which may likely yield biased results. Therefore, to advance the understanding of financial textual information, this thesis broadens the analysis from word-level to sentence level. We use two sentence-level models, fastText and Siamese-CBOW, to learn sentence embedding and attempt to facilitate the financial risk detection. In our experiment, we use the 10-K corpus and a financial sentiment dataset which were labeled by financial professionals to train our financial risk classifier. Moreover, we adopt the Bag-of-Word model as a baseline and use accuracy, precision, recall and F1-score to evaluate the performance of financial risk prediction. The experimental results show that the embedding models could lead better performance than the Bag-of-word model. In addition, this paper proposes a web-based financial risk detection system which is constructed based on fastText and Siamese CBOW model called RiskFinder. There are total 40,708 financial reports inside the system and each risk-related sentence is highlighted based on different sentence embedding models. Besides, our system also provides metadata and a visualization of financial time-series data for the corresponding company according to release day of financial report. This system considerably facilitates case studies in the field of finance and can be of great help in capturing valuable insight within large amounts of textual information.
|
author2 |
Tsai, Ming-Feng |
author_facet |
Tsai, Ming-Feng Liu, Yu-Wen 柳育彣 |
author |
Liu, Yu-Wen 柳育彣 |
spellingShingle |
Liu, Yu-Wen 柳育彣 Risk-related Sentence Detection in Financial Reports |
author_sort |
Liu, Yu-Wen |
title |
Risk-related Sentence Detection in Financial Reports |
title_short |
Risk-related Sentence Detection in Financial Reports |
title_full |
Risk-related Sentence Detection in Financial Reports |
title_fullStr |
Risk-related Sentence Detection in Financial Reports |
title_full_unstemmed |
Risk-related Sentence Detection in Financial Reports |
title_sort |
risk-related sentence detection in financial reports |
publishDate |
2017 |
url |
http://ndltd.ncl.edu.tw/handle/483qk2 |
work_keys_str_mv |
AT liuyuwen riskrelatedsentencedetectioninfinancialreports AT liǔyùwén riskrelatedsentencedetectioninfinancialreports AT liuyuwen cáibàowénzìfēnxīzhījùzifēngxiǎnchéngdùzhēncèyánjiū AT liǔyùwén cáibàowénzìfēnxīzhījùzifēngxiǎnchéngdùzhēncèyánjiū |
_version_ |
1719154114758180864 |