Risk-related Sentence Detection in Financial Reports

碩士 === 國立政治大學 === 資訊科學學系 === 106 === The main purpose of this paper is to evaluate the risk of financial report of listed companies in sentence-level. Most of past sentiment analysis studies focused on word-level risk detection. However, most financial keywords are highly context-sensitive, which ma...

Full description

Bibliographic Details
Main Authors: Liu, Yu-Wen, 柳育彣
Other Authors: Tsai, Ming-Feng
Format: Others
Language:zh-TW
Published: 2017
Online Access:http://ndltd.ncl.edu.tw/handle/483qk2
id ndltd-TW-106NCCU5394006
record_format oai_dc
spelling ndltd-TW-106NCCU53940062019-05-15T23:46:36Z http://ndltd.ncl.edu.tw/handle/483qk2 Risk-related Sentence Detection in Financial Reports 財報文字分析之句子風險程度偵測研究 Liu, Yu-Wen 柳育彣 碩士 國立政治大學 資訊科學學系 106 The main purpose of this paper is to evaluate the risk of financial report of listed companies in sentence-level. Most of past sentiment analysis studies focused on word-level risk detection. However, most financial keywords are highly context-sensitive, which may likely yield biased results. Therefore, to advance the understanding of financial textual information, this thesis broadens the analysis from word-level to sentence level. We use two sentence-level models, fastText and Siamese-CBOW, to learn sentence embedding and attempt to facilitate the financial risk detection. In our experiment, we use the 10-K corpus and a financial sentiment dataset which were labeled by financial professionals to train our financial risk classifier. Moreover, we adopt the Bag-of-Word model as a baseline and use accuracy, precision, recall and F1-score to evaluate the performance of financial risk prediction. The experimental results show that the embedding models could lead better performance than the Bag-of-word model. In addition, this paper proposes a web-based financial risk detection system which is constructed based on fastText and Siamese CBOW model called RiskFinder. There are total 40,708 financial reports inside the system and each risk-related sentence is highlighted based on different sentence embedding models. Besides, our system also provides metadata and a visualization of financial time-series data for the corresponding company according to release day of financial report. This system considerably facilitates case studies in the field of finance and can be of great help in capturing valuable insight within large amounts of textual information. Tsai, Ming-Feng Wang, Chuan-Ju 蔡銘峰 王釧茹 2017 學位論文 ; thesis 35 zh-TW
collection NDLTD
language zh-TW
format Others
sources NDLTD
description 碩士 === 國立政治大學 === 資訊科學學系 === 106 === The main purpose of this paper is to evaluate the risk of financial report of listed companies in sentence-level. Most of past sentiment analysis studies focused on word-level risk detection. However, most financial keywords are highly context-sensitive, which may likely yield biased results. Therefore, to advance the understanding of financial textual information, this thesis broadens the analysis from word-level to sentence level. We use two sentence-level models, fastText and Siamese-CBOW, to learn sentence embedding and attempt to facilitate the financial risk detection. In our experiment, we use the 10-K corpus and a financial sentiment dataset which were labeled by financial professionals to train our financial risk classifier. Moreover, we adopt the Bag-of-Word model as a baseline and use accuracy, precision, recall and F1-score to evaluate the performance of financial risk prediction. The experimental results show that the embedding models could lead better performance than the Bag-of-word model. In addition, this paper proposes a web-based financial risk detection system which is constructed based on fastText and Siamese CBOW model called RiskFinder. There are total 40,708 financial reports inside the system and each risk-related sentence is highlighted based on different sentence embedding models. Besides, our system also provides metadata and a visualization of financial time-series data for the corresponding company according to release day of financial report. This system considerably facilitates case studies in the field of finance and can be of great help in capturing valuable insight within large amounts of textual information.
author2 Tsai, Ming-Feng
author_facet Tsai, Ming-Feng
Liu, Yu-Wen
柳育彣
author Liu, Yu-Wen
柳育彣
spellingShingle Liu, Yu-Wen
柳育彣
Risk-related Sentence Detection in Financial Reports
author_sort Liu, Yu-Wen
title Risk-related Sentence Detection in Financial Reports
title_short Risk-related Sentence Detection in Financial Reports
title_full Risk-related Sentence Detection in Financial Reports
title_fullStr Risk-related Sentence Detection in Financial Reports
title_full_unstemmed Risk-related Sentence Detection in Financial Reports
title_sort risk-related sentence detection in financial reports
publishDate 2017
url http://ndltd.ncl.edu.tw/handle/483qk2
work_keys_str_mv AT liuyuwen riskrelatedsentencedetectioninfinancialreports
AT liǔyùwén riskrelatedsentencedetectioninfinancialreports
AT liuyuwen cáibàowénzìfēnxīzhījùzifēngxiǎnchéngdùzhēncèyánjiū
AT liǔyùwén cáibàowénzìfēnxīzhījùzifēngxiǎnchéngdùzhēncèyánjiū
_version_ 1719154114758180864