On the Construction and Analysis of Time-Series-Oriented Lexicons

碩士 === 臺北市立大學 === 資訊科學系 === 104 === This thesis proposes a novel framework to build a time-series-oriented lexicon which can cover different types of sources and also has explicit links with the targets of prediction problems. In the framework, the input is composed of a text stream, such as financi...

Full description

Bibliographic Details
Main Authors: Lai, Cheng-Yi, 賴承毅
Other Authors: 王釧茹
Format: Others
Language:en_US
Published: 2016
Online Access:http://ndltd.ncl.edu.tw/handle/52924983784941283362
id ndltd-TW-104UT005394016
record_format oai_dc
spelling ndltd-TW-104UT0053940162017-09-24T04:40:57Z http://ndltd.ncl.edu.tw/handle/52924983784941283362 On the Construction and Analysis of Time-Series-Oriented Lexicons 時間序列導向財經字典的建構與分析 Lai, Cheng-Yi 賴承毅 碩士 臺北市立大學 資訊科學系 104 This thesis proposes a novel framework to build a time-series-oriented lexicon which can cover different types of sources and also has explicit links with the targets of prediction problems. In the framework, the input is composed of a text stream, such as financial news and a financial time series, such as the stock prices of a company. We then calculate the Pearson correlation between the frequency series of each word and the stock price series of a company. Although Pearson correlation gives a good idea of how much the two time series are correlated, it has a limitation in capturing the similarity when one of the series is stretched or shifted. To overcome this limitation, we adopt Dynamic time warping (DTW) to handle the problem. Finally, the words with high correlations will be extracted to build the time-series-oriented lexicon. Additionally, we adopt the learned lexicon to construct a model for stock price movement prediction. The experimental results demonstrate that the learned words generally have good prediction ability, which attests the practicability of the proposed idea of capturing a company's keywords via its historical stock prices. 王釧茹 2016 學位論文 ; thesis 33 en_US
collection NDLTD
language en_US
format Others
sources NDLTD
description 碩士 === 臺北市立大學 === 資訊科學系 === 104 === This thesis proposes a novel framework to build a time-series-oriented lexicon which can cover different types of sources and also has explicit links with the targets of prediction problems. In the framework, the input is composed of a text stream, such as financial news and a financial time series, such as the stock prices of a company. We then calculate the Pearson correlation between the frequency series of each word and the stock price series of a company. Although Pearson correlation gives a good idea of how much the two time series are correlated, it has a limitation in capturing the similarity when one of the series is stretched or shifted. To overcome this limitation, we adopt Dynamic time warping (DTW) to handle the problem. Finally, the words with high correlations will be extracted to build the time-series-oriented lexicon. Additionally, we adopt the learned lexicon to construct a model for stock price movement prediction. The experimental results demonstrate that the learned words generally have good prediction ability, which attests the practicability of the proposed idea of capturing a company's keywords via its historical stock prices.
author2 王釧茹
author_facet 王釧茹
Lai, Cheng-Yi
賴承毅
author Lai, Cheng-Yi
賴承毅
spellingShingle Lai, Cheng-Yi
賴承毅
On the Construction and Analysis of Time-Series-Oriented Lexicons
author_sort Lai, Cheng-Yi
title On the Construction and Analysis of Time-Series-Oriented Lexicons
title_short On the Construction and Analysis of Time-Series-Oriented Lexicons
title_full On the Construction and Analysis of Time-Series-Oriented Lexicons
title_fullStr On the Construction and Analysis of Time-Series-Oriented Lexicons
title_full_unstemmed On the Construction and Analysis of Time-Series-Oriented Lexicons
title_sort on the construction and analysis of time-series-oriented lexicons
publishDate 2016
url http://ndltd.ncl.edu.tw/handle/52924983784941283362
work_keys_str_mv AT laichengyi ontheconstructionandanalysisoftimeseriesorientedlexicons
AT làichéngyì ontheconstructionandanalysisoftimeseriesorientedlexicons
AT laichengyi shíjiānxùlièdǎoxiàngcáijīngzìdiǎndejiàngòuyǔfēnxī
AT làichéngyì shíjiānxùlièdǎoxiàngcáijīngzìdiǎndejiàngòuyǔfēnxī
_version_ 1718540390152273920