On the Construction and Analysis of Time-Series-Oriented Lexicons
碩士 === 臺北市立大學 === 資訊科學系 === 104 === This thesis proposes a novel framework to build a time-series-oriented lexicon which can cover different types of sources and also has explicit links with the targets of prediction problems. In the framework, the input is composed of a text stream, such as financi...
Main Authors: | , |
---|---|
Other Authors: | |
Format: | Others |
Language: | en_US |
Published: |
2016
|
Online Access: | http://ndltd.ncl.edu.tw/handle/52924983784941283362 |
id |
ndltd-TW-104UT005394016 |
---|---|
record_format |
oai_dc |
spelling |
ndltd-TW-104UT0053940162017-09-24T04:40:57Z http://ndltd.ncl.edu.tw/handle/52924983784941283362 On the Construction and Analysis of Time-Series-Oriented Lexicons 時間序列導向財經字典的建構與分析 Lai, Cheng-Yi 賴承毅 碩士 臺北市立大學 資訊科學系 104 This thesis proposes a novel framework to build a time-series-oriented lexicon which can cover different types of sources and also has explicit links with the targets of prediction problems. In the framework, the input is composed of a text stream, such as financial news and a financial time series, such as the stock prices of a company. We then calculate the Pearson correlation between the frequency series of each word and the stock price series of a company. Although Pearson correlation gives a good idea of how much the two time series are correlated, it has a limitation in capturing the similarity when one of the series is stretched or shifted. To overcome this limitation, we adopt Dynamic time warping (DTW) to handle the problem. Finally, the words with high correlations will be extracted to build the time-series-oriented lexicon. Additionally, we adopt the learned lexicon to construct a model for stock price movement prediction. The experimental results demonstrate that the learned words generally have good prediction ability, which attests the practicability of the proposed idea of capturing a company's keywords via its historical stock prices. 王釧茹 2016 學位論文 ; thesis 33 en_US |
collection |
NDLTD |
language |
en_US |
format |
Others
|
sources |
NDLTD |
description |
碩士 === 臺北市立大學 === 資訊科學系 === 104 === This thesis proposes a novel framework to build a time-series-oriented lexicon which can cover different types of sources and also has explicit links with the targets of prediction problems. In the framework, the input is composed of a text stream, such as financial news and a financial time series, such as the stock prices of a company. We then calculate the Pearson correlation between the frequency series of each word and the stock price series of a company. Although Pearson correlation gives a good idea of how much the two time series are correlated, it has a limitation in capturing the similarity when one of the series is stretched or shifted. To overcome this limitation, we adopt Dynamic time warping (DTW) to handle the problem. Finally, the words with high correlations will be extracted to build the time-series-oriented lexicon. Additionally, we adopt the learned lexicon to construct a model for stock price movement prediction. The experimental results demonstrate that the learned words generally have good prediction ability, which attests the practicability of the proposed idea of capturing a company's keywords via its historical stock prices.
|
author2 |
王釧茹 |
author_facet |
王釧茹 Lai, Cheng-Yi 賴承毅 |
author |
Lai, Cheng-Yi 賴承毅 |
spellingShingle |
Lai, Cheng-Yi 賴承毅 On the Construction and Analysis of Time-Series-Oriented Lexicons |
author_sort |
Lai, Cheng-Yi |
title |
On the Construction and Analysis of Time-Series-Oriented Lexicons |
title_short |
On the Construction and Analysis of Time-Series-Oriented Lexicons |
title_full |
On the Construction and Analysis of Time-Series-Oriented Lexicons |
title_fullStr |
On the Construction and Analysis of Time-Series-Oriented Lexicons |
title_full_unstemmed |
On the Construction and Analysis of Time-Series-Oriented Lexicons |
title_sort |
on the construction and analysis of time-series-oriented lexicons |
publishDate |
2016 |
url |
http://ndltd.ncl.edu.tw/handle/52924983784941283362 |
work_keys_str_mv |
AT laichengyi ontheconstructionandanalysisoftimeseriesorientedlexicons AT làichéngyì ontheconstructionandanalysisoftimeseriesorientedlexicons AT laichengyi shíjiānxùlièdǎoxiàngcáijīngzìdiǎndejiàngòuyǔfēnxī AT làichéngyì shíjiānxùlièdǎoxiàngcáijīngzìdiǎndejiàngòuyǔfēnxī |
_version_ |
1718540390152273920 |