Sentiment analysis: Quantitative evaluation of subjective opinions using natural language processing
Sentiment Analysis consists of recognizing sentiment orientation towards specific subjects within natural language texts. Most research in this area focuses on classifying documents as positive or negative. The purpose of this thesis is to quantitatively evaluate subjective opinions of customer revi...
Main Author: | Li, Wenhui |
---|---|
Format: | Others |
Language: | en |
Published: | University of Ottawa (Canada) 2013 |
Subjects: | Artificial Intelligence. Computer Science. |
Online Access: | http://hdl.handle.net/10393/28000 http://dx.doi.org/10.20381/ruor-19027 |
id |
ndltd-uottawa.ca-oai-ruor.uottawa.ca-10393-28000 |
---|---|
record_format |
oai_dc |
spelling |
ndltd-uottawa.ca-oai-ruor.uottawa.ca-10393-28000 2018-01-05T19:07:48Z Sentiment analysis: Quantitative evaluation of subjective opinions using natural language processing Li, Wenhui Artificial Intelligence. Computer Science.
2013-11-07T19:03:11Z 2013-11-07T19:03:11Z 2008 2008 Thesis Source: Masters Abstracts International, Volume: 48-01, page: 0453. http://hdl.handle.net/10393/28000 http://dx.doi.org/10.20381/ruor-19027 en 181 p. University of Ottawa (Canada) |
collection |
NDLTD |
language |
en |
format |
Others |
sources |
NDLTD |
topic |
Artificial Intelligence. Computer Science. |
description |
Sentiment analysis consists of recognizing sentiment orientation towards specific subjects within natural language texts. Most research in this area focuses on classifying documents as positive or negative. The purpose of this thesis is to quantitatively evaluate the subjective opinions expressed in customer reviews using a five-star rating system, which is widely used on online review websites, and to make the predicted score as accurate as possible.
First, this thesis presents two methods for rating reviews: a supervised method, which treats rating as multi-class classification, and an unsupervised method, which rates reviews using association scores between sentiment terms and a set of seed words extracted from the corpus. We extend the feature selection approach used in Turney's PMI-IR estimation by introducing semantic relatedness measures based upon the content of WordNet. This thesis reports on experiments that apply both methods to rating reviews using the combined feature set enriched with WordNet-selected sentiment terms. The results suggest ways in which incorporating WordNet relatedness measures into feature selection may yield improvement over classification and unsupervised learning methods that do not use them.
Furthermore, via ordinal meta-classifiers, we use the ordering information contained in the scores of bank reviews to improve performance. We also explore the effectiveness of re-sampling for reducing the problem of skewed data, and we check whether discretization benefits the ordinal meta-learning process.
Finally, we combine the unsupervised and supervised meta-learning methods to optimize performance on our sentiment prediction task. |
author |
Li, Wenhui |
title |
Sentiment analysis: Quantitative evaluation of subjective opinions using natural language processing |
publisher |
University of Ottawa (Canada) |
publishDate |
2013 |
url |
http://hdl.handle.net/10393/28000 http://dx.doi.org/10.20381/ruor-19027 |
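The unsupervised method in the description rates reviews by the association of sentiment terms with seed words, building on Turney's PMI-based estimation. The record does not give the thesis's seed words, corpus, or WordNet extension, so the following is only a minimal sketch of the general idea under stated assumptions: it uses document-level co-occurrence counts in a tiny made-up corpus instead of search-engine hit counts, and the names `so_pmi`, `review_orientation`, `DOCS`, `POS_SEEDS`, and `NEG_SEEDS` are hypothetical.

```python
import math

# Hypothetical toy corpus and seed words, invented for illustration only.
DOCS = [set(text.split()) for text in [
    "excellent service friendly staff",
    "friendly helpful excellent",
    "poor service rude staff",
    "rude slow poor",
    "friendly staff good rates",
]]
POS_SEEDS = {"excellent", "good"}
NEG_SEEDS = {"poor"}


def so_pmi(term, docs=DOCS, pos=POS_SEEDS, neg=NEG_SEEDS, smoothing=0.01):
    """Semantic orientation of `term`: PMI with positive seeds minus PMI
    with negative seeds, estimated from document co-occurrence counts."""
    hits_pos = sum(1 for d in docs if d & pos)
    hits_neg = sum(1 for d in docs if d & neg)
    co_pos = sum(1 for d in docs if term in d and d & pos)
    co_neg = sum(1 for d in docs if term in d and d & neg)
    # Smoothing keeps the ratio finite when a count is zero.
    ratio = ((co_pos + smoothing) * (hits_neg + smoothing)) / (
        (co_neg + smoothing) * (hits_pos + smoothing)
    )
    return math.log2(ratio)


def review_orientation(tokens):
    """Average orientation of a review's non-seed terms (0.0 if none)."""
    terms = [t for t in tokens if t not in POS_SEEDS | NEG_SEEDS]
    if not terms:
        return 0.0
    return sum(so_pmi(t) for t in terms) / len(terms)


if __name__ == "__main__":
    for word in ("friendly", "rude", "staff"):
        print(word, round(so_pmi(word), 2))
```

A positive score means the term co-occurs more with the positive seeds than with the negative ones. Mapping such scores onto a five-star scale would need thresholds or a learned mapping, which this sketch deliberately leaves out.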