Sentiment analysis: Quantitative evaluation of subjective opinions using natural language processing

Sentiment Analysis consists of recognizing sentiment orientation towards specific subjects within natural language texts. Most research in this area focuses on classifying documents as positive or negative. The purpose of this thesis is to quantitatively evaluate subjective opinions of customer revi...

Bibliographic Details
Main Author: Li, Wenhui
Format: Others
Language: en
Published: University of Ottawa (Canada) 2013
Subjects: Artificial Intelligence. Computer Science.
Online Access: http://hdl.handle.net/10393/28000
http://dx.doi.org/10.20381/ruor-19027
id ndltd-uottawa.ca-oai-ruor.uottawa.ca-10393-28000
record_format oai_dc
spelling ndltd-uottawa.ca-oai-ruor.uottawa.ca-10393-28000 2018-01-05T19:07:48Z Sentiment analysis: Quantitative evaluation of subjective opinions using natural language processing Li, Wenhui Artificial Intelligence. Computer Science.
2013-11-07T19:03:11Z 2013-11-07T19:03:11Z 2008 2008 Thesis Source: Masters Abstracts International, Volume: 48-01, page: 0453. http://hdl.handle.net/10393/28000 http://dx.doi.org/10.20381/ruor-19027 en 181 p. University of Ottawa (Canada)
collection NDLTD
language en
format Others
sources NDLTD
topic Artificial Intelligence.
Computer Science.
description Sentiment analysis consists of recognizing sentiment orientation towards specific subjects within natural language texts. Most research in this area focuses on classifying documents as positive or negative. The purpose of this thesis is to quantitatively evaluate the subjective opinions in customer reviews using a five-star rating system, which is widely used on online review web sites, and to make the predicted score as accurate as possible. First, this thesis presents two methods for rating reviews: a supervised method that treats rating as multi-class classification, and an unsupervised method that rates reviews using association scores between sentiment terms and a set of seed words extracted from the corpus. We extend the feature selection approach used in Turney's PMI-IR estimation by introducing semantic relatedness measures based on the content of WordNet. This thesis reports on experiments that apply both methods to rating reviews using the combined feature set enriched with WordNet-selected sentiment terms. The results suggest ways in which incorporating WordNet relatedness measures into feature selection may yield improvement over classification and unsupervised learning methods that do not use them. Furthermore, via ordinal meta-classifiers, we use the ordering information contained in the scores of bank reviews to improve performance; we explore the effectiveness of re-sampling for reducing the problem of skewed data; and we check whether discretization benefits the ordinal meta-learning process. Finally, we combine the unsupervised and supervised meta-learning methods to optimize performance on our sentiment prediction task.
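The unsupervised method in the description scores sentiment terms by their association with seed words, in the spirit of Turney's PMI-IR. The following is a minimal sketch of that idea, not the thesis's implementation. It assumes document-level co-occurrence counts over a small in-memory corpus in place of search-engine hit counts, a single positive and negative seed ("excellent" and "poor"), and a smoothing constant of 0.01; the function name `semantic_orientation` and the toy reviews are hypothetical.

```python
import math


def semantic_orientation(term, docs, pos_seed="excellent", neg_seed="poor", smooth=0.01):
    """Turney-style semantic orientation of `term`.

    SO(term) = log2( hits(term, pos) * hits(neg) / (hits(term, neg) * hits(pos)) )
    Here hits(...) counts documents containing all the given words
    (an assumption; PMI-IR originally used search-engine hit counts).
    """
    token_sets = [set(d.lower().split()) for d in docs]

    def hits(*words):
        # Number of documents that contain every word in `words`.
        return sum(1 for tokens in token_sets if all(w in tokens for w in words))

    # `smooth` avoids division by zero when a pair never co-occurs.
    numerator = (hits(term, pos_seed) + smooth) * (hits(neg_seed) + smooth)
    denominator = (hits(term, neg_seed) + smooth) * (hits(pos_seed) + smooth)
    return math.log2(numerator / denominator)


# Hypothetical toy corpus of review snippets.
docs = [
    "excellent service and friendly staff",
    "friendly teller excellent rates",
    "poor service and hidden fees",
    "hidden fees poor support",
]

for word in ("friendly", "hidden", "service"):
    print(word, round(semantic_orientation(word, docs), 3))
```

A positive score indicates that the term co-occurs more with the positive seed than with the negative one; averaging such scores over the sentiment terms in a review is one plausible way to map a review onto a rating scale.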
author Li, Wenhui
title Sentiment analysis: Quantitative evaluation of subjective opinions using natural language processing
publisher University of Ottawa (Canada)
publishDate 2013
url http://hdl.handle.net/10393/28000
http://dx.doi.org/10.20381/ruor-19027