A Deep-Learning-Based Long Article Analysis Method for Sentiment Extraction from Japanese Animation Viewers Comments

碩士 === 國立臺灣科技大學 === 資訊工程系 === 107 === In an era of information, Internet volume is a major issue. We automatically classify emotions from multiple long comments through a computer to extract sentiment. The traditional long article summary and sentiment classification methods define a dictionary of s...

Full description

Bibliographic Details
Main Authors: Ying-Tse Lee, 李映澤
Other Authors: Chin-Shyurng Fahn
Format: Others
Language:en_US
Published: 2019
Online Access:http://ndltd.ncl.edu.tw/handle/898crq
id ndltd-TW-107NTUS5392053
record_format oai_dc
spelling ndltd-TW-107NTUS53920532019-10-23T05:46:05Z http://ndltd.ncl.edu.tw/handle/898crq A Deep-Learning-Based Long Article Analysis Method for Sentiment Extraction from Japanese Animation Viewers Comments 一個應用深度學習的長文分析於日本動畫 作品評論以擷取輿情之方法 Ying-Tse Lee 李映澤 碩士 國立臺灣科技大學 資訊工程系 107 In an era of information, Internet volume is a major issue. We automatically classify emotions from multiple long comments through a computer to extract sentiment. The traditional long article summary and sentiment classification methods define a dictionary of sentimental words in advance, but the situation in the real world is more complicated. Under the circumstances, neither the dictionary can fully enumerate every word nor totally define each word. Based on the shortcomings of the above methods, we design a long article analysis system with deep learning. When the model is completely trained, the system will automatically classify reviewers’ sentiment. After collecting lots of reviews, we can determinate the market reaction of today's animation works. According to these, we can achieve the target of extracting sentiment. We propose an automatic sentimental classification method for long comments in the field of animation and a related dataset. First, we use crawlers to fetch a huge number of relevant labeled comments and summarize the articles through pre- processing and Skip-thoughts. Then we use the Bi-GRU combined with self-attention mechanism to train deep recurrent neural networks, and finally complete a sentiment classification model. By integrating the sentimental model into our long comment analysis classification system, we can obtain an ability of extracting emotions from long comments. In addition to the labels and comments, each of the data we collect contains more detailed classifications and information about each of the animation works, which will be the material for future research. However, in the stage of model training, we adopt data enhancement to increase the collected comments and then employ the model to learn the features. Finally, the system can distinguish between positive and negative sentiments. In the experiments, we use public open datasets to evaluate and analyze comments on different types of Japanese animation works, such as Action, Adventure, Comedy, and School. Our system based on the proposed methods can correctly predict in most cases of sentimental classification. In the open datasets, the accuracy in the sentimental classification for the IMDB, SST2, MPQA, and MR are 89.9%, 83.3%, 87.3%, and 86.0%, respectively. Additionally, for our dataset, the accuracy in sentimental classification reaches 84.7% and the overall execution time is very short. It spends about 0.001 seconds on an average per prediction. The experimental results reveal that our system can achieve real-time prediction. Chin-Shyurng Fahn 范欽雄 2019 學位論文 ; thesis 112 en_US
collection NDLTD
language en_US
format Others
sources NDLTD
description 碩士 === 國立臺灣科技大學 === 資訊工程系 === 107 === In an era of information, Internet volume is a major issue. We automatically classify emotions from multiple long comments through a computer to extract sentiment. The traditional long article summary and sentiment classification methods define a dictionary of sentimental words in advance, but the situation in the real world is more complicated. Under the circumstances, neither the dictionary can fully enumerate every word nor totally define each word. Based on the shortcomings of the above methods, we design a long article analysis system with deep learning. When the model is completely trained, the system will automatically classify reviewers’ sentiment. After collecting lots of reviews, we can determinate the market reaction of today's animation works. According to these, we can achieve the target of extracting sentiment. We propose an automatic sentimental classification method for long comments in the field of animation and a related dataset. First, we use crawlers to fetch a huge number of relevant labeled comments and summarize the articles through pre- processing and Skip-thoughts. Then we use the Bi-GRU combined with self-attention mechanism to train deep recurrent neural networks, and finally complete a sentiment classification model. By integrating the sentimental model into our long comment analysis classification system, we can obtain an ability of extracting emotions from long comments. In addition to the labels and comments, each of the data we collect contains more detailed classifications and information about each of the animation works, which will be the material for future research. However, in the stage of model training, we adopt data enhancement to increase the collected comments and then employ the model to learn the features. Finally, the system can distinguish between positive and negative sentiments. In the experiments, we use public open datasets to evaluate and analyze comments on different types of Japanese animation works, such as Action, Adventure, Comedy, and School. Our system based on the proposed methods can correctly predict in most cases of sentimental classification. In the open datasets, the accuracy in the sentimental classification for the IMDB, SST2, MPQA, and MR are 89.9%, 83.3%, 87.3%, and 86.0%, respectively. Additionally, for our dataset, the accuracy in sentimental classification reaches 84.7% and the overall execution time is very short. It spends about 0.001 seconds on an average per prediction. The experimental results reveal that our system can achieve real-time prediction.
author2 Chin-Shyurng Fahn
author_facet Chin-Shyurng Fahn
Ying-Tse Lee
李映澤
author Ying-Tse Lee
李映澤
spellingShingle Ying-Tse Lee
李映澤
A Deep-Learning-Based Long Article Analysis Method for Sentiment Extraction from Japanese Animation Viewers Comments
author_sort Ying-Tse Lee
title A Deep-Learning-Based Long Article Analysis Method for Sentiment Extraction from Japanese Animation Viewers Comments
title_short A Deep-Learning-Based Long Article Analysis Method for Sentiment Extraction from Japanese Animation Viewers Comments
title_full A Deep-Learning-Based Long Article Analysis Method for Sentiment Extraction from Japanese Animation Viewers Comments
title_fullStr A Deep-Learning-Based Long Article Analysis Method for Sentiment Extraction from Japanese Animation Viewers Comments
title_full_unstemmed A Deep-Learning-Based Long Article Analysis Method for Sentiment Extraction from Japanese Animation Viewers Comments
title_sort deep-learning-based long article analysis method for sentiment extraction from japanese animation viewers comments
publishDate 2019
url http://ndltd.ncl.edu.tw/handle/898crq
work_keys_str_mv AT yingtselee adeeplearningbasedlongarticleanalysismethodforsentimentextractionfromjapaneseanimationviewerscomments
AT lǐyìngzé adeeplearningbasedlongarticleanalysismethodforsentimentextractionfromjapaneseanimationviewerscomments
AT yingtselee yīgèyīngyòngshēndùxuéxídezhǎngwénfēnxīyúrìběndònghuàzuòpǐnpínglùnyǐxiéqǔyúqíngzhīfāngfǎ
AT lǐyìngzé yīgèyīngyòngshēndùxuéxídezhǎngwénfēnxīyúrìběndònghuàzuòpǐnpínglùnyǐxiéqǔyúqíngzhīfāngfǎ
AT yingtselee deeplearningbasedlongarticleanalysismethodforsentimentextractionfromjapaneseanimationviewerscomments
AT lǐyìngzé deeplearningbasedlongarticleanalysismethodforsentimentextractionfromjapaneseanimationviewerscomments
_version_ 1719276245389148160