Performance Evaluation of Keyword Extraction Methods and Visualization for Student Online Comments

Topic keyword extraction (as a typical task in information retrieval) refers to extracting the core keywords from document topics. In an online environment, students often post comments in subject forums. The automatic and accurate extraction of keywords from these comments are beneficial to lecture...

Full description

Bibliographic Details
Main Authors: Feng Liu, Xiaodi Huang, Weidong Huang, Sophia Xiaoxia Duan
Format: Article
Language:English
Published: MDPI AG 2020-11-01
Series:Symmetry
Subjects:
Online Access:https://www.mdpi.com/2073-8994/12/11/1923
id doaj-6c55b20cf78745a59fb0de8e6cbf8906
record_format Article
spelling doaj-6c55b20cf78745a59fb0de8e6cbf89062020-11-25T04:10:36ZengMDPI AGSymmetry2073-89942020-11-01121923192310.3390/sym12111923Performance Evaluation of Keyword Extraction Methods and Visualization for Student Online CommentsFeng Liu0Xiaodi Huang1Weidong Huang2Sophia Xiaoxia Duan3School of Computing and Mathematics, Charles Sturt University, Albury 2640, AustraliaSchool of Computing and Mathematics, Charles Sturt University, Albury 2640, AustraliaFaculty of Transdisciplinary Innovation, University of Technology Sydney, Sydney 2007, AustraliaSchool of Accounting, Information Systems and Supply Chain, RMIT University, Melbourne 3000, AustraliaTopic keyword extraction (as a typical task in information retrieval) refers to extracting the core keywords from document topics. In an online environment, students often post comments in subject forums. The automatic and accurate extraction of keywords from these comments are beneficial to lecturers (particular when it comes to repeatedly delivered subjects). In this paper, we compare the performance of traditional machine learning algorithms and two deep learning methods in extracting topic keywords from student comments posted in subject forums. For this purpose, we collected student comment data from a period of two years, manually tagging part of the raw data for our experiments. Based on this dataset, we comprehensively compared the five typical algorithms of naïve Bayes, logistic regression, support vector machine, convolutional neural networks, and Long Short-Term Memory with Attention (Att-LSTM). The performances were measured by the four evaluation metrics. We further examined the keywords by visualization. From the results of our experiment and visualization, we conclude that the Att-LSTM method is the best approach for topic keyword extraction from student comments. Further, the results from the algorithms and visualization are symmetry, to some degree. In particular, the extracted topics from the comments posted at the same stages of different teaching sessions are, almost, reflection symmetry.https://www.mdpi.com/2073-8994/12/11/1923student commentsmetricsmachine learningtopic keywords extractionNaïve BayesLogR
collection DOAJ
language English
format Article
sources DOAJ
author Feng Liu
Xiaodi Huang
Weidong Huang
Sophia Xiaoxia Duan
spellingShingle Feng Liu
Xiaodi Huang
Weidong Huang
Sophia Xiaoxia Duan
Performance Evaluation of Keyword Extraction Methods and Visualization for Student Online Comments
Symmetry
student comments
metrics
machine learning
topic keywords extraction
Naïve Bayes
LogR
author_facet Feng Liu
Xiaodi Huang
Weidong Huang
Sophia Xiaoxia Duan
author_sort Feng Liu
title Performance Evaluation of Keyword Extraction Methods and Visualization for Student Online Comments
title_short Performance Evaluation of Keyword Extraction Methods and Visualization for Student Online Comments
title_full Performance Evaluation of Keyword Extraction Methods and Visualization for Student Online Comments
title_fullStr Performance Evaluation of Keyword Extraction Methods and Visualization for Student Online Comments
title_full_unstemmed Performance Evaluation of Keyword Extraction Methods and Visualization for Student Online Comments
title_sort performance evaluation of keyword extraction methods and visualization for student online comments
publisher MDPI AG
series Symmetry
issn 2073-8994
publishDate 2020-11-01
description Topic keyword extraction (as a typical task in information retrieval) refers to extracting the core keywords from document topics. In an online environment, students often post comments in subject forums. The automatic and accurate extraction of keywords from these comments are beneficial to lecturers (particular when it comes to repeatedly delivered subjects). In this paper, we compare the performance of traditional machine learning algorithms and two deep learning methods in extracting topic keywords from student comments posted in subject forums. For this purpose, we collected student comment data from a period of two years, manually tagging part of the raw data for our experiments. Based on this dataset, we comprehensively compared the five typical algorithms of naïve Bayes, logistic regression, support vector machine, convolutional neural networks, and Long Short-Term Memory with Attention (Att-LSTM). The performances were measured by the four evaluation metrics. We further examined the keywords by visualization. From the results of our experiment and visualization, we conclude that the Att-LSTM method is the best approach for topic keyword extraction from student comments. Further, the results from the algorithms and visualization are symmetry, to some degree. In particular, the extracted topics from the comments posted at the same stages of different teaching sessions are, almost, reflection symmetry.
topic student comments
metrics
machine learning
topic keywords extraction
Naïve Bayes
LogR
url https://www.mdpi.com/2073-8994/12/11/1923
work_keys_str_mv AT fengliu performanceevaluationofkeywordextractionmethodsandvisualizationforstudentonlinecomments
AT xiaodihuang performanceevaluationofkeywordextractionmethodsandvisualizationforstudentonlinecomments
AT weidonghuang performanceevaluationofkeywordextractionmethodsandvisualizationforstudentonlinecomments
AT sophiaxiaoxiaduan performanceevaluationofkeywordextractionmethodsandvisualizationforstudentonlinecomments
_version_ 1724419951633104896