Deep Learning-Based Correct Answer Prediction for Developer Forums
Developer forums are essential for software engineers to solve their problems with the assistance of experts on such forums. However, sometimes the solutions (answers) of a problem are not satisfactory or challenging to select the potential answer. Information seekers usually browse all the answers...
Main Authors: | , , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
IEEE
2021-01-01
|
Series: | IEEE Access |
Subjects: | |
Online Access: | https://ieeexplore.ieee.org/document/9524607/ |
id |
doaj-e4d40b341fd04d9b80b5b19580aa482e |
---|---|
record_format |
Article |
spelling |
doaj-e4d40b341fd04d9b80b5b19580aa482e2021-09-21T23:00:15ZengIEEEIEEE Access2169-35362021-01-01912816612817710.1109/ACCESS.2021.31084169524607Deep Learning-Based Correct Answer Prediction for Developer ForumsHafiz Umar Iftikhar0https://orcid.org/0000-0002-9841-1291Aqeel Ur Rehman1https://orcid.org/0000-0002-3083-6066Olga A. Kalugina2Qasim Umer3https://orcid.org/0000-0002-0237-3025Haris Ali Khan4Research Center for Modelling and Simulation, National University of Sciences and Technology, Islamabad, PakistanComputer Science Department, COMSATS University Islamabad, Vehari Campus, Vehari, PakistanDepartment of English Language for Professional Communication, Financial University Under the Government of the Russian Federation, Moscow, RussiaComputer Science Department, COMSATS University Islamabad, Vehari Campus, Vehari, PakistanComputer Science Department, COMSATS University Islamabad, Vehari Campus, Vehari, PakistanDeveloper forums are essential for software engineers to solve their problems with the assistance of experts on such forums. However, sometimes the solutions (answers) of a problem are not satisfactory or challenging to select the potential answer. Information seekers usually browse all the answers within the question thread to get the potential answer. The manual selection of correct answers is a tedious and time-consuming task. In this paper, we propose an automatic classification approach to predict the correct answers for developer forums. We first extract the metadata and combination of Q/A for each thread of the developer community (<italic>Stack Overflow</italic>). Then, the natural language processing techniques are applied to preprocess the Q/A combinations of the given dataset. After that, a keyword ranking algorithm is leveraged to extract keywords and their ranking scores for each Q/A combination. Based on keywords and their ranking scores for each Q/A combination, a keywords-based feature vector is constructed. Subsequently, word embedding is leveraged to convert each preprocessed Q/A combination into a text-based feature vector. Finally, we pass the metadata, keywords-based features, and text-based features to the ensemble deep learning model for training to predict correct answers. The results of 10-fold cross-validation specify that the proposed approach is accurate and surpasses the state-of-the-art. On average, it improves the <italic>accuracy</italic>, <italic>precision</italic>, <italic>recall</italic>, and <italic>f-measure</italic> up to <italic>1.72%</italic>, <italic>24.96%</italic>, <italic>6.57%</italic>, and <italic>16.62%</italic>, respectively.https://ieeexplore.ieee.org/document/9524607/Developer forumscommunity questing answeringcorrect answer predictionstack overflowsoftware maintenancedeep learning |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Hafiz Umar Iftikhar Aqeel Ur Rehman Olga A. Kalugina Qasim Umer Haris Ali Khan |
spellingShingle |
Hafiz Umar Iftikhar Aqeel Ur Rehman Olga A. Kalugina Qasim Umer Haris Ali Khan Deep Learning-Based Correct Answer Prediction for Developer Forums IEEE Access Developer forums community questing answering correct answer prediction stack overflow software maintenance deep learning |
author_facet |
Hafiz Umar Iftikhar Aqeel Ur Rehman Olga A. Kalugina Qasim Umer Haris Ali Khan |
author_sort |
Hafiz Umar Iftikhar |
title |
Deep Learning-Based Correct Answer Prediction for Developer Forums |
title_short |
Deep Learning-Based Correct Answer Prediction for Developer Forums |
title_full |
Deep Learning-Based Correct Answer Prediction for Developer Forums |
title_fullStr |
Deep Learning-Based Correct Answer Prediction for Developer Forums |
title_full_unstemmed |
Deep Learning-Based Correct Answer Prediction for Developer Forums |
title_sort |
deep learning-based correct answer prediction for developer forums |
publisher |
IEEE |
series |
IEEE Access |
issn |
2169-3536 |
publishDate |
2021-01-01 |
description |
Developer forums are essential for software engineers to solve their problems with the assistance of experts on such forums. However, sometimes the solutions (answers) of a problem are not satisfactory or challenging to select the potential answer. Information seekers usually browse all the answers within the question thread to get the potential answer. The manual selection of correct answers is a tedious and time-consuming task. In this paper, we propose an automatic classification approach to predict the correct answers for developer forums. We first extract the metadata and combination of Q/A for each thread of the developer community (<italic>Stack Overflow</italic>). Then, the natural language processing techniques are applied to preprocess the Q/A combinations of the given dataset. After that, a keyword ranking algorithm is leveraged to extract keywords and their ranking scores for each Q/A combination. Based on keywords and their ranking scores for each Q/A combination, a keywords-based feature vector is constructed. Subsequently, word embedding is leveraged to convert each preprocessed Q/A combination into a text-based feature vector. Finally, we pass the metadata, keywords-based features, and text-based features to the ensemble deep learning model for training to predict correct answers. The results of 10-fold cross-validation specify that the proposed approach is accurate and surpasses the state-of-the-art. On average, it improves the <italic>accuracy</italic>, <italic>precision</italic>, <italic>recall</italic>, and <italic>f-measure</italic> up to <italic>1.72%</italic>, <italic>24.96%</italic>, <italic>6.57%</italic>, and <italic>16.62%</italic>, respectively. |
topic |
Developer forums community questing answering correct answer prediction stack overflow software maintenance deep learning |
url |
https://ieeexplore.ieee.org/document/9524607/ |
work_keys_str_mv |
AT hafizumariftikhar deeplearningbasedcorrectanswerpredictionfordeveloperforums AT aqeelurrehman deeplearningbasedcorrectanswerpredictionfordeveloperforums AT olgaakalugina deeplearningbasedcorrectanswerpredictionfordeveloperforums AT qasimumer deeplearningbasedcorrectanswerpredictionfordeveloperforums AT harisalikhan deeplearningbasedcorrectanswerpredictionfordeveloperforums |
_version_ |
1717372054744858624 |