Deep Learning-Based Correct Answer Prediction for Developer Forums

Developer forums are essential for software engineers to solve their problems with the assistance of experts on such forums. However, sometimes the solutions (answers) of a problem are not satisfactory or challenging to select the potential answer. Information seekers usually browse all the answers...

Full description

Bibliographic Details
Main Authors: Hafiz Umar Iftikhar, Aqeel Ur Rehman, Olga A. Kalugina, Qasim Umer, Haris Ali Khan
Format: Article
Language:English
Published: IEEE 2021-01-01
Series:IEEE Access
Subjects:
Online Access:https://ieeexplore.ieee.org/document/9524607/
id doaj-e4d40b341fd04d9b80b5b19580aa482e
record_format Article
spelling doaj-e4d40b341fd04d9b80b5b19580aa482e2021-09-21T23:00:15ZengIEEEIEEE Access2169-35362021-01-01912816612817710.1109/ACCESS.2021.31084169524607Deep Learning-Based Correct Answer Prediction for Developer ForumsHafiz Umar Iftikhar0https://orcid.org/0000-0002-9841-1291Aqeel Ur Rehman1https://orcid.org/0000-0002-3083-6066Olga A. Kalugina2Qasim Umer3https://orcid.org/0000-0002-0237-3025Haris Ali Khan4Research Center for Modelling and Simulation, National University of Sciences and Technology, Islamabad, PakistanComputer Science Department, COMSATS University Islamabad, Vehari Campus, Vehari, PakistanDepartment of English Language for Professional Communication, Financial University Under the Government of the Russian Federation, Moscow, RussiaComputer Science Department, COMSATS University Islamabad, Vehari Campus, Vehari, PakistanComputer Science Department, COMSATS University Islamabad, Vehari Campus, Vehari, PakistanDeveloper forums are essential for software engineers to solve their problems with the assistance of experts on such forums. However, sometimes the solutions (answers) of a problem are not satisfactory or challenging to select the potential answer. Information seekers usually browse all the answers within the question thread to get the potential answer. The manual selection of correct answers is a tedious and time-consuming task. In this paper, we propose an automatic classification approach to predict the correct answers for developer forums. We first extract the metadata and combination of Q/A for each thread of the developer community (<italic>Stack Overflow</italic>). Then, the natural language processing techniques are applied to preprocess the Q/A combinations of the given dataset. After that, a keyword ranking algorithm is leveraged to extract keywords and their ranking scores for each Q/A combination. Based on keywords and their ranking scores for each Q/A combination, a keywords-based feature vector is constructed. Subsequently, word embedding is leveraged to convert each preprocessed Q/A combination into a text-based feature vector. Finally, we pass the metadata, keywords-based features, and text-based features to the ensemble deep learning model for training to predict correct answers. The results of 10-fold cross-validation specify that the proposed approach is accurate and surpasses the state-of-the-art. On average, it improves the <italic>accuracy</italic>, <italic>precision</italic>, <italic>recall</italic>, and <italic>f-measure</italic> up to <italic>1.72&#x0025;</italic>, <italic>24.96&#x0025;</italic>, <italic>6.57&#x0025;</italic>, and <italic>16.62&#x0025;</italic>, respectively.https://ieeexplore.ieee.org/document/9524607/Developer forumscommunity questing answeringcorrect answer predictionstack overflowsoftware maintenancedeep learning
collection DOAJ
language English
format Article
sources DOAJ
author Hafiz Umar Iftikhar
Aqeel Ur Rehman
Olga A. Kalugina
Qasim Umer
Haris Ali Khan
spellingShingle Hafiz Umar Iftikhar
Aqeel Ur Rehman
Olga A. Kalugina
Qasim Umer
Haris Ali Khan
Deep Learning-Based Correct Answer Prediction for Developer Forums
IEEE Access
Developer forums
community questing answering
correct answer prediction
stack overflow
software maintenance
deep learning
author_facet Hafiz Umar Iftikhar
Aqeel Ur Rehman
Olga A. Kalugina
Qasim Umer
Haris Ali Khan
author_sort Hafiz Umar Iftikhar
title Deep Learning-Based Correct Answer Prediction for Developer Forums
title_short Deep Learning-Based Correct Answer Prediction for Developer Forums
title_full Deep Learning-Based Correct Answer Prediction for Developer Forums
title_fullStr Deep Learning-Based Correct Answer Prediction for Developer Forums
title_full_unstemmed Deep Learning-Based Correct Answer Prediction for Developer Forums
title_sort deep learning-based correct answer prediction for developer forums
publisher IEEE
series IEEE Access
issn 2169-3536
publishDate 2021-01-01
description Developer forums are essential for software engineers to solve their problems with the assistance of experts on such forums. However, sometimes the solutions (answers) of a problem are not satisfactory or challenging to select the potential answer. Information seekers usually browse all the answers within the question thread to get the potential answer. The manual selection of correct answers is a tedious and time-consuming task. In this paper, we propose an automatic classification approach to predict the correct answers for developer forums. We first extract the metadata and combination of Q/A for each thread of the developer community (<italic>Stack Overflow</italic>). Then, the natural language processing techniques are applied to preprocess the Q/A combinations of the given dataset. After that, a keyword ranking algorithm is leveraged to extract keywords and their ranking scores for each Q/A combination. Based on keywords and their ranking scores for each Q/A combination, a keywords-based feature vector is constructed. Subsequently, word embedding is leveraged to convert each preprocessed Q/A combination into a text-based feature vector. Finally, we pass the metadata, keywords-based features, and text-based features to the ensemble deep learning model for training to predict correct answers. The results of 10-fold cross-validation specify that the proposed approach is accurate and surpasses the state-of-the-art. On average, it improves the <italic>accuracy</italic>, <italic>precision</italic>, <italic>recall</italic>, and <italic>f-measure</italic> up to <italic>1.72&#x0025;</italic>, <italic>24.96&#x0025;</italic>, <italic>6.57&#x0025;</italic>, and <italic>16.62&#x0025;</italic>, respectively.
topic Developer forums
community questing answering
correct answer prediction
stack overflow
software maintenance
deep learning
url https://ieeexplore.ieee.org/document/9524607/
work_keys_str_mv AT hafizumariftikhar deeplearningbasedcorrectanswerpredictionfordeveloperforums
AT aqeelurrehman deeplearningbasedcorrectanswerpredictionfordeveloperforums
AT olgaakalugina deeplearningbasedcorrectanswerpredictionfordeveloperforums
AT qasimumer deeplearningbasedcorrectanswerpredictionfordeveloperforums
AT harisalikhan deeplearningbasedcorrectanswerpredictionfordeveloperforums
_version_ 1717372054744858624