A Systematic Literature Review on Using Machine Learning Algorithms for Software Requirements Identification on Stack Overflow

Context. The improvements made in the last couple of decades in the requirements engineering (RE) processes and methods have witnessed a rapid rise in effectively using diverse machine learning (ML) techniques to resolve several multifaceted RE issues. One such challenging issue is the effective ide...

Full description

Bibliographic Details
Main Authors: Arshad Ahmad, Chong Feng, Muzammil Khan, Asif Khan, Ayaz Ullah, Shah Nazir, Adnan Tahir
Format: Article
Language:English
Published: Hindawi-Wiley 2020-01-01
Series:Security and Communication Networks
Online Access:http://dx.doi.org/10.1155/2020/8830683
id doaj-07efb231f40849b29e861bef078757d0
record_format Article
spelling doaj-07efb231f40849b29e861bef078757d02020-11-25T03:28:19ZengHindawi-WileySecurity and Communication Networks1939-01141939-01222020-01-01202010.1155/2020/88306838830683A Systematic Literature Review on Using Machine Learning Algorithms for Software Requirements Identification on Stack OverflowArshad Ahmad0Chong Feng1Muzammil Khan2Asif Khan3Ayaz Ullah4Shah Nazir5Adnan Tahir6School of Computer Science & Technology, Beijing Institute of Technology, Beijing, ChinaSchool of Computer Science & Technology, Beijing Institute of Technology, Beijing, ChinaDepartment of Computer Science, University of Swat, Mingora, PakistanSchool of Computer Science & Technology, Beijing Institute of Technology, Beijing, ChinaDepartment of Computer Science, University of Swabi, Anbar, PakistanDepartment of Computer Science, University of Swabi, Anbar, PakistanCollege of Computer Science and Software Engineering, Shenzhen University, Shenzhen, ChinaContext. The improvements made in the last couple of decades in the requirements engineering (RE) processes and methods have witnessed a rapid rise in effectively using diverse machine learning (ML) techniques to resolve several multifaceted RE issues. One such challenging issue is the effective identification and classification of the software requirements on Stack Overflow (SO) for building quality systems. The appropriateness of ML-based techniques to tackle this issue has revealed quite substantial results, much effective than those produced by the usual available natural language processing (NLP) techniques. Nonetheless, a complete, systematic, and detailed comprehension of these ML based techniques is considerably scarce. Objective. To identify or recognize and classify the kinds of ML algorithms used for software requirements identification primarily on SO. Method. This paper reports a systematic literature review (SLR) collecting empirical evidence published up to May 2020. Results. This SLR study found 2,484 published papers related to RE and SO. The data extraction process of the SLR showed that (1) Latent Dirichlet Allocation (LDA) topic modeling is among the widely used ML algorithm in the selected studies and (2) precision and recall are amongst the most commonly utilized evaluation methods for measuring the performance of these ML algorithms. Conclusion. Our SLR study revealed that while ML algorithms have phenomenal capabilities of identifying the software requirements on SO, they still are confronted with various open problems/issues that will eventually limit their practical applications and performances. Our SLR study calls for the need of close collaboration venture between the RE and ML communities/researchers to handle the open issues confronted in the development of some real world machine learning-based quality systems.http://dx.doi.org/10.1155/2020/8830683
collection DOAJ
language English
format Article
sources DOAJ
author Arshad Ahmad
Chong Feng
Muzammil Khan
Asif Khan
Ayaz Ullah
Shah Nazir
Adnan Tahir
spellingShingle Arshad Ahmad
Chong Feng
Muzammil Khan
Asif Khan
Ayaz Ullah
Shah Nazir
Adnan Tahir
A Systematic Literature Review on Using Machine Learning Algorithms for Software Requirements Identification on Stack Overflow
Security and Communication Networks
author_facet Arshad Ahmad
Chong Feng
Muzammil Khan
Asif Khan
Ayaz Ullah
Shah Nazir
Adnan Tahir
author_sort Arshad Ahmad
title A Systematic Literature Review on Using Machine Learning Algorithms for Software Requirements Identification on Stack Overflow
title_short A Systematic Literature Review on Using Machine Learning Algorithms for Software Requirements Identification on Stack Overflow
title_full A Systematic Literature Review on Using Machine Learning Algorithms for Software Requirements Identification on Stack Overflow
title_fullStr A Systematic Literature Review on Using Machine Learning Algorithms for Software Requirements Identification on Stack Overflow
title_full_unstemmed A Systematic Literature Review on Using Machine Learning Algorithms for Software Requirements Identification on Stack Overflow
title_sort systematic literature review on using machine learning algorithms for software requirements identification on stack overflow
publisher Hindawi-Wiley
series Security and Communication Networks
issn 1939-0114
1939-0122
publishDate 2020-01-01
description Context. The improvements made in the last couple of decades in the requirements engineering (RE) processes and methods have witnessed a rapid rise in effectively using diverse machine learning (ML) techniques to resolve several multifaceted RE issues. One such challenging issue is the effective identification and classification of the software requirements on Stack Overflow (SO) for building quality systems. The appropriateness of ML-based techniques to tackle this issue has revealed quite substantial results, much effective than those produced by the usual available natural language processing (NLP) techniques. Nonetheless, a complete, systematic, and detailed comprehension of these ML based techniques is considerably scarce. Objective. To identify or recognize and classify the kinds of ML algorithms used for software requirements identification primarily on SO. Method. This paper reports a systematic literature review (SLR) collecting empirical evidence published up to May 2020. Results. This SLR study found 2,484 published papers related to RE and SO. The data extraction process of the SLR showed that (1) Latent Dirichlet Allocation (LDA) topic modeling is among the widely used ML algorithm in the selected studies and (2) precision and recall are amongst the most commonly utilized evaluation methods for measuring the performance of these ML algorithms. Conclusion. Our SLR study revealed that while ML algorithms have phenomenal capabilities of identifying the software requirements on SO, they still are confronted with various open problems/issues that will eventually limit their practical applications and performances. Our SLR study calls for the need of close collaboration venture between the RE and ML communities/researchers to handle the open issues confronted in the development of some real world machine learning-based quality systems.
url http://dx.doi.org/10.1155/2020/8830683
work_keys_str_mv AT arshadahmad asystematicliteraturereviewonusingmachinelearningalgorithmsforsoftwarerequirementsidentificationonstackoverflow
AT chongfeng asystematicliteraturereviewonusingmachinelearningalgorithmsforsoftwarerequirementsidentificationonstackoverflow
AT muzammilkhan asystematicliteraturereviewonusingmachinelearningalgorithmsforsoftwarerequirementsidentificationonstackoverflow
AT asifkhan asystematicliteraturereviewonusingmachinelearningalgorithmsforsoftwarerequirementsidentificationonstackoverflow
AT ayazullah asystematicliteraturereviewonusingmachinelearningalgorithmsforsoftwarerequirementsidentificationonstackoverflow
AT shahnazir asystematicliteraturereviewonusingmachinelearningalgorithmsforsoftwarerequirementsidentificationonstackoverflow
AT adnantahir asystematicliteraturereviewonusingmachinelearningalgorithmsforsoftwarerequirementsidentificationonstackoverflow
AT arshadahmad systematicliteraturereviewonusingmachinelearningalgorithmsforsoftwarerequirementsidentificationonstackoverflow
AT chongfeng systematicliteraturereviewonusingmachinelearningalgorithmsforsoftwarerequirementsidentificationonstackoverflow
AT muzammilkhan systematicliteraturereviewonusingmachinelearningalgorithmsforsoftwarerequirementsidentificationonstackoverflow
AT asifkhan systematicliteraturereviewonusingmachinelearningalgorithmsforsoftwarerequirementsidentificationonstackoverflow
AT ayazullah systematicliteraturereviewonusingmachinelearningalgorithmsforsoftwarerequirementsidentificationonstackoverflow
AT shahnazir systematicliteraturereviewonusingmachinelearningalgorithmsforsoftwarerequirementsidentificationonstackoverflow
AT adnantahir systematicliteraturereviewonusingmachinelearningalgorithmsforsoftwarerequirementsidentificationonstackoverflow
_version_ 1715205950080548864