EM Algorithm for Estimating Reliability of Multi-Release Open Source Software Based on General Masked Data

Multi-release is critical for modern open source software product in order to satisfy more customer requirements. Masked data, a kind of missing data, is the system failure data when the exact cause of the failures might be unknown. That is, the cause of the system failures may be any one of the obj...

Full description

Bibliographic Details
Main Authors: Jianfeng Yang, Jing Chen, Xibin Wang
Format: Article
Language:English
Published: IEEE 2021-01-01
Series:IEEE Access
Subjects:
Online Access:https://ieeexplore.ieee.org/document/9336012/
id doaj-5bb7b014f6994102833223f4108ca11e
record_format Article
spelling doaj-5bb7b014f6994102833223f4108ca11e2021-03-30T15:17:16ZengIEEEIEEE Access2169-35362021-01-019188901890310.1109/ACCESS.2021.30547609336012EM Algorithm for Estimating Reliability of Multi-Release Open Source Software Based on General Masked DataJianfeng Yang0https://orcid.org/0000-0003-3486-9604Jing Chen1Xibin Wang2School of Data Science, Guizhou Institute of Technology, Guiyang, ChinaCollege of Information Engineering, Guizhou University of Traditional Chinese Medicine, Guiyang, ChinaSchool of Data Science, Guizhou Institute of Technology, Guiyang, ChinaMulti-release is critical for modern open source software product in order to satisfy more customer requirements. Masked data, a kind of missing data, is the system failure data when the exact cause of the failures might be unknown. That is, the cause of the system failures may be any one of the objects. However, due to the influence of the test strategy in real project, the cause of the system failures may be a subset of the system objects, not any one of the objects. In this paper, the mathematical description of general masked data is presented based on the traditional masked data. Furthermore, a novel multi-release open source software (OSS) reliability model based on general masked data is proposed. Different from traditional multi-release OSS reliability model, the proposed approach is based on additive model with general masked data other than change point model. And then, the maximum likelihood estimation (MLE) process of the model parameters is derived in detail, and expectation maximization (EM) algorithm is used to solve the extremely complicated problem of the log-likelihood function. Finally, two data sets from real open source software project are applied to the proposed approach, and the results show that the proposed reliability model is useful and powerful.https://ieeexplore.ieee.org/document/9336012/General masked datamulti-release open source softwarereliability modelmaximum likelihood estimationEM~algorithm
collection DOAJ
language English
format Article
sources DOAJ
author Jianfeng Yang
Jing Chen
Xibin Wang
spellingShingle Jianfeng Yang
Jing Chen
Xibin Wang
EM Algorithm for Estimating Reliability of Multi-Release Open Source Software Based on General Masked Data
IEEE Access
General masked data
multi-release open source software
reliability model
maximum likelihood estimation
EM~algorithm
author_facet Jianfeng Yang
Jing Chen
Xibin Wang
author_sort Jianfeng Yang
title EM Algorithm for Estimating Reliability of Multi-Release Open Source Software Based on General Masked Data
title_short EM Algorithm for Estimating Reliability of Multi-Release Open Source Software Based on General Masked Data
title_full EM Algorithm for Estimating Reliability of Multi-Release Open Source Software Based on General Masked Data
title_fullStr EM Algorithm for Estimating Reliability of Multi-Release Open Source Software Based on General Masked Data
title_full_unstemmed EM Algorithm for Estimating Reliability of Multi-Release Open Source Software Based on General Masked Data
title_sort em algorithm for estimating reliability of multi-release open source software based on general masked data
publisher IEEE
series IEEE Access
issn 2169-3536
publishDate 2021-01-01
description Multi-release is critical for modern open source software product in order to satisfy more customer requirements. Masked data, a kind of missing data, is the system failure data when the exact cause of the failures might be unknown. That is, the cause of the system failures may be any one of the objects. However, due to the influence of the test strategy in real project, the cause of the system failures may be a subset of the system objects, not any one of the objects. In this paper, the mathematical description of general masked data is presented based on the traditional masked data. Furthermore, a novel multi-release open source software (OSS) reliability model based on general masked data is proposed. Different from traditional multi-release OSS reliability model, the proposed approach is based on additive model with general masked data other than change point model. And then, the maximum likelihood estimation (MLE) process of the model parameters is derived in detail, and expectation maximization (EM) algorithm is used to solve the extremely complicated problem of the log-likelihood function. Finally, two data sets from real open source software project are applied to the proposed approach, and the results show that the proposed reliability model is useful and powerful.
topic General masked data
multi-release open source software
reliability model
maximum likelihood estimation
EM~algorithm
url https://ieeexplore.ieee.org/document/9336012/
work_keys_str_mv AT jianfengyang emalgorithmforestimatingreliabilityofmultireleaseopensourcesoftwarebasedongeneralmaskeddata
AT jingchen emalgorithmforestimatingreliabilityofmultireleaseopensourcesoftwarebasedongeneralmaskeddata
AT xibinwang emalgorithmforestimatingreliabilityofmultireleaseopensourcesoftwarebasedongeneralmaskeddata
_version_ 1714739979193679872