EM Algorithm for Estimating Reliability of Multi-Release Open Source Software Based on General Masked Data
Multi-release is critical for modern open source software product in order to satisfy more customer requirements. Masked data, a kind of missing data, is the system failure data when the exact cause of the failures might be unknown. That is, the cause of the system failures may be any one of the obj...
Main Authors: | , , |
---|---|
Format: | Article |
Language: | English |
Published: |
IEEE
2021-01-01
|
Series: | IEEE Access |
Subjects: | |
Online Access: | https://ieeexplore.ieee.org/document/9336012/ |
id |
doaj-5bb7b014f6994102833223f4108ca11e |
---|---|
record_format |
Article |
spelling |
doaj-5bb7b014f6994102833223f4108ca11e2021-03-30T15:17:16ZengIEEEIEEE Access2169-35362021-01-019188901890310.1109/ACCESS.2021.30547609336012EM Algorithm for Estimating Reliability of Multi-Release Open Source Software Based on General Masked DataJianfeng Yang0https://orcid.org/0000-0003-3486-9604Jing Chen1Xibin Wang2School of Data Science, Guizhou Institute of Technology, Guiyang, ChinaCollege of Information Engineering, Guizhou University of Traditional Chinese Medicine, Guiyang, ChinaSchool of Data Science, Guizhou Institute of Technology, Guiyang, ChinaMulti-release is critical for modern open source software product in order to satisfy more customer requirements. Masked data, a kind of missing data, is the system failure data when the exact cause of the failures might be unknown. That is, the cause of the system failures may be any one of the objects. However, due to the influence of the test strategy in real project, the cause of the system failures may be a subset of the system objects, not any one of the objects. In this paper, the mathematical description of general masked data is presented based on the traditional masked data. Furthermore, a novel multi-release open source software (OSS) reliability model based on general masked data is proposed. Different from traditional multi-release OSS reliability model, the proposed approach is based on additive model with general masked data other than change point model. And then, the maximum likelihood estimation (MLE) process of the model parameters is derived in detail, and expectation maximization (EM) algorithm is used to solve the extremely complicated problem of the log-likelihood function. Finally, two data sets from real open source software project are applied to the proposed approach, and the results show that the proposed reliability model is useful and powerful.https://ieeexplore.ieee.org/document/9336012/General masked datamulti-release open source softwarereliability modelmaximum likelihood estimationEM~algorithm |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Jianfeng Yang Jing Chen Xibin Wang |
spellingShingle |
Jianfeng Yang Jing Chen Xibin Wang EM Algorithm for Estimating Reliability of Multi-Release Open Source Software Based on General Masked Data IEEE Access General masked data multi-release open source software reliability model maximum likelihood estimation EM~algorithm |
author_facet |
Jianfeng Yang Jing Chen Xibin Wang |
author_sort |
Jianfeng Yang |
title |
EM Algorithm for Estimating Reliability of Multi-Release Open Source Software Based on General Masked Data |
title_short |
EM Algorithm for Estimating Reliability of Multi-Release Open Source Software Based on General Masked Data |
title_full |
EM Algorithm for Estimating Reliability of Multi-Release Open Source Software Based on General Masked Data |
title_fullStr |
EM Algorithm for Estimating Reliability of Multi-Release Open Source Software Based on General Masked Data |
title_full_unstemmed |
EM Algorithm for Estimating Reliability of Multi-Release Open Source Software Based on General Masked Data |
title_sort |
em algorithm for estimating reliability of multi-release open source software based on general masked data |
publisher |
IEEE |
series |
IEEE Access |
issn |
2169-3536 |
publishDate |
2021-01-01 |
description |
Multi-release is critical for modern open source software product in order to satisfy more customer requirements. Masked data, a kind of missing data, is the system failure data when the exact cause of the failures might be unknown. That is, the cause of the system failures may be any one of the objects. However, due to the influence of the test strategy in real project, the cause of the system failures may be a subset of the system objects, not any one of the objects. In this paper, the mathematical description of general masked data is presented based on the traditional masked data. Furthermore, a novel multi-release open source software (OSS) reliability model based on general masked data is proposed. Different from traditional multi-release OSS reliability model, the proposed approach is based on additive model with general masked data other than change point model. And then, the maximum likelihood estimation (MLE) process of the model parameters is derived in detail, and expectation maximization (EM) algorithm is used to solve the extremely complicated problem of the log-likelihood function. Finally, two data sets from real open source software project are applied to the proposed approach, and the results show that the proposed reliability model is useful and powerful. |
topic |
General masked data multi-release open source software reliability model maximum likelihood estimation EM~algorithm |
url |
https://ieeexplore.ieee.org/document/9336012/ |
work_keys_str_mv |
AT jianfengyang emalgorithmforestimatingreliabilityofmultireleaseopensourcesoftwarebasedongeneralmaskeddata AT jingchen emalgorithmforestimatingreliabilityofmultireleaseopensourcesoftwarebasedongeneralmaskeddata AT xibinwang emalgorithmforestimatingreliabilityofmultireleaseopensourcesoftwarebasedongeneralmaskeddata |
_version_ |
1714739979193679872 |