Studying the impact of CI on pull request delivery time in open source projects—a conceptual replication

Nowadays, continuous integration (CI) is indispensable in the software development process. A central promise of adopting CI is that new features or bug fixes can be delivered more quickly. A recent repository mining study by Bernardo, da Costa & Kulesza (2018) found that only about half of the...

Full description

Bibliographic Details
Main Authors: Yunfang Guo, Philipp Leitner
Format: Article
Language:English
Published: PeerJ Inc. 2019-12-01
Series:PeerJ Computer Science
Subjects:
Online Access:https://peerj.com/articles/cs-245.pdf
id doaj-18d2608f3d504e78b82b112668938435
record_format Article
spelling doaj-18d2608f3d504e78b82b1126689384352020-11-25T01:57:04ZengPeerJ Inc.PeerJ Computer Science2376-59922019-12-015e24510.7717/peerj-cs.245Studying the impact of CI on pull request delivery time in open source projects—a conceptual replicationYunfang Guo0Philipp Leitner1Software Engineering Division, Chalmers — University of Gothenburg, Gothenburg, SwedenSoftware Engineering Division, Chalmers — University of Gothenburg, Gothenburg, SwedenNowadays, continuous integration (CI) is indispensable in the software development process. A central promise of adopting CI is that new features or bug fixes can be delivered more quickly. A recent repository mining study by Bernardo, da Costa & Kulesza (2018) found that only about half of the investigated open source projects actually deliver pull requests (PR) faster after adopting CI, with small effect sizes. However, there are some concerns regarding the methodology used by Bernardo et al., which may potentially limit the trustworthiness of this finding. Particularly, they do not explicitly control for normal changes in the pull request delivery time during a project’s lifetime (independently of CI introduction). Hence, in our work, we conduct a conceptual replication of this study. In a first step, we replicate their study results using the same subjects and methodology. In a second step, we address the same core research question using an adapted methodology. We use a different statistical method (regression discontinuity design, RDD) that is more robust towards the confounding factor of projects potentially getting faster in delivering PRs over time naturally, and we introduce a control group of comparable projects that never applied CI. Finally, we also evaluate the generalizability of the original findings on a set of new open source projects sampled using the same methodology. We find that the results of the study by Bernardo et al. largely hold in our replication. Using RDD, we do not find robust evidence of projects getting faster at delivering PRs without CI, and we similarly do not see a speed-up in our control group that never introduced CI. Further, results obtained from a newly mined set of projects are comparable to the original findings. In conclusion, we consider the replication successful.https://peerj.com/articles/cs-245.pdfContinuous integrationMining software repositoriesReplicationPull-request based development
collection DOAJ
language English
format Article
sources DOAJ
author Yunfang Guo
Philipp Leitner
spellingShingle Yunfang Guo
Philipp Leitner
Studying the impact of CI on pull request delivery time in open source projects—a conceptual replication
PeerJ Computer Science
Continuous integration
Mining software repositories
Replication
Pull-request based development
author_facet Yunfang Guo
Philipp Leitner
author_sort Yunfang Guo
title Studying the impact of CI on pull request delivery time in open source projects—a conceptual replication
title_short Studying the impact of CI on pull request delivery time in open source projects—a conceptual replication
title_full Studying the impact of CI on pull request delivery time in open source projects—a conceptual replication
title_fullStr Studying the impact of CI on pull request delivery time in open source projects—a conceptual replication
title_full_unstemmed Studying the impact of CI on pull request delivery time in open source projects—a conceptual replication
title_sort studying the impact of ci on pull request delivery time in open source projects—a conceptual replication
publisher PeerJ Inc.
series PeerJ Computer Science
issn 2376-5992
publishDate 2019-12-01
description Nowadays, continuous integration (CI) is indispensable in the software development process. A central promise of adopting CI is that new features or bug fixes can be delivered more quickly. A recent repository mining study by Bernardo, da Costa & Kulesza (2018) found that only about half of the investigated open source projects actually deliver pull requests (PR) faster after adopting CI, with small effect sizes. However, there are some concerns regarding the methodology used by Bernardo et al., which may potentially limit the trustworthiness of this finding. Particularly, they do not explicitly control for normal changes in the pull request delivery time during a project’s lifetime (independently of CI introduction). Hence, in our work, we conduct a conceptual replication of this study. In a first step, we replicate their study results using the same subjects and methodology. In a second step, we address the same core research question using an adapted methodology. We use a different statistical method (regression discontinuity design, RDD) that is more robust towards the confounding factor of projects potentially getting faster in delivering PRs over time naturally, and we introduce a control group of comparable projects that never applied CI. Finally, we also evaluate the generalizability of the original findings on a set of new open source projects sampled using the same methodology. We find that the results of the study by Bernardo et al. largely hold in our replication. Using RDD, we do not find robust evidence of projects getting faster at delivering PRs without CI, and we similarly do not see a speed-up in our control group that never introduced CI. Further, results obtained from a newly mined set of projects are comparable to the original findings. In conclusion, we consider the replication successful.
topic Continuous integration
Mining software repositories
Replication
Pull-request based development
url https://peerj.com/articles/cs-245.pdf
work_keys_str_mv AT yunfangguo studyingtheimpactofcionpullrequestdeliverytimeinopensourceprojectsaconceptualreplication
AT philippleitner studyingtheimpactofcionpullrequestdeliverytimeinopensourceprojectsaconceptualreplication
_version_ 1724976572446801920