Revisiting the reproducibility of empirical software engineering studies based on data retrieved from development repositories
Fecha
2023
Título de la revista
ISSN de la revista
Título del volumen
Editor
Elsevier
Resumen
Context: In 2012, our paper ‘‘On the reproducibility of empirical software engineering studies based on data
retrieved from development repositories’’ was published. It proposed a method for assessing the reproducibility
of studies based on mining software repositories (MSR studies). Since then, several approaches have happened
with respect to the study of the reproducibility of this kind of studies.
Objective: To revisit the proposals of that paper, analyzing to which extent they remain valid, and how
they relate to current initiatives and studies on reproducibility and validation of research results in empirical
software engineering.
Method: We analyze the most relevant studies affecting assumptions or consequences of the approach of the
original paper, and other initiatives related to the evaluation of replicability aspects of empirical software
engineering studies. We compare the results of that analysis with the results of the original study, finding
similarities and differences. We also run a reproducibility assessment study on current MSR papers. Based on
the comparison, and the applicability of the method to current papers, we draw conclusions on the validity
of the approach of the original paper.
Main lessons learned: The method proposed in the original paper is still valid, and compares well with
other more recent methods. It matches the results of relevant studies on reproducibility, and a systematic
comparison with them shows that our approach is aligned with their proposals. Our method has practical use,
and complements well the current major initiatives on the review of reproducibility artifacts. As a side result,
we learn that the reproducibility of MSR studies has improved during the last decade.
Vision: We propose to use our approach as a fundamental element of a more profound review of the
reproducibility of MSR studies, and of the characterization of validation studies in this realm.
Descripción
The research presented in this paper has been supported in part by the Government of Spain , through project “Dependentium” (PID2022-139551NB-I00).
Citación
Jesus M. Gonzalez-Barahona, Gregorio Robles, Revisiting the reproducibility of empirical software engineering studies based on data retrieved from development repositories, Information and Software Technology, Volume 164, 2023, 107318, ISSN 0950-5849, https://doi.org/10.1016/j.infsof.2023.107318
Colecciones
Excepto si se señala otra cosa, la licencia del ítem se describe como Atribución 4.0 Internacional