The List is the Process: Reliable Pre-Integration Tracking of Commits on Mailing Lists

Research output: Chapter in book/report/conference proceedingConference contributionResearchpeer review

Authors

  • Ralf Ramsauer
  • Daniel Lohmann
  • Wolfgang Mauerer

External Research Organisations

  • The Regensburg University of Applied Sciences
  • Siemens AG
View graph of relations

Details

Original languageEnglish
Title of host publication2019 IEEE/ACM 41st International Conference on Software Engineering, ICSE 2019
Subtitle of host publicationProceedings
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages807-818
Number of pages12
ISBN (electronic)978-1-7281-0869-8
ISBN (print)978-1-7281-0870-4
Publication statusPublished - May 2019
Event41st IEEE/ACM International Conference on Software Engineering, ICSE 2019 - Montreal, Canada
Duration: 25 May 201931 May 2019

Publication series

NameInternational Conference on Software Engineering
ISSN (Print)0270-5257
ISSN (electronic)1558-1225

Abstract

A considerable corpus of research on software evolution focuses on mining changes in software repositories, but omits their pre-integration history. We present a novel method for tracking this otherwise invisible evolution of software changes on mailing lists by connecting all early revisions of changes to their final version in repositories. Since artefact modifications on mailing lists are communicated by updates to fragments (i.e., patches) only, identifying semantically similar changes is a non-trivial task that our approach solves in a language-independent way. We evaluate our method on high-profile open source software (OSS) projects like the Linux kernel, and validate its high accuracy using an elaborately created ground truth. Our approach can be used to quantify properties of OSS development processes, which is an essential requirement for using OSS in reliable or safety-critical industrial products, where certifiability and conformance to processes are crucial. The high accuracy of our technique allows, to the best of our knowledge, for the first time to quantitatively determine if an open development process effectively aligns with given formal process requirements.

Keywords

    commits, mailing lists, mining software repositories, patches, software engineering

ASJC Scopus subject areas

Cite this

The List is the Process: Reliable Pre-Integration Tracking of Commits on Mailing Lists. / Ramsauer, Ralf; Lohmann, Daniel; Mauerer, Wolfgang.
2019 IEEE/ACM 41st International Conference on Software Engineering, ICSE 2019: Proceedings. Institute of Electrical and Electronics Engineers Inc., 2019. p. 807-818 (International Conference on Software Engineering).

Research output: Chapter in book/report/conference proceedingConference contributionResearchpeer review

Ramsauer, R, Lohmann, D & Mauerer, W 2019, The List is the Process: Reliable Pre-Integration Tracking of Commits on Mailing Lists. in 2019 IEEE/ACM 41st International Conference on Software Engineering, ICSE 2019: Proceedings. International Conference on Software Engineering, Institute of Electrical and Electronics Engineers Inc., pp. 807-818, 41st IEEE/ACM International Conference on Software Engineering, ICSE 2019, Montreal, Canada, 25 May 2019. https://doi.org/10.48550/arXiv.1902.03147, https://doi.org/10.1109/ICSE.2019.00088
Ramsauer, R., Lohmann, D., & Mauerer, W. (2019). The List is the Process: Reliable Pre-Integration Tracking of Commits on Mailing Lists. In 2019 IEEE/ACM 41st International Conference on Software Engineering, ICSE 2019: Proceedings (pp. 807-818). (International Conference on Software Engineering). Institute of Electrical and Electronics Engineers Inc.. https://doi.org/10.48550/arXiv.1902.03147, https://doi.org/10.1109/ICSE.2019.00088
Ramsauer R, Lohmann D, Mauerer W. The List is the Process: Reliable Pre-Integration Tracking of Commits on Mailing Lists. In 2019 IEEE/ACM 41st International Conference on Software Engineering, ICSE 2019: Proceedings. Institute of Electrical and Electronics Engineers Inc. 2019. p. 807-818. (International Conference on Software Engineering). doi: 10.48550/arXiv.1902.03147, 10.1109/ICSE.2019.00088
Ramsauer, Ralf ; Lohmann, Daniel ; Mauerer, Wolfgang. / The List is the Process : Reliable Pre-Integration Tracking of Commits on Mailing Lists. 2019 IEEE/ACM 41st International Conference on Software Engineering, ICSE 2019: Proceedings. Institute of Electrical and Electronics Engineers Inc., 2019. pp. 807-818 (International Conference on Software Engineering).
Download
@inproceedings{d2b997407c0546408117637061aa8fea,
title = "The List is the Process: Reliable Pre-Integration Tracking of Commits on Mailing Lists",
abstract = "A considerable corpus of research on software evolution focuses on mining changes in software repositories, but omits their pre-integration history. We present a novel method for tracking this otherwise invisible evolution of software changes on mailing lists by connecting all early revisions of changes to their final version in repositories. Since artefact modifications on mailing lists are communicated by updates to fragments (i.e., patches) only, identifying semantically similar changes is a non-trivial task that our approach solves in a language-independent way. We evaluate our method on high-profile open source software (OSS) projects like the Linux kernel, and validate its high accuracy using an elaborately created ground truth. Our approach can be used to quantify properties of OSS development processes, which is an essential requirement for using OSS in reliable or safety-critical industrial products, where certifiability and conformance to processes are crucial. The high accuracy of our technique allows, to the best of our knowledge, for the first time to quantitatively determine if an open development process effectively aligns with given formal process requirements.",
keywords = "commits, mailing lists, mining software repositories, patches, software engineering",
author = "Ralf Ramsauer and Daniel Lohmann and Wolfgang Mauerer",
note = "Funding information: This work was supported by Siemens AG, Corporate Research, the iDev40 project and the German Research Council (DFG) under grant no. LO 1719/3-1. The iDev40 project has received funding from the ECSEL Joint Undertaking (JU) under grant no. 783163. The JU receives support from the European Union{\^a}A? ?s Horizon 2020 research and innovation programme. It is co-funded by the consortium members, grants from Austria, Germany, Belgium, Italy, Spain and Romania.; 41st IEEE/ACM International Conference on Software Engineering, ICSE 2019 ; Conference date: 25-05-2019 Through 31-05-2019",
year = "2019",
month = may,
doi = "10.48550/arXiv.1902.03147",
language = "English",
isbn = "978-1-7281-0870-4",
series = "International Conference on Software Engineering",
publisher = "Institute of Electrical and Electronics Engineers Inc.",
pages = "807--818",
booktitle = "2019 IEEE/ACM 41st International Conference on Software Engineering, ICSE 2019",
address = "United States",

}

Download

TY - GEN

T1 - The List is the Process

T2 - 41st IEEE/ACM International Conference on Software Engineering, ICSE 2019

AU - Ramsauer, Ralf

AU - Lohmann, Daniel

AU - Mauerer, Wolfgang

N1 - Funding information: This work was supported by Siemens AG, Corporate Research, the iDev40 project and the German Research Council (DFG) under grant no. LO 1719/3-1. The iDev40 project has received funding from the ECSEL Joint Undertaking (JU) under grant no. 783163. The JU receives support from the European UnionâA? ?s Horizon 2020 research and innovation programme. It is co-funded by the consortium members, grants from Austria, Germany, Belgium, Italy, Spain and Romania.

PY - 2019/5

Y1 - 2019/5

N2 - A considerable corpus of research on software evolution focuses on mining changes in software repositories, but omits their pre-integration history. We present a novel method for tracking this otherwise invisible evolution of software changes on mailing lists by connecting all early revisions of changes to their final version in repositories. Since artefact modifications on mailing lists are communicated by updates to fragments (i.e., patches) only, identifying semantically similar changes is a non-trivial task that our approach solves in a language-independent way. We evaluate our method on high-profile open source software (OSS) projects like the Linux kernel, and validate its high accuracy using an elaborately created ground truth. Our approach can be used to quantify properties of OSS development processes, which is an essential requirement for using OSS in reliable or safety-critical industrial products, where certifiability and conformance to processes are crucial. The high accuracy of our technique allows, to the best of our knowledge, for the first time to quantitatively determine if an open development process effectively aligns with given formal process requirements.

AB - A considerable corpus of research on software evolution focuses on mining changes in software repositories, but omits their pre-integration history. We present a novel method for tracking this otherwise invisible evolution of software changes on mailing lists by connecting all early revisions of changes to their final version in repositories. Since artefact modifications on mailing lists are communicated by updates to fragments (i.e., patches) only, identifying semantically similar changes is a non-trivial task that our approach solves in a language-independent way. We evaluate our method on high-profile open source software (OSS) projects like the Linux kernel, and validate its high accuracy using an elaborately created ground truth. Our approach can be used to quantify properties of OSS development processes, which is an essential requirement for using OSS in reliable or safety-critical industrial products, where certifiability and conformance to processes are crucial. The high accuracy of our technique allows, to the best of our knowledge, for the first time to quantitatively determine if an open development process effectively aligns with given formal process requirements.

KW - commits

KW - mailing lists

KW - mining software repositories

KW - patches

KW - software engineering

UR - http://www.scopus.com/inward/record.url?scp=85072290414&partnerID=8YFLogxK

U2 - 10.48550/arXiv.1902.03147

DO - 10.48550/arXiv.1902.03147

M3 - Conference contribution

AN - SCOPUS:85072290414

SN - 978-1-7281-0870-4

T3 - International Conference on Software Engineering

SP - 807

EP - 818

BT - 2019 IEEE/ACM 41st International Conference on Software Engineering, ICSE 2019

PB - Institute of Electrical and Electronics Engineers Inc.

Y2 - 25 May 2019 through 31 May 2019

ER -