Defexts: A Curated Dataset of Reproducible Real-world bugs for Modern JVM Languages

Samuel Benton, Ali Ghanbari, Lingming Zhang

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

Software engineering studies, such as bug detection, localization, repair, and prediction, often require benchmark bug datasets for their experiments. Few publicly available reproducible bug datasets exist for research consumption. Such datasets which publicly exist tend to be applicable exclusively towards the most popular traditional programming languages (e.g., Defects4J for Java and CoreBench for C). Thus, the creation and widespread usage of bug datasets for other popular modern JVM (Java Virtual Machine) programming languages serve to provide vital resources for software engineering research. This paper introduces Defexts, a family of bug datasets currently containing child datasets for Kotlin (DefextsKotlin) and Groovy (DefextsGroovy). Each dataset contains reproducible real-world bugs and their corresponding patches scraped from real-world projects. Our introductory versions of DefextsKotlin and DefextsGroovy include 225 Kotlin and 302 Groovy bugs and patches. As development of Defexts continues, we aim to include other JVM languages, notably Scala. A video demonstration of Defexts is located at following link:https://youtu.be/lenYcVzRGGQ.

Original languageEnglish (US)
Title of host publicationProceedings - 2019 IEEE/ACM 41st International Conference on Software Engineering
Subtitle of host publicationCompanion, ICSE-Companion 2019
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages47-50
Number of pages4
ISBN (Electronic)9781728117645
DOIs
StatePublished - May 2019
Externally publishedYes
Event41st IEEE/ACM International Conference on Software Engineering: Companion, ICSE-Companion 2019 - Montreal, Canada
Duration: May 25 2019May 31 2019

Publication series

NameProceedings - 2019 IEEE/ACM 41st International Conference on Software Engineering: Companion, ICSE-Companion 2019

Conference

Conference41st IEEE/ACM International Conference on Software Engineering: Companion, ICSE-Companion 2019
Country/TerritoryCanada
CityMontreal
Period5/25/195/31/19

Keywords

  • Benchmark
  • Bug
  • Dataset
  • Groovy
  • Java virtual machine
  • JVM
  • Kotlin
  • Scala
  • Software testing

ASJC Scopus subject areas

  • Organizational Behavior and Human Resource Management
  • Software
  • Safety, Risk, Reliability and Quality
  • Education

Fingerprint

Dive into the research topics of 'Defexts: A Curated Dataset of Reproducible Real-world bugs for Modern JVM Languages'. Together they form a unique fingerprint.

Cite this