Preserving reproducibility: Provenance and executable containers in DataONE data packages

Bryce Mecum, Matthew B. Jones, Dave Vieglais, Craig Willis

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

Many data packaging standards are available to researchers and data repository operators and the choice to use an existing standard or create a new one is challenging. We introduce the DataONE Data Package standard which is based on the existing OAI-ORE Resource Map standard. We describe the functionality Data Package provides, implementation considerations, compare it to existing standards, and discuss future extensions to the standard including the ability to describe execution environments via WholeTale 'Tales'' and alternate serialization formats.

Original languageEnglish (US)
Title of host publicationProceedings - IEEE 14th International Conference on eScience, e-Science 2018
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages45-49
Number of pages5
ISBN (Electronic)9781538691564
DOIs
StatePublished - Dec 24 2018
Event14th IEEE International Conference on eScience, e-Science 2018 - Amsterdam, Netherlands
Duration: Oct 29 2018Nov 1 2018

Publication series

NameProceedings - IEEE 14th International Conference on eScience, e-Science 2018

Conference

Conference14th IEEE International Conference on eScience, e-Science 2018
CountryNetherlands
CityAmsterdam
Period10/29/1811/1/18

Fingerprint

Provenance
Reproducibility
Container
provenance
Containers
Standard Map
repository
Packaging
Alternate
Repository
Standards
container
resource
Resources
Operator

Keywords

  • Data packaging
  • DataONE
  • OAI-ORE
  • Reproducibility
  • Standards
  • WholeTale

ASJC Scopus subject areas

  • Computer Science Applications
  • Software
  • Ecological Modeling
  • Modeling and Simulation

Cite this

Mecum, B., Jones, M. B., Vieglais, D., & Willis, C. (2018). Preserving reproducibility: Provenance and executable containers in DataONE data packages. In Proceedings - IEEE 14th International Conference on eScience, e-Science 2018 (pp. 45-49). [8588638] (Proceedings - IEEE 14th International Conference on eScience, e-Science 2018). Institute of Electrical and Electronics Engineers Inc.. https://doi.org/10.1109/eScience.2018.00019

Preserving reproducibility : Provenance and executable containers in DataONE data packages. / Mecum, Bryce; Jones, Matthew B.; Vieglais, Dave; Willis, Craig.

Proceedings - IEEE 14th International Conference on eScience, e-Science 2018. Institute of Electrical and Electronics Engineers Inc., 2018. p. 45-49 8588638 (Proceedings - IEEE 14th International Conference on eScience, e-Science 2018).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Mecum, B, Jones, MB, Vieglais, D & Willis, C 2018, Preserving reproducibility: Provenance and executable containers in DataONE data packages. in Proceedings - IEEE 14th International Conference on eScience, e-Science 2018., 8588638, Proceedings - IEEE 14th International Conference on eScience, e-Science 2018, Institute of Electrical and Electronics Engineers Inc., pp. 45-49, 14th IEEE International Conference on eScience, e-Science 2018, Amsterdam, Netherlands, 10/29/18. https://doi.org/10.1109/eScience.2018.00019
Mecum B, Jones MB, Vieglais D, Willis C. Preserving reproducibility: Provenance and executable containers in DataONE data packages. In Proceedings - IEEE 14th International Conference on eScience, e-Science 2018. Institute of Electrical and Electronics Engineers Inc. 2018. p. 45-49. 8588638. (Proceedings - IEEE 14th International Conference on eScience, e-Science 2018). https://doi.org/10.1109/eScience.2018.00019
Mecum, Bryce ; Jones, Matthew B. ; Vieglais, Dave ; Willis, Craig. / Preserving reproducibility : Provenance and executable containers in DataONE data packages. Proceedings - IEEE 14th International Conference on eScience, e-Science 2018. Institute of Electrical and Electronics Engineers Inc., 2018. pp. 45-49 (Proceedings - IEEE 14th International Conference on eScience, e-Science 2018).
@inproceedings{767cfe36e4f940fab44add8d1ff70a27,
title = "Preserving reproducibility: Provenance and executable containers in DataONE data packages",
abstract = "Many data packaging standards are available to researchers and data repository operators and the choice to use an existing standard or create a new one is challenging. We introduce the DataONE Data Package standard which is based on the existing OAI-ORE Resource Map standard. We describe the functionality Data Package provides, implementation considerations, compare it to existing standards, and discuss future extensions to the standard including the ability to describe execution environments via WholeTale 'Tales'' and alternate serialization formats.",
keywords = "Data packaging, DataONE, OAI-ORE, Reproducibility, Standards, WholeTale",
author = "Bryce Mecum and Jones, {Matthew B.} and Dave Vieglais and Craig Willis",
year = "2018",
month = "12",
day = "24",
doi = "10.1109/eScience.2018.00019",
language = "English (US)",
series = "Proceedings - IEEE 14th International Conference on eScience, e-Science 2018",
publisher = "Institute of Electrical and Electronics Engineers Inc.",
pages = "45--49",
booktitle = "Proceedings - IEEE 14th International Conference on eScience, e-Science 2018",
address = "United States",

}

TY - GEN

T1 - Preserving reproducibility

T2 - Provenance and executable containers in DataONE data packages

AU - Mecum, Bryce

AU - Jones, Matthew B.

AU - Vieglais, Dave

AU - Willis, Craig

PY - 2018/12/24

Y1 - 2018/12/24

N2 - Many data packaging standards are available to researchers and data repository operators and the choice to use an existing standard or create a new one is challenging. We introduce the DataONE Data Package standard which is based on the existing OAI-ORE Resource Map standard. We describe the functionality Data Package provides, implementation considerations, compare it to existing standards, and discuss future extensions to the standard including the ability to describe execution environments via WholeTale 'Tales'' and alternate serialization formats.

AB - Many data packaging standards are available to researchers and data repository operators and the choice to use an existing standard or create a new one is challenging. We introduce the DataONE Data Package standard which is based on the existing OAI-ORE Resource Map standard. We describe the functionality Data Package provides, implementation considerations, compare it to existing standards, and discuss future extensions to the standard including the ability to describe execution environments via WholeTale 'Tales'' and alternate serialization formats.

KW - Data packaging

KW - DataONE

KW - OAI-ORE

KW - Reproducibility

KW - Standards

KW - WholeTale

UR - http://www.scopus.com/inward/record.url?scp=85061393184&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=85061393184&partnerID=8YFLogxK

U2 - 10.1109/eScience.2018.00019

DO - 10.1109/eScience.2018.00019

M3 - Conference contribution

AN - SCOPUS:85061393184

T3 - Proceedings - IEEE 14th International Conference on eScience, e-Science 2018

SP - 45

EP - 49

BT - Proceedings - IEEE 14th International Conference on eScience, e-Science 2018

PB - Institute of Electrical and Electronics Engineers Inc.

ER -