Computing environments for reproducibility

Capturing the “Whole Tale”

Adam Brinckman, Kyle Chard, Niall Gaffney, Mihael Hategan, Matthew B. Jones, Kacper Kowalik, Sivakumar Kulasekaran, Bertram Ludaescher, Bryce D. Mecum, Jarek Nabrzyski, Victoria Stodden, Ian J. Taylor, Matthew J Turk, Kandace Turner

Research output: Contribution to journal › Article

Abstract

The act of sharing scientific knowledge is rapidly evolving away from traditional articles and presentations to the delivery of executable objects that integrate the data and computational details (e.g., scripts and workflows) upon which the findings rely. This envisioned coupling of data and process is essential to advancing science but faces technical and institutional barriers. The Whole Tale project aims to address these barriers by connecting computational, data-intensive research efforts with the larger research process—transforming the knowledge discovery and dissemination process into one where data products are united with research articles to create “living publications” or tales. The Whole Tale focuses on the full spectrum of science, empowering users in the long tail of science, and power users with demands for access to big data and compute resources. We report here on the design, architecture, and implementation of the Whole Tale environment.

Original language: English (US)
Pages (from-to): 854-867
Number of pages: 14
Journal: Future Generation Computer Systems
Volume: 94
DOI: https://doi.org/10.1016/j.future.2017.12.029
State: Published - May 1 2019

Fingerprint

Data mining
Big data

Keywords

  • Code sharing
  • Data sharing
  • Living publications
  • Provenance
  • Reproducibility

ASJC Scopus subject areas

  • Software
  • Hardware and Architecture
  • Computer Networks and Communications

Cite this

Brinckman, A., Chard, K., Gaffney, N., Hategan, M., Jones, M. B., Kowalik, K., ... Turner, K. (2019). Computing environments for reproducibility: Capturing the “Whole Tale”. Future Generation Computer Systems, 94, 854-867. https://doi.org/10.1016/j.future.2017.12.029

@article{7aca5e217b824ea6943d788260779bbf,
title = "Computing environments for reproducibility: Capturing the “Whole Tale”",
abstract = "The act of sharing scientific knowledge is rapidly evolving away from traditional articles and presentations to the delivery of executable objects that integrate the data and computational details (e.g., scripts and workflows) upon which the findings rely. This envisioned coupling of data and process is essential to advancing science but faces technical and institutional barriers. The Whole Tale project aims to address these barriers by connecting computational, data-intensive research efforts with the larger research process—transforming the knowledge discovery and dissemination process into one where data products are united with research articles to create “living publications” or tales. The Whole Tale focuses on the full spectrum of science, empowering users in the long tail of science, and power users with demands for access to big data and compute resources. We report here on the design, architecture, and implementation of the Whole Tale environment.",
keywords = "Code sharing, Data sharing, Living publications, Provenance, Reproducibility",
author = "Adam Brinckman and Kyle Chard and Niall Gaffney and Mihael Hategan and Jones, {Matthew B.} and Kacper Kowalik and Sivakumar Kulasekaran and Bertram Ludaescher and Mecum, {Bryce D.} and Jarek Nabrzyski and Victoria Stodden and Taylor, {Ian J.} and Turk, {Matthew J} and Kandace Turner",
year = "2019",
month = "5",
day = "1",
doi = "10.1016/j.future.2017.12.029",
language = "English (US)",
volume = "94",
pages = "854--867",
journal = "Future Generation Computer Systems",
issn = "0167-739X",
publisher = "Elsevier",

}

TY - JOUR

T1 - Computing environments for reproducibility

T2 - Capturing the “Whole Tale”

AU - Brinckman, Adam

AU - Chard, Kyle

AU - Gaffney, Niall

AU - Hategan, Mihael

AU - Jones, Matthew B.

AU - Kowalik, Kacper

AU - Kulasekaran, Sivakumar

AU - Ludaescher, Bertram

AU - Mecum, Bryce D.

AU - Nabrzyski, Jarek

AU - Stodden, Victoria

AU - Taylor, Ian J.

AU - Turk, Matthew J

AU - Turner, Kandace

PY - 2019/5/1

Y1 - 2019/5/1

N2 - The act of sharing scientific knowledge is rapidly evolving away from traditional articles and presentations to the delivery of executable objects that integrate the data and computational details (e.g., scripts and workflows) upon which the findings rely. This envisioned coupling of data and process is essential to advancing science but faces technical and institutional barriers. The Whole Tale project aims to address these barriers by connecting computational, data-intensive research efforts with the larger research process—transforming the knowledge discovery and dissemination process into one where data products are united with research articles to create “living publications” or tales. The Whole Tale focuses on the full spectrum of science, empowering users in the long tail of science, and power users with demands for access to big data and compute resources. We report here on the design, architecture, and implementation of the Whole Tale environment.


KW - Code sharing

KW - Data sharing

KW - Living publications

KW - Provenance

KW - Reproducibility

UR - http://www.scopus.com/inward/record.url?scp=85042565331&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=85042565331&partnerID=8YFLogxK

U2 - 10.1016/j.future.2017.12.029

DO - 10.1016/j.future.2017.12.029

M3 - Article

VL - 94

SP - 854

EP - 867

JO - Future Generation Computer Systems

JF - Future Generation Computer Systems

SN - 0167-739X

ER -