AIM: AN ABSTRACTION for IMPROVING MACHINE LEARNING PREDICTION

Victoria Stodden, Xiaomian Wu, Vanessa Sochat

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

We introduce a structured and portable Abstraction for Improving Machine learning (AIM) to improve prediction outcomes and enable meaningful comparisons of ML pipelines. We implement AIM for a well-known acute leukemia classification problem using the Scientific Filesystem, enabling direct performance comparisons across a variety of classifiers. AIM provides three direct efficiency benefits: 1) the sources of performance differences between ML pipelines can identified at the algorithm implementation level as defined by the AIM, 2) improvements can be made to specific aspects of the pipeline and thus better understood, and 3) the reuse of these defined abstraction components across different pipelines is facilitated. When the AIM is defined at the outset of the prediction challenge, these benefits can come at minimal cost. We show these benefits by implementing AIM and the Scientific Filesystem on the well-known Golub AML/ALL cancer dataset.

Original languageEnglish (US)
Title of host publication2018 IEEE Data Science Workshop, DSW 2018 - Proceedings
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages150-154
Number of pages5
ISBN (Print)9781538644102
DOIs
StatePublished - Aug 17 2018
Event2018 IEEE Data Science Workshop, DSW 2018 - Lausanne, Switzerland
Duration: Jun 4 2018Jun 6 2018

Publication series

Name2018 IEEE Data Science Workshop, DSW 2018 - Proceedings

Other

Other2018 IEEE Data Science Workshop, DSW 2018
Country/TerritorySwitzerland
CityLausanne
Period6/4/186/6/18

Keywords

  • Scientific Filesystem
  • containers
  • cyberinfrastructure
  • machine learning
  • programming abstraction
  • reproducible research

ASJC Scopus subject areas

  • Artificial Intelligence
  • Safety, Risk, Reliability and Quality
  • Water Science and Technology
  • Control and Optimization

Fingerprint

Dive into the research topics of 'AIM: AN ABSTRACTION for IMPROVING MACHINE LEARNING PREDICTION'. Together they form a unique fingerprint.

Cite this