Transactional Python for Durable Machine Learning: Vision, Challenges, and Feasibility

Supawit Chockchowwat, Zhaoheng Li, Yongjoo Park

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

In machine learning (ML), Python serves as a convenient abstraction for working with key libraries such as PyTorch, scikit-learn, and others. Unlike DBMS, however, Python applications may lose important data, such as trained models and extracted features, due to machine failures or human errors, leading to a waste of time and resources. Specifically, they lack four essential properties that could make ML more reliable and user-friendly - -durability, atomicity, replicability, and time-versioning (DART).This paper presents our vision of Transactional Python that provides DART without any code modifications to user programs or the Python kernel, by non-intrusively monitoring application states at the object level and determining a minimal amount of information sufficient to reconstruct a whole application. Our evaluation of a proof-of-concept implementation with public PyTorch and scikit-learn applications shows that DART can be offered with overheads ranging 1.5% - 15.6%.

Original languageEnglish (US)
Title of host publicationProceedings of the 7th Workshop on Data Management for End-To-End Machine Learning, DEEM 2023
PublisherAssociation for Computing Machinery
ISBN (Electronic)9798400702044
DOIs
StatePublished - Jun 18 2023
Event7th Workshop on Data Management for End-To-End Machine Learning, DEEM 2023 - Seattle, United States
Duration: Jun 18 2023 → …

Publication series

NameProceedings of the 7th Workshop on Data Management for End-To-End Machine Learning, DEEM 2023

Conference

Conference7th Workshop on Data Management for End-To-End Machine Learning, DEEM 2023
Country/TerritoryUnited States
CitySeattle
Period6/18/23 → …

ASJC Scopus subject areas

  • Hardware and Architecture
  • Human-Computer Interaction
  • Sociology and Political Science

Fingerprint

Dive into the research topics of 'Transactional Python for Durable Machine Learning: Vision, Challenges, and Feasibility'. Together they form a unique fingerprint.

Cite this