Productive and Performant Generic Lossy Data Compression with LibPressio

Robert Underwood, Victoriana Malvoso, Jon C. Calhoun, Sheng Di, Franck Cappello

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

Abstract

In recent years, lossless and lossy compressors have been developed to cope with the ever-increasing volume of scientific floating-point data. However, not all compression techniques are appropriate for all datasets, and determining which one to use can be time-consuming, requiring code modifications and trial and error. We present LibPressio, a generic library for the compression of dense tensors that minimizes the code changes scientists need to make to take advantage of new and improved compression techniques. We compare LibPressio to 9 different competing libraries and measure the overhead of their design decisions as well as overall run-time overhead, showing insignificant overhead. We further show an improvement in usability, as measured by a 50-90% reduction in lines of code compared to native code. The value of this tool can be seen in its integration into Z-Checker and ADIOS2.

Original language: English (US)
Title of host publication: Proceedings of DRBSD-7 2021
Subtitle of host publication: 7th International Workshop on Data Analysis and Reduction for Big Scientific Data, Held in conjunction with SC 2021: The International Conference for High Performance Computing, Networking, Storage and Analysis
Publisher: Institute of Electrical and Electronics Engineers Inc.
Pages: 1-10
Number of pages: 10
ISBN (Electronic): 9781728186726
State: Published - 2021
Externally published: Yes
Event: 7th International Workshop on Data Analysis and Reduction for Big Scientific Data, DRBSD-7 2021 - St. Louis, United States
Duration: Nov 14 2021 → …

Publication series

Name: Proceedings of DRBSD-7 2021: 7th International Workshop on Data Analysis and Reduction for Big Scientific Data, Held in conjunction with SC 2021: The International Conference for High Performance Computing, Networking, Storage and Analysis

Conference

Conference: 7th International Workshop on Data Analysis and Reduction for Big Scientific Data, DRBSD-7 2021
Country/Territory: United States
City: St. Louis
Period: 11/14/21 → …

Keywords

  • Error Bounded Lossy Compression
  • LibPressio

ASJC Scopus subject areas

  • Artificial Intelligence
  • Information Systems
  • Information Systems and Management
  • Statistics, Probability and Uncertainty
  • Media Technology
