OptZConfig: Efficient Parallel Optimization of Lossy Compression Configuration

Robert Underwood, Jon C. Calhoun, Sheng Di, Amy Apon, Franck Cappello

Research output: Contribution to journalArticlepeer-review

Abstract

Lossless compressors have very low compression ratios that do not meet the needs of today's large-scale scientific applications that produce vast volumes of data. Error-bounded lossy compression (EBLC) is considered a critical technique for the success of scientific research. Although EBLC allows users to set an error bound for the compression, users have been unable to specify the requirements on the compression quality, limiting practical use. Our contributions are: (1) We formulate the problem of configuring EBLC to preserve a user-defined metric as an optimization problem. This allows many classes of new metrics to be preserved, which improves over current practices. (2) We present a framework, OptZConfig, that can adapt to improvements in the search algorithm, compressor, and metrics with minimal changes, enabling future advancements in this area. (3) We demonstrate the advantages of our approach against the leading methods to configure compressors to preserve specific metrics. Our approach improves compression ratios against a specialized compressor by up to $3\times$3×, has a 56× speedup over FRaZ, 1000× speedup over MGARD-QOI post tuning, and 110× speedup over systematic approaches which had not been bounded by compressors before.

Original languageEnglish (US)
Pages (from-to)3505-3519
Number of pages15
JournalIEEE Transactions on Parallel and Distributed Systems
Volume33
Issue number12
DOIs
StatePublished - Dec 1 2022
Externally publishedYes

Keywords

  • Error bounded lossy compression
  • LibPressio
  • non-linear optimization
  • parallel computing

ASJC Scopus subject areas

  • Signal Processing
  • Hardware and Architecture
  • Computational Theory and Mathematics

Fingerprint

Dive into the research topics of 'OptZConfig: Efficient Parallel Optimization of Lossy Compression Configuration'. Together they form a unique fingerprint.

Cite this