Sequoia: An Accessible and Extensible Framework for Privacy-Preserving Machine Learning over Distributed Data

Kaiqiang Xu, Di Chai, Junxue Zhang, Fan Lai, Kai Chen

Research output: Contribution to journalArticlepeer-review

Abstract

Privacy-preserving machine learning (PPML) algorithms use secure computation protocols to allow multiple data parties to collaboratively train machine learning (ML) models while maintaining their data confidentiality. However, current PPML frameworks couple secure protocols with ML models in PPML algorithm implementations, making it challenging for non-experts to develop and optimize PPML applications, limiting their accessibility and performance.We propose Sequoia, a novel PPML framework that decouples ML models and secure protocols to optimize the development and execution of PPML applications across data parties. Sequoia offers JAX-compatible APIs for users to program their ML models, while using a compiler-executor architecture to automatically apply PPML algorithms and system optimizations for model execution over distributed data. The compiler in Sequoia incorporates cross-party PPML processes into user-defined ML models by transparently adding computation, encryption, and communication steps with extensible policies, and the executor efficiently schedules code execution across multiple data parties, considering data dependencies and device heterogeneity.Compared to existing PPML frameworks, Sequoia requires 6492 and achieves 88% speedup of training throughput in horizontal PPML.
Original languageEnglish (US)
JournalProceedings of the ACM on Management of Data
Volume3
Issue number1
DOIs
StatePublished - Feb 2025

Keywords

  • compiler-executor architecture
  • distributed data
  • privacy-preserving machine learning (PPML)
  • secure computation
  • training throughput

Fingerprint

Dive into the research topics of 'Sequoia: An Accessible and Extensible Framework for Privacy-Preserving Machine Learning over Distributed Data'. Together they form a unique fingerprint.

Cite this