Proof-of-Contribution-Based Design for Collaborative Machine Learning on Blockchain

Baturalp Buyukates, Chaoyang He, Shanshan Han, Zhiyong Fang, Yupeng Zhang, Jieyi Long, Ali Farahanchi, Salman Avestimehr

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

We consider a project (model) owner that would like to train a model by utilizing the local private data and compute power of interested data owners, i.e., trainers. Our goal is to design a data marketplace for such decentralized collaborative/federated learning applications that simultaneously provides i) proof-of-contribution based reward allocation so that the trainers are compensated based on their contributions to the trained model; ii) privacy-preserving decentralized model training by avoiding any data movement from data owners; iii) robustness against malicious parties (e.g., trainers aiming to poison the model); iv) verifiability in the sense that the integrity, i.e., correctness, of all computations in the data market protocol including contribution assessment and outlier detection are verifiable through zero-knowledge proofs; and v) efficient and universal design. We propose a blockchain-based marketplace design to achieve all five objectives mentioned above. In our design, we utilize a distributed storage infrastructure and an aggregator aside from the project owner and the trainers. The aggregator is a processing node that performs certain computations, including assessing trainer contributions, removing outliers, and updating hyper-parameters. We execute the proposed data market through a blockchain smart contract. The deployed smart contract ensures that the project owner cannot evade payment, and honest trainers are rewarded based on their contributions at the end of training. Finally, we implement the building blocks of the proposed data market and demonstrate their applicability in practical scenarios through extensive experiments.

Original languageEnglish (US)
Title of host publicationProceedings - 2023 IEEE International Conference on Decentralized Applications and Infrastructures, DAPPS 2023
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages13-22
Number of pages10
ISBN (Electronic)9798350335354
DOIs
StatePublished - 2023
Externally publishedYes
Event5th IEEE International Conference on Decentralized Applications and Infrastructures, DAPPS 2023 - Athens, Greece
Duration: Jul 17 2023Jul 20 2023

Publication series

NameProceedings - 2023 IEEE International Conference on Decentralized Applications and Infrastructures, DAPPS 2023

Conference

Conference5th IEEE International Conference on Decentralized Applications and Infrastructures, DAPPS 2023
Country/TerritoryGreece
CityAthens
Period7/17/237/20/23

Keywords

  • blockchain
  • collaborative machine learning
  • contribution assessment
  • data markets
  • zero-knowledge proof

ASJC Scopus subject areas

  • Computer Networks and Communications
  • Computer Science Applications
  • Information Systems
  • Information Systems and Management
  • Safety, Risk, Reliability and Quality
  • Health Informatics

Fingerprint

Dive into the research topics of 'Proof-of-Contribution-Based Design for Collaborative Machine Learning on Blockchain'. Together they form a unique fingerprint.

Cite this