Federated learning for multi-omics: A performance evaluation in Parkinson's disease

Benjamin P. Danek, Mary B. Makarious, Anant Dadu, Dan Vitale, Paul Suhwan Lee, Andrew B. Singleton, Mike A. Nalls, Jimeng Sun, Faraz Faghri

Research output: Contribution to journal › Article › peer-review

Abstract

Although machine learning (ML) research has grown in popularity, its application in the omics domain is constrained by limited access to the sufficiently large, high-quality datasets needed to train ML models. Federated learning (FL) offers an opportunity to enable collaborative curation of such datasets among participating institutions. We compare the simulated performance of several models trained using FL against classically trained ML models on the task of multi-omics Parkinson's disease prediction. We find that FL model performance tracks that of centrally trained ML models: the most performant FL model achieves an AUC-PR of 0.876 ± 0.009, which is 0.014 ± 0.003 lower than its centrally trained counterpart. We also find that the dispersion of samples within a federation plays a meaningful role in model performance. Our study implements several open-source FL frameworks and aims to highlight some of the challenges and opportunities that arise when applying these collaborative methods in multi-omics studies.
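To make the comparison in the abstract concrete, the following is a minimal sketch of federated averaging (FedAvg) set against a centrally trained baseline. It is an illustration only, not the authors' pipeline: the data are synthetic, the model is a plain logistic regression, accuracy stands in for AUC-PR, and every function name (`local_update`, `fedavg`, `train_federated`, `accuracy`) is hypothetical rather than taken from the study or from any FL framework.

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def local_update(w, X, y, lr=0.1, epochs=5):
    """Run a few full-batch gradient steps of logistic regression on one site's data."""
    w = w.copy()
    for _ in range(epochs):
        grad = X.T @ (sigmoid(X @ w) - y) / len(y)
        w -= lr * grad
    return w

def fedavg(client_weights, client_sizes):
    """Average client weight vectors, weighted by each client's sample count."""
    sizes = np.asarray(client_sizes, dtype=float)
    stacked = np.stack(client_weights)
    return (stacked * sizes[:, None]).sum(axis=0) / sizes.sum()

def train_federated(clients, n_features, rounds=20):
    """Each round: every client trains locally, then the server averages the results."""
    w = np.zeros(n_features)
    for _ in range(rounds):
        updates = [local_update(w, X, y) for X, y in clients]
        w = fedavg(updates, [len(y) for _, y in clients])
    return w

def accuracy(w, X, y):
    return float(np.mean((sigmoid(X @ w) > 0.5) == y))

# Synthetic stand-in data (assumption: NOT the multi-omics data used in the paper).
rng = np.random.default_rng(0)
true_w = np.array([2.0, -1.0, 0.5])
X = rng.normal(size=(600, 3))
y = (X @ true_w + rng.normal(scale=0.5, size=600) > 0).astype(float)

# Uneven split across three simulated sites, mimicking a federation.
clients = [(X[:300], y[:300]), (X[300:500], y[300:500]), (X[500:], y[500:])]

w_fed = train_federated(clients, n_features=3)
# Central baseline: same model and same total number of gradient steps on pooled data.
w_central = local_update(np.zeros(3), X, y, epochs=100)

acc_fed = accuracy(w_fed, X, y)
acc_central = accuracy(w_central, X, y)
print(f"federated accuracy: {acc_fed:.3f}, central accuracy: {acc_central:.3f}")
```

Changing how the rows are divided among `clients` (for example, giving one site mostly positive cases) is a simple way to explore how the dispersion of samples within a federation can affect the federated model.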

Original language: English (US)
Article number: 100945
Journal: Patterns
Volume: 5
Issue number: 3
State: Published - Mar 8 2024

Keywords

  • Parkinson's disease diagnosis
  • federated learning
  • machine learning
  • omics data analysis

ASJC Scopus subject areas

  • General Decision Sciences
