H3AGWAS: a portable workflow for genome wide association studies

Jean Tristan Brandenburg, Lindsay Clark, Gerrit Botha, Sumir Panji, Shakuntala Baichoo, Christopher Fields, Scott Hazelhurst

Research output: Contribution to journalArticlepeer-review

Abstract

Background: Genome-wide association studies (GWAS) are a powerful method to detect associations between variants and phenotypes. A GWAS requires several complex computations with large data sets, and many steps may need to be repeated with varying parameters. Manual running of these analyses can be tedious, error-prone and hard to reproduce. Results: The H3AGWAS workflow from the Pan-African Bioinformatics Network for H3Africa is a powerful, scalable and portable workflow implementing pre-association analysis, implementation of various association testing methods and post-association analysis of results. Conclusions: The workflow is scalable—laptop to cluster to cloud (e.g., SLURM, AWS Batch, Azure). All required software is containerised and can run under Docker or Singularity.

Original languageEnglish (US)
Article number498
JournalBMC bioinformatics
Volume23
Issue number1
DOIs
StatePublished - Dec 2022

Keywords

  • Association testing
  • Docker
  • Genome-wide association study
  • Nextflow
  • Pipeline
  • Post-association analysis
  • Quality control
  • Singularity
  • Workflow

ASJC Scopus subject areas

  • Structural Biology
  • Biochemistry
  • Molecular Biology
  • Computer Science Applications
  • Applied Mathematics

Fingerprint

Dive into the research topics of 'H3AGWAS: a portable workflow for genome wide association studies'. Together they form a unique fingerprint.

Cite this