Unbiased, scalable sampling of protein loop conformations from probabilistic priors

Yajia Zhang, Kris Hauser

Research output: Contribution to journalArticlepeer-review

Abstract

Background: Protein loops are flexible structures that are intimately tied to function, but understanding loop motion and generating loop conformation ensembles remain significant computational challenges. Discrete search techniques scale poorly to large loops, optimization and molecular dynamics techniques are prone to local minima, and inverse kinematics techniques can only incorporate structural preferences in adhoc fashion. This paper presents Sub-Loop Inverse Kinematics Monte Carlo (SLIKMC), a new Markov chain Monte Carlo algorithm for generating conformations of closed loops according to experimentally available, heterogeneous structural preferences. Results: Our simulation experiments demonstrate that the method computes high-scoring conformations of large loops (>10 residues) orders of magnitude faster than standard Monte Carlo and discrete search techniques. Two new developments contribute to the scalability of the new method. First, structural preferences are specified via a probabilistic graphical model (PGM) that links conformation variables, spatial variables (e.g., atom positions), constraints and prior information in a unified framework. The method uses a sparse PGM that exploits locality of interactions between atoms and residues. Second, a novel method for sampling sub-loops is developed to generate statistically unbiased samples of probability densities restricted by loop-closure constraints. Conclusion: Numerical experiments confirm that SLIKMC generates conformation ensembles that are statistically consistent with specified structural preferences. Protein conformations with 100+ residues are sampled on standard PC hardware in seconds. Application to proteins involved in ion-binding demonstrate its potential as a tool for loop ensemble generation and missing structure completion.

Original languageEnglish (US)
Article numberS9
JournalBMC Structural Biology
Volume13
Issue numberSUPPL.1
DOIs
StatePublished - 2013
Externally publishedYes

Keywords

  • Conformation sampling
  • Monte Carlo methods
  • ensemble generation
  • graphical models
  • protein loops

ASJC Scopus subject areas

  • Structural Biology

Fingerprint

Dive into the research topics of 'Unbiased, scalable sampling of protein loop conformations from probabilistic priors'. Together they form a unique fingerprint.

Cite this