Direct coevolutionary couplings reflect biophysical residue interactions in proteins

  • Alice Coucke
  • , Guido Uguzzoni
  • , Francesco Oteri
  • , Simona Cocco
  • , Remi Monasson
  • , Martin Weigt

Research output: Contribution to journalArticlepeer-review

Abstract

Coevolution of residues in contact imposes strong statistical constraints on the sequence variability between homologous proteins. Direct-Coupling Analysis (DCA), a global statistical inference method, successfully models this variability across homologous protein families to infer structural information about proteins. For each residue pair, DCA infers 21 × 21 matrices describing the coevolutionary coupling for each pair of amino acids (or gaps). To achieve the residue-residue contact prediction, these matrices are mapped onto simple scalar parameters; the full information they contain gets lost. Here, we perform a detailed spectral analysis of the coupling matrices resulting from 70 protein families, to show that they contain quantitative information about the physico-chemical properties of amino-acid interactions. Results for protein families are corroborated by the analysis of synthetic data from lattice-protein models, which emphasizes the critical effect of sampling quality and regularization on the biochemical features of the statistical coupling matrices.

Original languageEnglish
Article number174102
JournalJournal of Chemical Physics
Volume145
Issue number17
DOIs
Publication statusPublished - 7 Nov 2016

Fingerprint

Dive into the research topics of 'Direct coevolutionary couplings reflect biophysical residue interactions in proteins'. Together they form a unique fingerprint.

Cite this