Simultaneous alignment and folding of protein sequences

  • Jérǒme Waldispühl
  • , Charles W. O'Donnell
  • , Sebastian Will
  • , Srinivas Devadas
  • , Rolf Backofen
  • , Bonnie Berger

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

Accurate comparative analysis tools for low-homology proteins remains a difficult challenge in computational biology, especially sequence alignment and consensus folding problems. We present partiFold-Align, the first algorithm for simultaneous alignment and consensus folding of unaligned protein sequences; the algorithm's complexity is polynomial in time and space. Algorithmically, partiFold-Align exploits sparsity in the set of super-secondary structure pairings and alignment candidates to achieve an effectively cubic running time for simultaneous pairwise alignment and folding. We demonstrate the efficacy of these techniques on transmembrane β-barrel proteins, an important yet difficult class of proteins with few known three-dimensional structures. Testing against structurally derived sequence alignments, partiFold-Align significantly outperforms state-of-the-art pairwise sequence alignment tools in themost difficult low sequence homology case and improves secondary structure prediction where current approaches fail. Importantly, partiFold-Align requires no prior training. These general techniques are widely applicable to many more protein families. partiFold-Align is available at http://partiFold.csail.mit.edu.

Original languageEnglish
Title of host publicationResearch in Computational Molecular Biology - 13th Annual International Conference, RECOMB 2009, Proceedings
Pages339-355
Number of pages17
DOIs
Publication statusPublished - 17 Jul 2009
Externally publishedYes
Event13th Annual International Conference on Research in Computational Molecular Biology, RECOMB 2009 - Tucson, AZ, United States
Duration: 18 May 200921 May 2009

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume5541 LNBI
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Conference

Conference13th Annual International Conference on Research in Computational Molecular Biology, RECOMB 2009
Country/TerritoryUnited States
CityTucson, AZ
Period18/05/0921/05/09

Fingerprint

Dive into the research topics of 'Simultaneous alignment and folding of protein sequences'. Together they form a unique fingerprint.

Cite this