TY - GEN
T1 - Automated Design of Dynamic Programming Schemes for RNA Folding with Pseudoknots
AU - Marchand, Bertrand
AU - Will, Sebastian
AU - Berkemer, Sarah J.
AU - Bulteau, Laurent
AU - Ponty, Yann
N1 - Publisher Copyright:
© Bertrand Marchand, Sebastian Will, Sarah J. Berkemer, Laurent Bulteau, and Yann Ponty.
PY - 2022/9/1
Y1 - 2022/9/1
N2 - Despite being a textbook application of dynamic programming (DP) and routine task in RNA structure analysis, RNA secondary structure prediction remains challenging whenever pseudoknots come into play. To circumvent the NP-hardness of energy minimization in realistic energy models, specialized algorithms have been proposed for restricted conformation classes that capture the most frequently observed configurations. While these methods rely on hand-crafted DP schemes, we generalize and fully automatize the design of DP pseudoknot prediction algorithms. We formalize the problem of designing DP algorithms for an (infinite) class of conformations, modeled by (a finite number of) fatgraphs, and automatically build DP schemes minimizing their algorithmic complexity. We propose an algorithm for the problem, based on the tree-decomposition of a well-chosen representative structure, which we simplify and reinterpret as a DP scheme. The algorithm is fixed-parameter tractable for the tree-width tw of the fatgraph, and its output represents a O(ntw+1) algorithm for predicting the MFE folding of an RNA of length n. Our general framework supports general energy models, partition function computations, recursive substructures and partial folding, and could pave the way for algebraic dynamic programming beyond the context-free case.
AB - Despite being a textbook application of dynamic programming (DP) and routine task in RNA structure analysis, RNA secondary structure prediction remains challenging whenever pseudoknots come into play. To circumvent the NP-hardness of energy minimization in realistic energy models, specialized algorithms have been proposed for restricted conformation classes that capture the most frequently observed configurations. While these methods rely on hand-crafted DP schemes, we generalize and fully automatize the design of DP pseudoknot prediction algorithms. We formalize the problem of designing DP algorithms for an (infinite) class of conformations, modeled by (a finite number of) fatgraphs, and automatically build DP schemes minimizing their algorithmic complexity. We propose an algorithm for the problem, based on the tree-decomposition of a well-chosen representative structure, which we simplify and reinterpret as a DP scheme. The algorithm is fixed-parameter tractable for the tree-width tw of the fatgraph, and its output represents a O(ntw+1) algorithm for predicting the MFE folding of an RNA of length n. Our general framework supports general energy models, partition function computations, recursive substructures and partial folding, and could pave the way for algebraic dynamic programming beyond the context-free case.
KW - RNA folding
KW - dynamic programming
KW - treewidth
UR - https://www.scopus.com/pages/publications/85137814095
U2 - 10.4230/LIPIcs.WABI.2022.7
DO - 10.4230/LIPIcs.WABI.2022.7
M3 - Conference contribution
AN - SCOPUS:85137814095
T3 - Leibniz International Proceedings in Informatics, LIPIcs
BT - 22nd International Workshop on Algorithms in Bioinformatics, WABI 2022
A2 - Boucher, Christina
A2 - Rahmann, Sven
PB - Schloss Dagstuhl- Leibniz-Zentrum fur Informatik GmbH, Dagstuhl Publishing
T2 - 22nd International Workshop on Algorithms in Bioinformatics, WABI 2022
Y2 - 5 September 2022 through 7 September 2022
ER -