TY - GEN
T1 - Old Dog, New Tricks
T2 - 29th International Conference on Research in Computational Molecular Biology, RECOMB 2025
AU - Boury, Théo
AU - Sidl, Leonhard
AU - Hofacker, Ivo L.
AU - Ponty, Yann
AU - Yao, Hua Ting
N1 - Publisher Copyright:
© The Author(s), under exclusive license to Springer Nature Switzerland AG 2025.
PY - 2025/1/1
Y1 - 2025/1/1
N2 - The Inverse Folding problem involves identifying RNA sequences that adopt a target structure with respect to free-energy minimization, i.e., preferential to all alternative structures. The problem has historically been regarded as challenging, largely due to its proven NP-completeness of an extended version where the base pair maximization energy model is used. In contrast, it has recently been shown that a large subset called m-separable structures, notably including those comprising helices of length 3+, can be solved in linear time within the same energy model. This permits not only the identification of a single solution but also the characterization of a language of solutions. In this work, we seek to describe the “hardness” of Inverse Folding, bridging (at least heuristically) the gap between a simplified energy model and a more realistic Turner energy model. We used LinearBPDesign to generate seed sequences for RNAinverse, thereby improving the design process in a Turner energy model. To this end, we extended LinearBPDesign to accommodate biseparability and to handle non- or high modulo separable structures by minimalist addition of base pairs. Our study suggests that seeds generated by LinearBPDesign capture long-range interactions, thereby improving the performance of RNAinverse compared to seed focusing on refining the energy model itself. Most surprisingly, a significant number of LinearBPDesign seeds uniquely fold into the target structure in the Turner model, especially when helices are at least of length 2. This observation suggests that the “hardness” of design may arise from the intrinsic properties of the structures themselves.
AB - The Inverse Folding problem involves identifying RNA sequences that adopt a target structure with respect to free-energy minimization, i.e., preferential to all alternative structures. The problem has historically been regarded as challenging, largely due to its proven NP-completeness of an extended version where the base pair maximization energy model is used. In contrast, it has recently been shown that a large subset called m-separable structures, notably including those comprising helices of length 3+, can be solved in linear time within the same energy model. This permits not only the identification of a single solution but also the characterization of a language of solutions. In this work, we seek to describe the “hardness” of Inverse Folding, bridging (at least heuristically) the gap between a simplified energy model and a more realistic Turner energy model. We used LinearBPDesign to generate seed sequences for RNAinverse, thereby improving the design process in a Turner energy model. To this end, we extended LinearBPDesign to accommodate biseparability and to handle non- or high modulo separable structures by minimalist addition of base pairs. Our study suggests that seeds generated by LinearBPDesign capture long-range interactions, thereby improving the performance of RNAinverse compared to seed focusing on refining the energy model itself. Most surprisingly, a significant number of LinearBPDesign seeds uniquely fold into the target structure in the Turner model, especially when helices are at least of length 2. This observation suggests that the “hardness” of design may arise from the intrinsic properties of the structures themselves.
KW - Dynamic programming
KW - RNA design
KW - RNA secondary structure
KW - Sampling
UR - https://www.scopus.com/pages/publications/105004253810
U2 - 10.1007/978-3-031-90252-9_9
DO - 10.1007/978-3-031-90252-9_9
M3 - Conference contribution
AN - SCOPUS:105004253810
SN - 9783031902512
T3 - Lecture Notes in Computer Science
SP - 134
EP - 152
BT - Research in Computational Molecular Biology - 29th International Conference, RECOMB 2025, Proceedings
A2 - Sankararaman, Sriram
PB - Springer Science and Business Media Deutschland GmbH
Y2 - 26 April 2025 through 29 April 2025
ER -