RNA Inverse Folding Can Be Solved in Linear Time for Structures Without Isolated Stacks or Base Pairs

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

Inverse folding is a classic instance of negative RNA design which consists in finding a sequence that uniquely folds into a target secondary structure with respect to energy minimization. A breakthrough result of Bonnet et al. shows that, even in simple base pairs-based (BP) models, the decision version of a mildly constrained version of inverse folding is NP-hard. In this work, we show that inverse folding can be solved in linear time for a large collection of targets, including every structure that contains no isolated BP and no isolated stack (or, equivalently, when all helices consist of 3+ base pairs). For structures featuring shorter helices, our linear algorithm is no longer guaranteed to produce a solution, but still does so for a large proportion of instances. Our approach introduces a notion of modulo m-separability, generalizing a property pioneered by Hales et al. Separability is a sufficient condition for the existence of a solution to the inverse folding problem. We show that, for any input secondary structure of length n, a modulo m-separated sequence can be produced in time O(n2m) anytime such a sequence exists. Meanwhile, we show that any structure consisting of 3+ base pairs is either trivially non-designable, or always admits a modulo-2 separated solution (m = 2). Solution sequences can thus be produced in linear time, and even be uniformly generated within the set of modulo-2 separable sequences.

Original languageEnglish
Title of host publication24th International Workshop on Algorithms in Bioinformatics, WABI 2024
EditorsSolon P. Pissis, Wing-Kin Sung
PublisherSchloss Dagstuhl- Leibniz-Zentrum fur Informatik GmbH, Dagstuhl Publishing
ISBN (Electronic)9783959773409
DOIs
Publication statusPublished - 1 Aug 2024
Event24th International Workshop on Algorithms in Bioinformatics, WABI 2024 - London, United Kingdom
Duration: 2 Sept 20244 Sept 2024

Publication series

NameLeibniz International Proceedings in Informatics, LIPIcs
Volume312
ISSN (Print)1868-8969

Conference

Conference24th International Workshop on Algorithms in Bioinformatics, WABI 2024
Country/TerritoryUnited Kingdom
CityLondon
Period2/09/244/09/24

Keywords

  • Parameterized Complexity
  • RNA structure
  • String Design
  • Uniform Sampling

Fingerprint

Dive into the research topics of 'RNA Inverse Folding Can Be Solved in Linear Time for Structures Without Isolated Stacks or Base Pairs'. Together they form a unique fingerprint.

Cite this