Sparsification enables predicting kissing hairpin pseudoknot structures of long RNAs in practice

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

While computational RNA secondary structure prediction is an important tool in RNA research, it is still fundamentally limited to pseudoknot-free structures (or at best very simple pseudoknots) in practice. Here, we make the prediction of complex pseudoknots - including kissing hairpin structures - practically applicable by reducing the originally high space consumption. For this aim, we apply the technique of sparsification and other space-saving modifications to the recurrences of the pseudoknot prediction algorithm by Chen, Condon and Jabbari (CCJ algorithm). Thus, the theoretical space complexity of free energy minimization is reduced to -(n3 + Z), in the sequence length n and the number of non-optimally decomposable fragments ("candidates") Z. The sparsified CCJ algorithm, sparseCCJ, is presented in detail. Moreover, we provide and compare three generations of CCJ implementations, which continuously improve the space requirements: the original CCJ implementation, our first modified implementation, and our final sparsified implementation. The two latest implementations implement the established HotKnots DP09 energy model. In our experiments, using 244GB of RAM, the original CCJ implementation failed to handle sequences longer than 195 bases; sparseCCJ handles our pseudoknot data set (up to about length 400 bases) in this space limit. All three CCJ implementations are available at https://github.com/HosnaJabbari/CCJ.

Original languageEnglish
Title of host publication17th International Workshop on Algorithms in Bioinformatics, WABI 2017
EditorsKnut Reinert, Russell Schwartz
PublisherSchloss Dagstuhl- Leibniz-Zentrum fur Informatik GmbH, Dagstuhl Publishing
ISBN (Electronic)9783959770507
DOIs
Publication statusPublished - 1 Aug 2017
Externally publishedYes
Event17th International Workshop on Algorithms in Bioinformatics, WABI 2017 - Boston, United States
Duration: 21 Aug 201723 Aug 2017

Publication series

NameLeibniz International Proceedings in Informatics, LIPIcs
Volume88
ISSN (Print)1868-8969

Conference

Conference17th International Workshop on Algorithms in Bioinformatics, WABI 2017
Country/TerritoryUnited States
CityBoston
Period21/08/1723/08/17

Keywords

  • Pseudoknots
  • RNA
  • Secondary structure prediction
  • Space efficiency
  • Sparsification

Fingerprint

Dive into the research topics of 'Sparsification enables predicting kissing hairpin pseudoknot structures of long RNAs in practice'. Together they form a unique fingerprint.

Cite this