TY - GEN
T1 - Constraint maximal inter-molecular helix lengths within RNA-RNA interaction prediction improves bacterial sRNA target prediction
AU - Gelhausen, Rick
AU - Will, Sebastian
AU - Hofacker, Ivo L.
AU - Backofen, Rolf
AU - Raden, Martin
N1 - Publisher Copyright:
© 2019 by SCITEPRESS - Science and Technology Publications, Lda. All rights reserved.
PY - 2019/1/1
Y1 - 2019/1/1
N2 - Efficient computational tools for the identification of putative target RNAs regulated by prokaryotic sRNAs rely on thermodynamic models of RNA secondary structures. While they typically predict RNA-RNA interaction complexes accurately, they yield many highly-ranked false positives in target screens. One obvious source of this low specificity appears to be the disability of current secondary-structure-based models to reflect steric constraints, which nevertheless govern the kinetic formation of RNA-RNA interactions. For example, often-even thermodynamically favorable-extensions of short initial kissing hairpin interactions are kinetically prohibited, since this would require unwinding of intra-molecular helices as well as sterically impossible bending of the interaction helix. In consequence, the efficient prediction methods, which do not consider such effects, predict over-long helices. To increase the prediction accuracy, we devise a dynamic programming algorithm that length-restricts the runs of consecutive inter-molecular base pairs (perfect canonical stackings), which we hypothesize to implicitely model the steric and kinetic effects. The novel method is implemented by extending the state-of-the-art tool INTARNA. Our comprehensive bacterial sRNA target prediction benchmark demonstrates significant improvements of the prediction accuracy and enables 3-4 times faster computations. These results indicate-supporting our hypothesis-that length-limitations on inter-molecular subhelices increase the accuracy of interaction prediction models compared to the current state-of-the-art approach.
AB - Efficient computational tools for the identification of putative target RNAs regulated by prokaryotic sRNAs rely on thermodynamic models of RNA secondary structures. While they typically predict RNA-RNA interaction complexes accurately, they yield many highly-ranked false positives in target screens. One obvious source of this low specificity appears to be the disability of current secondary-structure-based models to reflect steric constraints, which nevertheless govern the kinetic formation of RNA-RNA interactions. For example, often-even thermodynamically favorable-extensions of short initial kissing hairpin interactions are kinetically prohibited, since this would require unwinding of intra-molecular helices as well as sterically impossible bending of the interaction helix. In consequence, the efficient prediction methods, which do not consider such effects, predict over-long helices. To increase the prediction accuracy, we devise a dynamic programming algorithm that length-restricts the runs of consecutive inter-molecular base pairs (perfect canonical stackings), which we hypothesize to implicitely model the steric and kinetic effects. The novel method is implemented by extending the state-of-the-art tool INTARNA. Our comprehensive bacterial sRNA target prediction benchmark demonstrates significant improvements of the prediction accuracy and enables 3-4 times faster computations. These results indicate-supporting our hypothesis-that length-limitations on inter-molecular subhelices increase the accuracy of interaction prediction models compared to the current state-of-the-art approach.
KW - Canonical Helix
KW - Constrained Helix Length
KW - RNA-RNA Interaction Prediction
KW - Seed
KW - Steric Constraints
UR - https://www.scopus.com/pages/publications/85064642011
U2 - 10.5220/0007689701310140
DO - 10.5220/0007689701310140
M3 - Conference contribution
AN - SCOPUS:85064642011
T3 - BIOINFORMATICS 2019 - 10th International Conference on Bioinformatics Models, Methods and Algorithms, Proceedings; Part of 12th International Joint Conference on Biomedical Engineering Systems and Technologies, BIOSTEC 2019
SP - 131
EP - 140
BT - BIOINFORMATICS 2019 - 10th International Conference on Bioinformatics Models, Methods and Algorithms, Proceedings; Part of 12th International Joint Conference on Biomedical Engineering Systems and Technologies, BIOSTEC 2019
A2 - De Maria, Elisabetta
A2 - Fred, Ana
A2 - Gamboa, Hugo
PB - SciTePress
T2 - 10th International Conference on Bioinformatics Models, Methods and Algorithms, BIOINFORMATICS 2019 - Part of 12th International Joint Conference on Biomedical Engineering Systems and Technologies, BIOSTEC 2019
Y2 - 22 February 2019 through 24 February 2019
ER -