The Trouble with Long-Range Base Pairs in RNA Folding

Fabian Amman, Stephan H. Bernhart, Gero Doose, Ivo L. Hofacker, Jing Qin, Peter F. Stadler, Sebastian Will

Research output: Chapter in Book/Report/Conference proceedingArticle in proceedingsResearchpeer-review

Abstract

RNA prediction has long been struggling with long-range base pairs since prediction accuracy decreases with base pair span. We analyze here the empirical distribution of base pair spans in large collection of experimentally known RNA structures. Surprisingly, we find that long-range base pairs are overrepresented in these data. In particular, there is no evidence that long-range base pairs are systematically overpredicted relative to short-range interactions in thermodynamic predictions. This casts doubt on a recent suggestion that kinetic effects are the cause of length-dependent decrease of predictability. Instead of a modification of the energy model we advocate a modification of the expected accuracy model for RNA secondary structures. We demonstrate that the inclusion of a span-dependent penalty leads to improved maximum expected accuracy structure predictions compared to both the standard MEA model and a modified folding algorithm with an energy penalty function. The prevalence of long-range base pairs provide further evidence that RNA structures in general do not have the so-called polymer zeta property. This has consequences for the asymptotic performance for a large class of sparsified RNA folding algorithms.
Original languageEnglish
Title of host publicationAdvances in Bioinformatics and Computational Biology : 8th Brazilian Symposium on Bioinformatics, BSB 2013, Recife, Brazil, November 3-7, 2013, Proceedings
EditorsJoão C. Setubal, Nalvo F. Almeida
Number of pages11
PublisherSpringer
Publication date2013
Pages1-11
DOIs
Publication statusPublished - 2013
Externally publishedYes
Event8th Brazilian Symposium on Bioinformatics - Recife, Brazil
Duration: 3. Nov 20137. Nov 2013

Conference

Conference8th Brazilian Symposium on Bioinformatics
CountryBrazil
CityRecife
Period03/11/201307/11/2013
SeriesLecture Notes in Computer Science
Volume8213
ISSN0302-9743

Keywords

  • RNA folding
  • long-range base pair
  • polymer zeta property
  • prediction accuracy

Fingerprint Dive into the research topics of 'The Trouble with Long-Range Base Pairs in RNA Folding'. Together they form a unique fingerprint.

Cite this