Preferential formation of Z-RNA over intercalated motifs in long noncoding RNA

  1. Nicole M. Smith1
  1. 1School of Molecular Sciences, The University of Western Australia, Crawley, Western Australia 6009, Australia;
  2. 2Laboratoire d'Optique et Biosciences, École Polytechnique, CNRS, INSERM, Institut Polytechnique de Paris, 91120 Palaiseau, France
  • Corresponding author: nicole.smith{at}uwa.edu.au
  • Abstract

    Secondary structure is a principal determinant of lncRNA function, predominantly regarding scaffold formation and interfaces with target molecules. Noncanonical secondary structures that form in nucleic acids have known roles in regulating gene expression and include G-quadruplexes (G4s), intercalated motifs (iMs), and R-loops (RLs). In this paper, we used the computational tools G4-iM Grinder and QmRLFS-finder to predict the formation of each of these structures throughout the lncRNA transcriptome in comparison to protein-coding transcripts. The importance of the predicted structures in lncRNAs in biological contexts was assessed by combining our results with publicly available lncRNA tissue expression data followed by pathway analysis. The formation of predicted G4 (pG4) and iM (piM) structures in select lncRNA sequences was confirmed in vitro using biophysical experiments under near-physiological conditions. We find that the majority of the tested pG4s form highly stable G4 structures, and identify many previously unreported G4s in biologically important lncRNAs. In contrast, none of the piM sequences are able to form iM structures, consistent with the idea that RNA is unable to form stable iMs. Unexpectedly, these C-rich sequences instead form Z-RNA structures, which have not been previously observed in regions containing cytosine repeats and represent an interesting and underexplored target for protein–RNA interactions. Our results highlight the prevalence and potential structure-associated functions of noncanonical secondary structures in lncRNAs, and show G4 and Z-RNA structure formation in many lncRNA sequences for the first time, furthering the understanding of the structure–function relationship in lncRNAs.

    Footnotes

    • Received June 30, 2023.
    • Accepted January 31, 2024.

    This article is distributed exclusively by Cold Spring Harbor Laboratory Press for the first six months after the full-issue publication date (see https://genome.cshlp.org/site/misc/terms.xhtml). After six months, it is available under a Creative Commons License (Attribution-NonCommercial 4.0 International), as described at http://creativecommons.org/licenses/by-nc/4.0/.

    | Table of Contents

    Preprint Server