Abstract
Centromeres are no longer considered to be silent. Both centromeric and pericentric transcription have been discovered, and their RNA transcripts have been characterized and probed for functions in numerous monocentric model organisms recently. Here, we will discuss the challenges in centromere transcription studies due to the repetitive nature and sequence similarity in centromeric and pericentric regions. Various technological breakthroughs have helped to tackle these challenges and reveal unique features of the centromeres and pericentromeres. We will briefly introduce these techniques, including third-generation long-read DNA and RNA sequencing, protein-DNA and RNA–DNA interaction detection methods, and epigenomic and nucleosomal mapping techniques. Interestingly, some newly analyzed repeat-based holocentromeres also resemble the architecture and the transcription behavior of monocentromeres. We will summarize evidences that support the functions of the transcription process and stalling, and those that support the functions of the centromeric and pericentric RNAs. The processing of centromeric and pericentric RNAs into multiple variants and their diverse structures may also provide clues to their functions. How future studies may address the separation of functions of specific centromeric transcription steps, processing pathways, and the transcripts themselves will also be discussed.
Similar content being viewed by others
Introduction
Centromere function
Historically, the centromere is often recognized as the constricted region that “holds” the X-shaped sister chromatids during cell divisions (Flemming 1880). Functionally, the centromere is the chromatin region where the kinetochore complex builds on, to connect chromosomes to microtubules emanated from the centrosomes. The centromere ensures that the sister chromatids and homologous chromosomes are separated equally to the daughter cells during mitosis and meiosis, respectively. Errors in chromosome segregation can cause chromosome breakages, or gains or losses of genetic materials (Potapova & Gorbsky 2017). The consequence is usually catastrophic for the cell and the whole organism, such as infertility, chromosomal abnormality disorders, or aberrant proliferation in cancers (Santaguida & Amon 2015; Smurova & De Wulf 2018).
Centromere paradox and architectures
The conserved function of the centromere is contradicted with the diverse DNA sequences, sizes, and even architectures of the centromere across eukaryotes, and this phenomenon is called “centromere paradox” (Henikoff et al. 2001). There are two major architecture of centromeres: monocentromeres and holocentromeres (Wong et al. 2020). Monocentromeres, in which a single region of the chromosome is designated for the centromere function, are the most common architecture of centromeres in studied eukaryotes. Monocentromeres can be further subdivided into regional and point monocentromeres. The sizes of regional monocentromeres range from kilobases to megabases, and are mostly made up of AT-rich DNA (Baker & Rogers 2005; Barbosa et al. 2022), tandem repeats, called satellites, or transposable elements, as observed in many model organisms, including fission yeast, Arabidopsis, rice, flies, frogs, chicken, mice, tammar wallaby, and human cells (Hartley & O’Neill 2019; Shannon M McNulty & Sullivan 2018). The core centromere region of regional monocentromeres is flanked by pericentric, heterochromatic regions which are important for the cohesion of sister chromatids or homologous chromosomes during mitosis and meiotic reductional division, respectively. Meta-polycentromeres are monocentromeres that have distinct regions of centromeres believed to function together, forming an elongated primary constriction, as observed in Pisum and Lathyrus species (Schubert et al. 2020). Point monocentromeres refer to short centromeres in budding yeast, such as ~ 125 bp in Saccharomyces cerevisiae, which consists of 3 conserved elements (CDEI, II, and III), in which CDEII is AT-rich while CDEI and III are palindromic but are not repetitive (Clarke & Carbon 1980). Holocentromeres, in which the centromere is diffused along the length of the mitotic chromosomes, are observed in some plants, insects, and nematodes. Holocentromeres can be formed on the centromere-specific satellite family, called Tyba, and the centromeric retrotransposons (CRRh), as observed in Rhynchospora pubera (Marques et al. 2015), or on non-repetitive sequences, as observed in Caenorhabditis elegans (Gassmann et al. 2012; Talbert & Henikoff 2020). Notably, holocentromeres have evolved multiple times from their monocentromere ancestors independently in both animal and plant lineages (Escudero et al. 2016; Melters et al. 2012).
Conserved epigenetic regulation of centromeres
Early mutational studies of centromeric DNA (Carbon & Clarke 1984; Cumberledge & Carbon 1987), centromere inactivation (Ishii et al. 2008; Thakur & Sanyal 2013), and the discoveries of neocentromeres, which are new centromeres formed on non-centromeric DNA sites (Scott & Sullivan 2014; Williams et al. 1998), have shown that centromeric DNA is not necessary or sufficient for centromere function, leading to the suggestion that most centromeres are epigenetically regulated (Allshire & Karpen 2008). One exception described is that of the point monocentromeres in Saccharomyces cerevisiae, as mutations in CDEIII cause centromere and kinetochore malfunction (Carbon & Clarke 1984; Ng et al. 1986). Many species, including Saccharomyces cerevisiae, contain a centromeric-specific histone H3 variant, CENP-A, that replaces canonical H3 at the core centromere, where kinetochore will assemble (Ali-Ahmad et al. 2020; Kixmoeller et al. 2020). Although there are exceptions, such as in silk moths Bombyx mori (Senaratne et al. 2021) and kinetoplastids (Ishii & Akiyoshi 2022), where there are no CENP-A, but they have histone-like CENP-T and unconventional KKT proteins, respectively. Understanding of the epigenetic regulation of centromeres begin with histone variant CENP-A, but later on also include unique centromeric histone modification combinations that are distinct from euchromatin and heterochromatin, DNA methylation pattern, centromeric and pericentric transcription, and centromeric and pericentric RNAs (cenRNAs and pericenRNAs) (Arunkumar and Melters 2020; Bergmann et al. 2012; Chan et al. 2012; Corless et al. 2020; Perea-Resa & Blower 2018; Saffery et al. 2003; Wong et al. 2007).
Centromere is not an inert but an actively transcribed region, despite at a low level
The chromosome constriction site at the monocentromere was originally taken as a transcriptionally inert site. High level of transcription is incompatible with centromere function in budding yeast (Hill et al. 1987) and human artificial chromosome studies (Molina et al. 2016). However, in the last decade, more evidence from numerous studies in monocentromeres, even in the simplest point centromeres in Saccharomyces cerevisiae (Hedouin et al. 2022; Ling & Yuen 2019; Ohkuni & Kitagawa 2012), have shown that both the core centromere region and flanking pericentric regions are transcribed by RNA polymerase II (RNA Pol II) at a relatively low level and in a cell cycle-dependent manner (reviewed in Duda et al. 2017; Perea-Resa & Blower 2018; Smurova & De Wulf 2018). The functions of cenRNAs at the centromere, the inner centromere (the region between the sister chromatids), and the kinetochore have been reviewed thoroughly and recently (Corless et al. 2020). The relationship between cenRNA expression and CENP-A loading time in the cell cycle in different organisms has also been elaborately discussed (Perea-Resa et al. 2018).
Here, we will focus on the discussion of ongoing challenges and recent breakthroughs relevant to studying centromeric transcription, including the repetitive nature of centromeric DNA sequences, the difficulty to distinguish centromeric and pericentric sequences and their corresponding transcription in some species, and the difficulty in separating the function of the centromeric transcription process versus the centromeric RNA transcripts.
Challenges
DNA and thus RNA repetitiveness in many regional monocentromeres
The repetitive problem has been discussed in details (Corless et al. 2020). Mapping of RNAs to unique locations will rely on a complete, reliable genomic DNA reference. Recent studies, based on third-generation long-read DNA sequencing, such as Nanopore and PacBio Single-Molecule Real-Time (SMRT) sequencing, has assembled and polished individual human chromosomes Y and X from telomere-to-telomere using a bacterial artificial chromosome (BAC) vector or a complete hydatidiform mole (CHM)-derived haploid cell line, respectively (Jain et al. 2018; Miga et al. 2020; Nurk et al. 2022). Based on these frameworks, Miga’s group has also revealed the organization and evolutionary patterns of centromeric satellite arrays (Altemose et al. 2022a). Centromeric and pericentric satellites in human megabase-sized centromeres constitute 6% of the genome (Altemose et al. 2022a). Besides the major component, alpha-satellites, which comprised 171-bp monomers, other satellites (e.g., HSat2, HSat3, HSat1, beta-satellites) were also ordered. Different monomer subtypes (e.g., a, b, and c) are linked and repeated to form a higher order repeat (HOR) unit (e.g., abc). Identical HOR units then make up a large, homogenous HOR array (e.g., abc-abc-abc…) with thousands of HOR units. Centromeres from different chromosomes (e.g., Chr1, 5, 19) that are confounded previously can now be resolved based on distinct HOR arrays and chromosome-specific sequence variants, and verified by flow cytometry-sorted chromosomes. This pioneered work, together with direct RNA long-read sequencing (Jiang et al. 2019), opens up new avenues that will allow more precise mapping of long non-coding cenRNAs to the centromere.
In a detailed biochemical study by combining CUT&RUN (cleavage under targets and release using nuclease) of CENP-A/B/C and salt fractionation, Henikoff’s group has observed drastic CENP-A/B/C configuration differences on alpha-satellite dimers belonging to the same alpha-satellite subfamily that contain only 4–12% differences in sequence (Thakur et al. 2018). The configurations include a symmetric complex with equal CENP-A/B/C binding on both monomers of the dimer, or an asymmetric complex preferentially occupying only one monomer of the dimer (Thakur et al. 2018). This result suggests that even slight alpha-satellite sequence differences affect the binding behavior of the associated centromeric complex (Thakur et al. 2018). Their work may help us to correlate the CENP-A occupancy or chromatin states with the cenRNAs based on their sequences.
Alternatively, works from centromeres with unique sequences have circumvented the repetitive problem and provided evidence for centromere transcription. For example, some chicken and potato chromosomes contain repetitive centromere sequences while some do not (Gong et al. 2012; Shang et al. 2010). Interestingly, in chicken meiosis II prophase, centromeric transcription is only observed on the non-repetitive centromeres (Krasikova et al. 2012). In pathogenic yeast Candida albicans, each centromere has a unique central core (Sanyal et al. 2004). The non-heterochromatic pericentric regions of C. albicans contain either long terminal repeats (LTR), inverted repeats, or non-repetitive sequences (Freire-Benéitez et al. 2016). Such chromosome-specific centromere sequences and organization may facilitate the identification of cenRNA in the future. In S. cerevisiae, centromeric transcripts corresponding to each of the 16 chromosomes have been detected, but they are more highly expressed in S phase (Hedouin et al. 2022; Ling & Yuen 2019; Ohkuni & Kitagawa 2012). A targeted RNA isoform long-read sequencing (Iso-seq) by PacBio SMRT with probe-based enrichment has enabled the identification of many centromeric (and pericentric) RNA variants with strand-specificity and co- or post-transcriptional processing information, such as poly-adenylated tails (Hedouin et al. 2022). While the level of centromeric transcripts from each chromosome varies (Hedouin et al. 2022; Ling & Yuen 2019), conversion of all chromosomes’ centromeres to the same centromeric DNA sequence has suggested that the level of centromeric transcripts roughly correlates to the copy number of that specific centromeric DNA sequence in the genome, and they may function in trans, potentially in proximity to the centromere cluster (Jin et al. 1998; Ling & Yuen 2019).
Distinguishing centromeric versus pericentric sequences
In humans, alpha-satellite sequences make up both the core centromeric and most of the flanking pericentric regions. It has long been difficult to differentiate between transcripts derived from these two regions due to their similarity. Analyses of long-read DNA sequencing and existing native (uncrosslinked) chromatin immunoprecipitation followed by DNA sequencing (N-ChIP-seq) and CUT&RUN datasets show that only a subset of human alpha-satellite HOR units bind CENP-A and assemble kinetochore proteins, and are known as the “active” array (340 kb to 4.8 Mb) (Altemose et al. 2022a). Flanking pericentric “inactive” arrays comprised more diverged alpha-satellites, other repeat families, and transposable elements. There is evidence of layered expansion, where new, distinct alpha-satellite repeat emerges and expands within the array to become the site of kinetochore assembly (the active array), whereas older repeats are displaced sideways symmetrically to become the inactive pericentric “layers.” However, there are also individual variations in terms of CENP-A localization on the newer or older repeats, suggesting epigenetic plasticity.
Identification of large structural rearrangements, transposable elements, or gene interspersion within the pericentric regions also help to distinguish among chromosomes (Altemose et al. 2022a), and may assist the annotation of transcribed cenRNAs or pericenRNAs. In the future, techniques that allow analyses of protein-DNA interactions with long-read information, such as DiMeLo-seq, will be useful to decipher the relationship between kinetochore function and underlying DNA sequences, individual variations, and evolutionary trends (Altemose et al. 2022b), and compare to the long RNA reads.
Besides the newly identified alpha-satellite HOR differences between human centromeric and pericentric regions, researchers have elucidated that the chromatin environment in these two regions vary significantly mostly by microscopy approaches. For instance, the chromatin at the core centromere typically contains histone modification marks associated with open chromatin or permissive transcription (Bergmann et al. 2012; Bergmann et al. 2011; Gopalakrishnan et al. 2009; Sullivan et al. 2004). CENP-A nucleosomes within the core centromere are interspersed with canonical histone H3 nucleosomes where H3 tails are modified with H3K4me1, H3K4me2, H3K36me2, and H3K36me3 (Table 1). These modifications are essential for CENP-A chaperone, HJURP, targeting, and CENP-A assembly (Bergmann et al. 2011; Duda et al. 2017). H3K4me2 is a modification associated with open but not active euchromatin (Sullivan et al. 2004; Soares et al. 2017). H3K36me2 is enriched downstream of transcription start sites (Weinberg et al. 2019) and can recruit histone deacetylases (HDAC) enzymes (Li et al. 2009). The core centromere also has reduced CpG methylation, called centromere dip region (CDR), consistent with its open chromatin (Altemose et al. 2022a).
On human artificial chromosomes (HACs), core centromere transcription can promote H3K9ac accumulation, which in turn can prevent heterochromatin formation at this region (Molina et al. 2016). Yet, on the same site, H3K9me3, typically thought to be associated with transcriptional repression, has also been shown to be compatible with the centromeric nucleosomes (Bergmann et al. 2012; Ribeiro et al. 2010). However, in general, the core centromere lacks clear marks for heterochromatin, such as H3K9me2 (Sullivan et al. 2004). A balance between euchromatin and heterochromatin characteristics at the core centromeric region is required for maintaining a low level of centromere transcription, maintaining the centromere identity and forming an active kinetochore (Sullivan et al. 2004). On the other hand, the surrounding pericentric regions are marked by histone modifications typically associated with transcriptional silencing, such as H3K9me2, H3K9me3, H3K27me2, and H3K27me3 (Table 1) (Gopalakrishnan et al. 2009; Kundaje et al. 2015).
At the core centromere, histone H4 tails are also modified. In chicken cells, H4K5ac and H4K12ac, which correlate with transcribed chromatin, are found to be enriched at the core centromere CENP-A pre-nucleosomes and are essential for CENP-A deposition (Shang et al. 2016). H4K20me, which is associated with transcriptional activation, is found in chicken and human CENP-A nucleosomes and is a prerequisite for kinetochore assembly (Bergmann et al. 2011; Hori et al. 2014; Sullivan et al. 2004).
For organisms in which the centromeric and pericentric DNA sequences can be differentiated, the transcripts levels and functions can be separately analyzed. For example, in S. pombe, the core centromere CENP-A/Cnp1-containing region consists of the non-repetitive central core (cnt) and flanking innermost repeats (imr). The core centromere is separated from the pericentric outer repeats (otr) by tRNA arrays (Dawe 2003). There is evidence of cnt, imr, and otr expressions by RNA Pol II at different lengths, levels, and functions. The tRNA genes at the barrier are also expressed by RNA Pol III. The pericentric RNAs are processed into small interfering RNAs (siRNAs) importantly for heterochromatin establishment, which is in turn required for de novo CENP-A establishment on introduced centromere-containing plasmid (Folco et al. 2008). Interestingly, tethering of histone H3 lysine 9 methyltransferase Clr4 to generate synthetic heterochromatin can bypass the need of RNA interference (RNAi) and the presence of outer repeats for centromere establishment (Chan & Wong 2012; Kagansky et al. 2009).
S. cerevisiae does not have heterochromatic pericentric regions, but the pericentric regions are characterized by H2A.Z-containing nucleosomes, in which this H2A variant is known to regulate sister chromatid cohesion and gene expression (Giaimo et al. 2019; Sharma et al. 2013). Indeed, centromeric transcription is mainly readthrough of pericentromeric regions with highly heterogeneous transcriptional start sites (TSS) but rarely span the entire centromere (Hedouin et al. 2022), consistent with RNA Pol II stalling in CDEI and CDEIII (Candelli et al. 2018).
In Drosophila, previous ChIP-seq revealed that CENP-A associates with simple satellites (Talbert et al. 2018). Yet, recent chromatin fibers, long-read DNA sequencing, and CENP-A/CID ChIP-seq that mapped to a heterochromatin-enriched genome assembly have revealed that CENP-A chromatin is formed on retroelement G2/Jockey-3, which is the only element shared among all chromosomes’ centromeres (Chang et al. 2019). This core centromere is then flanked by large arrays of satellite repeats (Chang et al. 2019). Low levels of G2/Jockey-3 transcription has been detected from some chromosomes’ centromeres but not all (Chang et al. 2019). Satellite RNA has also been shown to bind CENP-C and help to localize CENP-C to the centromere (Rošić et al. 2014). The refined centromere and pericentromere organization will facilitate the separation of transcription in these two regions.
Analyses of an evolutionary neocentromere (ENC), which is formed on non-centromeric DNA sequences and has been propagated across multiple generations in humans, show that the new CENP-A-defined core neocentromere becomes enriched in active epigenetic marks, RNA Pol II, and negatively supercoiled DNA, consistent with active transcription. In contrast, there is a spreading of repressive epigenetic marks, e.g., H3K27me3, to the surrounding regions (Naughton et al. 2022). The authors suggested that transcription disrupts chromatin to provide a foundation for kinetochore formation, while compact pericentric heterochromatin generates mechanical rigidity (Naughton et al. 2022).
Besides centromere and pericentric sequence and epigenetic differences, a recent study has investigated the nucleosomal patterns in different satellite sequences by a newly developed method called single-molecule adenine methylated oligonucleosome sequencing assay (SAMOSA), which combines non-specific adenine (m6dA) methyltransferase footprinting and single-molecule, real-time PacBio DNA sequencing to natively and non-destructively measure nucleosome positions, regularity, and nucleosome repeat lengths (NRLs) on individual chromatin fibers (500 b–2 kb) (Abdulhay et al. 2020). Interestingly, in human K562 cells, H3K9me3-decorated alpha- and beta-satellite sequences are enriched for both the expected regular and the unexpected irregular fibers (Abdulhay et al. 2020). The results show that heterochromatic nucleosome conformations can be both irregular and heterogenous. On the other hand, H3K9me3-free gamma-satellite is only enriched for chromatin fibers with regular long NRLs (Abdulhay et al. 2020). NRLs may specify the ability of heterochromatic nucleosomal arrays to phase separate (Gibson et al. 2019). With future optimization of digestion conditions, SAMOSA could be applicable to longer arrays, enabling kilobase-domain-scale study of single-molecule oligonucleosome patterning. In addition, multiple biochemical or epigenetic signals may be detected on the same single molecules, providing a “multi-omic” third-generation sequencing platform (Abdulhay et al. 2020).
Does centromere transcription occur in holocentromeres?
A missing piece in the centromere transcription field is whether it also happens in meta-polycentromeres and holocentromeres. For repeat-based holocentromeres, recent PacBio HiFi long-read sequencing, ChIP-seq, and high‐throughput chromosome conformation capture (Hi-C) on three holocentric sedge Rhynchospora species and their closest monocentric relative, the rush Juncus effusus, have enabled us to compare their centromere and epigenome organization and 3D genome architecture (Hofstatter et al. 2022). CENP-A/centromeric histone H3 (CenH3) has the highest enrichment in Tyba repeats and a lower enrichment in CRRh throughout the entire R. pubera and Rhynchospora breviuscula genomes (Hofstatter et al. 2022). The density of the Tyba arrays decreases with chromosome size, and there is a high frequency of dyad symmetries in the Tyba consensus sequences (Hofstatter et al. 2022). In R. pubera, the CENP-A-binding Tyba satellites are transcriptionally active in all tissues, and these Tyba satellites and CRRh are inserted into transcriptionally active gene-containing chromatin regions as genome-wide interspersed arrays (Marques et al. 2015). Holocentric sedges have more intrachromosomal interactions than its monocentric relative due to the lack of centromere clustering (Hofstatter et al. 2022). These newly characterized repeat-based holocentromeres seem to have similarities with many regional centromeres. There is a slight enrichment of H3K9me2 flanking CENP-A domains relative to the core CENP-A region, mimicking the pericentromeric chromatin composition in monocentromeres (Hofstatter et al. 2022). Further studies will elucidate whether the Tyba and CRRh transcripts behave like cenRNA and pericenRNA, respectively.
However, in non-repeat-based holocentromeres, there is so far no concrete evidence of holocentromeric transcription. In nematode C. elegans (Gassmann et al. 2012) and lepidopteran Bombyx mori (Senaratne et al. 2021), CENP-A/HCP-3 or CENP-T localization has an anti-correlation with active transcribed regions. In moth tissue culture cells, hormone induction of transcription results in CENP-T loss in the regions, suggesting that transcription can exclude holocentromere locations (Senaratne et al. 2021). CENP-A in C. elegans embryos is also anti-correlated with not only actively transcribed genes in embryos, but also some germline-transcribed regions (Gassmann et al. 2012). Histone H3K36 methyltransferase met-1 mutant causes ectopic H3K36me3 localization and exclusion of CENP-A, but not necessary an increase in active RNA Pol II localization at those sites (Gassmann et al. 2012). Argonaute CSR-1 is involved in a small RNA pathway that regulates germline gene expression (Claycomb et al. 2009). The germline target gene loci of CSR-1 anti-correlates with CENP-A localization (Gassmann et al. 2012). Disruption of CSR-1 pathway increases the level of CENP-A on chromatin (Wong & Yuen 2022). These results suggest that active or past gene transcription, histone modification, and 22G-sRNA may restrict CENP-A localization, which is consistent with the incompatibility of high gene expression with CENP-A in C. albicans and S. cerevisiae (Hill et al. 1987; Ketel et al. 2009).
On a C. elegans artificial chromosome (AC) with almost no genes, the CENP-A domain intervals and average domain size reduce, suggesting that more non-expressed regions are permissible holocentromere regions for CENP-A to locate (Lin et al. 2021). By imaging of artificial chromosomes, we found that inhibition of RNA Pol II-mediated transcription causes delayed de novo centromere establishment on newly formed ACs in holocentric C. elegans early embryos (Zhu et al. 2018a). However, we did not explore whether cenRNAs exist, or distinguish between the role of the act of transcription versus the cenRNA products during centromere establishment. So far, existing transcriptome analyses have detected many lowly abundant, long non-coding transcripts (Akay et al. 2019; Nam & Bartel 2012), but not directly related to holocentromeres. It will be interesting to learn whether holocentric transcription and cenRNAs exist, and their relationships with the centromere function, respectively.
Differentiating the function of the act of transcription versus transcripts
RNA Pol II is responsible for centromere transcription in many species analyzed, as shown by the effects of using specific drugs that differentially inhibit RNA polymerase I, II, or III’s activity (Chan et al. 2012). Recently, immunofluorescence experiments have shown that RNA Pol II Serine 2 phosphorylation of the C-terminal domain (CTD), which correlates with the active transcription elongation, is present at fly, frog, and human centromeres during mitosis (Blower 2016; Chan et al. 2012; Molina et al. 2016; Rošić et al. 2014). The passage of RNA Pol II in centromere transcription will bring two effects: it can trigger a chromatin remodeling event and produce a centromeric RNA. So, although the loading of CENP-A at centromeres has been associated extensively with the transcription process in different organisms (Choi et al. 2011; Grenfell et al. 2016; Quénet & Dalal 2014), the underlying molecular mechanism is only beginning to be unveiled, which may assist us to separate the two effects of transcription.
Function of centromeric transcription
Several previous studies have identified RNA Pol II-associated proteins and transcription-associated chromatin remodeling factors at the centromere. In eukaryotes, Facilitates Chromatin Transcription (FACT), a two-subunit complex containing SSRP1 and SUPT16H, is one of the major regulators of RNA Pol II transcriptional activity. FACT can be retained on DNA while RNA Pol II traverses pass FACT (Belotserkovskaya et al. 2003). FACT can promote RNA Pol II elongation by destabilizing or later stimulating formation of nucleosomes (Formosa 2012). Interestingly, both subunits of FACT interact with the CENP-A in different organisms (Barth et al. 2014; Foltz et al. 2006) and collaborate in transcription-coupled loading of CENP-A at the core centromere domain (Fig. 1A), by interacting with Chromosome Alignment Defect 1 (CAL1), the CENP-A/CID chaperone, in Drosophila (Chen et al. 2015). At ectopic centromeres in Drosophila, CAL1 also recruits FACT and RNA Pol II, which are essential for de novo CENP-A deposition (Chen et al. 2015). Immunostaining of FACT’s two subunits was the strongest at the kinetochore on mitotic chromosomes, at the same time that CENP-A normally loads in Drosophila tissue culture cells (Mellone et al. 2011). A more recent study using Drosophila tissue culture cells further proposed a two-step model for the recruitment and chromatin loading of CENP-A: at the beginning, CAL1-interacting CENP-A is recruited to the centromere domain in a manner that is independent of transcription and CENP-A is kept loosely associated with centromeric chromatin, then RNA Pol II passage facilitates the exchange of H3 nucleosomes to stably incorporate CENP-A into nucleosomes at the centromere (Bobkov et al. 2018). However, another recent study in Drosophila early embryogenesis has shown that RNA Pol II-mediated transcription is not necessary for the recruitment of CENP-A loading on the centromere (Ghosh & Lehner 2022). These contrasting results imply heterogeneity in the requirement of centromere transcription, which may depend on developmental stages or cell types.
A synthetic biology approach tethers different histone modifiers to the alphoidtetO centromere on a human artificial chromosome (HAC) to manipulate histone modifications, and thus transcription, at the centromere (Molina et al. 2016). H3K9ac-associated but not H4K12ac-associated transcription rescues the loss of H3K4me2 and centromeric transcription (Molina et al. 2016). Thus, the authors proposed that both mitotic centromeric transcription and H3K9ac can generate an appropriate epigenetic landscape to destabilize H3 nucleosomes and promote CENP-A loading (Molina et al. 2016). In chicken B cell line DT-40, FACT localizes to the centromere in a CENP-H-dependent manner and associates with chromatin remodeler CHD1 to facilitate CENP-A loading (Okada et al. 2009), raising the possibility that CENP-A chaperone HJURP may act through a FACT-dependent mechanism similarly to CAL1 in Drosophila. On the other hand, in budding yeast S. cerevisiae, FACT functions in correcting ectopic CENP-A loading by binding with a E3-ubiquitin ligase and triggering the proteasome-mediated degradation of ectopic CENP-A (Deyter et al. 2014).
Besides transcription elongation, transcriptional stalling within the core centromere region has been suggested to promote de novo establishment of CENP-A chromatin in fission yeast (Fig. 1B) (Catania et al. 2015). The central core domain (cnt) of S. pombe contains numerous promoters and transcriptional start sites (Catania et al. 2015). The abilities of different parts of the cnt to assemble CENP-A chromatin were not equivalent. Surprisingly, the parts with lower transcriptional activity (high RNA Pol II level but low transcript levels) were more effective in recruiting CENP-A than those with higher activity. Mutants that are defective in restarting stalled RNA Pol II indeed increase CENP-A loading, suggesting that persistent RNA Pol II stalling creates a favorable chromatin environment for CENP-A loading, perhaps through dislodging H3 nucleosomes without eviction of CENP-A or providing a longer time for CENP-A recruitment (Catania et al. 2015). An in vitro experiment has also shown that CENP-A nucleosomes present a greater barrier to transcription than H3 nucleosomes (Shandilya et al. 2014), and CENP-A-induced stalling may potentially promote their own recruitment in a positive-feedback loop.
To further decipher which step during transcription may be important to facilitate CENP-A and kinetochore assembly, Heald’s group has used different drug treatments to inhibit specific stages of transcription in Xenopus. When transcription initiation is inhibited by triptolide or when RNA splicing process is inhibited by ISGN, a reduced level of CENP-A and kinetochore protein NDC-80 is observed (Grenfell et al. 2016). Yet when transcription elongation is inhibited by α-amanitin, no CENP-A incorporation effect is observed (Grenfell et al. 2016). These results may indicate that some transcription initiation or splicing factors are specifically involved in CENP-A recruitment.
From all these studies, the act of centromeric transcription and stalling, accompanied by the transcription-associated chromatin remodeling, play important roles in the establishment and maintenance of centromeric chromatin, particularly the stable incorporation of CENP-A into chromatin. However, to definitely conclude that the function solely comes from the transcription process and not from the cenRNAs will need more evidence to distinguish the two effects.
Structures and functions of cenRNA
In general, non-coding RNA can function as a guide to recruit proteins to DNA or chromatin, a scaffold to assemble ribonucleoprotein complexes, a decoy which bind and sequester proteins thereby inhibiting their normal functions, or an inter-cellular signaling molecule (Corless et al. 2020; Hezroni et al. 2015). There appears to be relatively more studies on manipulating the levels of cenRNAs than studies on the centromere transcription process. Centromere-derived RNAs have been proposed to play a critical role in proper centromere formation and function. Specifically, overexpression or knockdown of cenRNAs reduces CENP-A and CENP-C levels at the centromere (Ling & Yuen 2019; Liu et al. 2015; Rošić et al. 2014). Yet, the lack of cenRNA sequence conservation prompts us to speculate if cenRNAs have any conserved secondary and tertiary structures, which may enable them to recruit centromeric protein to the centromere.
Centromeric transcription often occurs in both sense and anti-sense strands (Ferri et al. 2009; Ling & Yuen 2019; Topp et al. 2004). Both long (> 200 nt) and short non-coding cenRNA and pericenRNA have been catalogued in previous reviews (Arunkumar and Melters 2020; Chan et al. 2012; Chan & Wong 2012). cenRNA structure has been proposed to be double-stranded (ds) RNA, single-stranded (ss) RNA, or a DNA-RNA hybrid with the complementary DNA sequence at the centromere (R-loops). Here, we describe evidence supporting different types of cenRNA structures (Table 2).
Different structures of cenRNAs
Double-stranded (ds) cenRNA
Mouse minor satellites from the core centromere are transcribed and the transcripts are sensitive to RNase III, which cleaves dsRNA (Bouzinba-Segard et al. 2006). Tammar Wallaby’s Kangaroo Endogenous Retrovirus (KERV-1) exists as both dsRNA and ssRNA, which are important for CENP-B localization and pericentric heterochromatin maintenance (Carone et al. 2009). In Schizosaccharomyces pombe, ribonuclease Dicer, which cleaves pericentric otr dsRNA to 21–25-nt small interfering RNA (siRNA), and the RNA-induced initiation of transcriptional gene silencing (RITS) complex are important for CENP-A/Cnp1 establishment (Folco et al. 2008).
Single-stranded (ss) cenRNA
In maize, centromere retrotransponsons (CRM) and satellite repeats (CentC) are actively transcribed in both strands, and cenRNAs (40–250 nt) interact with CENP-A (Topp et al. 2004). However, there is no detection of centromere-derived small interfering RNAs (siRNAs) processed by the RNAi pathway. These cenRNAs are also sensitive to RNase A, suggesting that they maintain single-stranded organization (Topp et al. 2004). Single-stranded RNAs are highly unstable compared to double-stranded RNA (Zhang et al. 2021). However, RNA–protein interaction may stabilize the ss cenRNA. Further in vitro study has demonstrated that single-stranded maize cenRNA (44 nt) is essential for CENP-C exon duplication region to maintain DNA binding, and such cenRNA-CENP-C interaction is required for proper CENP-C centromere localization (Topp et al. 2004). In human HeLa cell line, alpha-satellite cenRNA, CENP-C1, and INCENP localize to the nucleolus in interphase in an RNA polymerase I-dependent manner, and this localization is also sensitive to RNase A (Wong et al. 2007). This suggests that human cenRNA also interacts as single-stranded structure with kinetochore or inner centromere protein.
In the case of transcription in the point centromere of Saccharomyces cerevisiae, after the introduction of RNAi machinery (but without adding the hairpin RNA against cenRNA), the level of cenRNAs is not affected, suggesting that endogenous cenRNAs probably do not exist as dsRNA (Ling & Yuen 2019). However, there is no direct evidence of ssRNA in budding yeast so far.
R-loop
Since cenRNAs are highly complementary with the template centromeric DNA, cenRNAs can potentially bind and form a R-loop structure consisting of a three-stranded DNA-RNA hybrid, with one strand of RNA bound to a single DNA strand, and a displaced single-stranded DNA. According to different studies, R-loops have been reported to be associated with gene transcription (Fang et al. 2019; Ling & Yuen 2019), DNA replication initiation (Yu et al. 2003), DNA damage response (Hamperl et al. 2017), DNA repair (Lu et al. 2018), and genome instability (Chedin & Benham 2020). The formation of DNA-RNA hybrids is also an important way to target the RNA to the local chromatin in a sequence-specific manner (Maldonado et al. 2019).
Generally, R-loops are formed with G-rich clusters during transcription (Allison & Wang 2019). However, the centromeric regions are often enriched with AT (Altemose et al. 2022a; Baker & Rogers 2005), which is not favorable for elongation and the formation of the R-loops. Additionally, negative supercoiling facilitates R-loop formation (Stolz et al. 2019). In S. cerevisiae, CENP-A/CSE4 induces positive supercoiling at the centromere (Furuyama & Henikoff 2009), which is unfavorable for R-loop formation. Yet, during transcription, while RNA polymerase acts on the DNA template, there exists a region upstream of the RNA polymerase with hyper-negative supercoiling where R-loop formation can potentially occur (Dorman 2019).
Centromeric R-loops have been identified in different species, including fission yeast, maize, rice, Arabidopsis, and human (Fang et al. 2019; Kabeche et al. 2018; Xu et al. 2017). In particular, a genome-wide R-loop map of maize leaf has been generated by single-strand DNA ligation-based library construction from DNA-RNA hybrid immunoprecipitation by S9.6 antibody, followed by sequencing (ssDRIP-seq). R-loops are enriched in centromeric regions, especially in the binding regions of CENP-A/CenH3, and pericentric regions (Yang Liu et al. 2021).
In S. pombe pericentromeric outer repeats (otr), in addition to dsRNA (Folco et al. 2008), about half of the non-coding RNAs (ncRNAs) are associated with chromatin that shows sensitivity towards RNase H, which cleaves DNA-RNA hybrids (Nakama et al. 2012). These ncRNAs form DNA-RNA hybrids, bind to the RNA-induced transcriptional silencing (RITS) complex, and result in RNAi-driven platform for heterochromatin assembly (Nakama et al. 2012).
Some evidence also shows that R-loop accumulation can in fact help the maintenance of centromeric integrity. In human cells, R-loops have been detected at centromeres specifically during mitosis, and such presence of R-loop drives ATR signaling pathway activation, which is important for faithful chromosome segregation and genome stability (Kabeche et al. 2018). A recent study shows that inducing centromeric double-stranded breaks (DSBs) in G1 phase leads to an increase in centromeric H3K4me2 and transcription of human and mouse cenRNAs. CENP-A and HJURP interact with the deubiquitinase USP11, enabling formation of the RAD51–BRCA1–BRCA2 complex. This further facilitates the homologous recombination (HR) of DNA and RNA, despite the absence of a sister chromatid, forming DNA-RNA hybrids and causing DNA-end resection to facilitate the repair of DSBs (Yilmaz et al. 2021).
On the contrary, R-loop accumulation at centromere chromatin has been found to be detrimental for proper chromosome segregation and genomic stability (Allison & Wang 2019). In Saccharomyces cerevisiae, HPR1, which interacts with CENP-A/CSE4, prevents the accumulation of R-loop in the centromeric region. In hpr1∆ mutant, a reduced level of CENP-A chaperone, SCM3, and a defective increased localization of histone H3 at the centromere have been observed (Mishra et al. 2021). Centromeric alpha-satellite array R-loops have been detected in human cancer cell lines. The presence of R-loop recruits BRCA1, which counteracts the accumulation of R-loops at the centromere, prevents centromere breakage, limits hyper-recombination, and ensures proper localization of CENP-A (Racca et al. 2021). Overexpression of pericenetric major satellite RNA transcription in mice sequesters BRCA1-associated network, causing destabilization of DNA replication forks, accumulation of R-loops, DNA damage, and induction of breast cancers (Zhu et al. 2018a, b). While the co-presence of centromeric R-loops and BRCA1 are evident in these studies (Yilmaz et al. 2021; Racca et al. 2021; Zhu et al. 2018a, b), the consequences appear to be different dependent on the situations. The above findings suggest that centromeric R-loops may have both positive and negative impacts on the centromere function and genome stability (Crossley et al. 2019). Further mechanistic understanding on how centromeric R-loop structure affects centromere function in different species or conditions will be useful to unveil its dynamic roles.
Centromeric circRNA
Circular RNAs (circRNAs) are formed by fusing the upstream 5′-splice site and the downstream 3′-splice site of a pre-messenger RNA (mRNA) through the process of back splicing (Li et al. 2018). CircRNAs are single-stranded, covalently closed RNA molecules, and are involved in RNA–protein complex formation and gene expression regulation (Li et al. 2018). CircRNAs have been explored in fly, worm, mouse, human, and plants, such as Oryza sativa, Arabidopsis thaliana, maize, and wheat (Memczak et al. 2013; Salzman et al. 2012; Westholm et al. 2014; Ye et al. 2015; Chen et al. 2018; Ye et al. 2015). However, due to the repetitive nature of centromeric DNA and the limitation of bioinformatics analysis tools to identify circRNAs, it has been difficult to identify centromeric circRNAs.
The first identified centromeric circRNAs are derived from centromeric retrotransposons in maize. These centromeric circRNAs bind to the centromere through R-loops (Liu et al. 2020). When using RNA interference to target against the sites of back-splicing in the circular cenRNAs, the level of circRNA, R-loops, and R-loop-induced chromatin loops decrease, and consequently, the level of CENP-A/CenH3 at centromere localization drops, indicating the significance of centromeric circRNA-derived R-loop in centromere integrity (Liu et al. 2020). The function and features of these centromeric circRNAs are just beginning to be understood.
Functions of cenRNAs and pericenRNAs
Function of cenRNAs as a scaffold
Non-coding RNA may work as a scaffold that spatially organizes proteins. This kind of RNA–protein or RNA-RNA bindings can be part of the components of a soluble or chromatin-associated protein complex (Hentze et al. 2018). RNAs may play a role in soluble complex formation (Quénet & Dalal 2014). To identify RNA-dependent soluble complexes, a newly developed tool based on density gradient ultracentrifugation, called R-DeeP, has been used. The CENP-A chaperone, HJURP, has shown an RNA-dependent shift in size (Caudron-Herger et al. 2019). The HJURP complex shows an increase in the size of the protein complex upon RNase (RNase A, RNase I, RNase T1, RNase H, and RNase III) treatment. The authors attribute this increase of size to a gain in interaction partners due to the increased availability of binding sites when RNA is degraded (Caudron-Herger et al. 2019). Future research can determine whether it is the cenRNAs that play a role in such complex formation, and this will help to establish a more complete picture of the role of cenRNAs in soluble protein complex formation.
In addition to soluble protein complexes, transcribed RNAs can associate with proximal chromatin as evidenced by RNA–DNA proximity ligation approaches (Bell et al. 2018; Sridhar et al. 2017). In fact, well-known chromatin binding factors, such as Polycomb complex (Zhang et al. 2019) and CCCTC-binding factor (CTCF) (Hansen et al. 2019), not only bind to distinct DNA motifs but are also functionally associated with RNA. Polycomb complex (Davidovich et al. 2013) and CTCF (Hansen et al. 2019; Saldaña-Meyer et al. 2019) are abundant at the pericentric heterochromatin and interact with a RNA component that is critical to their functions.
Function of centromeric RNAs in inner centromere signaling
Inner centromere is the region in between the centromeric regions of the sister chromatids (Trivedi & Stukenberg 2016). The chromosomal passenger complex (CPC) accumulates at the inner centromere region (Trivedi & Stukenberg 2016). The CPC contains the mitotic kinase Aurora B, INCENP, survivin, and borealin (Trivedi & Stukenberg 2016), and can sense and respond to the pulling forces generated at the kinetochores (Bloom 2014). In some vertebrates, CPC components have been shown to pull down cenRNAs. For example, in Xenopus, Aurora B has been shown to bound to cenRNA, which regulates both Aurora B’s localization and its activation (Blower 2016). Shugoshin (SGO1), which protects centromeric cohesion from cleavage during prophase (McGuinness et al. 2005), has also been shown to associate with cenRNA in vitro (Liu et al. 2015). Potentially, this cenRNA-SGO1 interaction allows SGO1 to reach cohesin embedded in centromeric chromatin (Liu et al. 2015). However, how exactly cenRNAs affect the recruitment of SGO1 and CPC to facilitate mitotic progression are still not clear.
Functions of pericentric transcription and pericenRNAs in heterochromatin formation
As discussed in the above sections a1 and 3, the fission yeast otr pericenRNA transcripts are processed by the RNA interference pathway and are important for heterochromatin establishment (Folco et al. 2008). In mouse cells, H3K9 methyltransferases SUV39H1 and SUV39H2 associate with major satellite RNAs derived from pericentric regions (but not with minor satellite RNAs from the centromeric regions) (Camacho et al. 2017; Johnson et al. 2017). The protein-RNA interaction stabilizes their chromatin localization and facilitates H3K9 histone methylation and heterochromatin formation (Camacho et al. 2017; Johnson et al. 2017). In a chicken-human hybrid DT-40 cell line, in which the centromeres also contain unique DNA sequences (Krasikova et al. 2012), conditional knockout of Dicer has different effects on the pericentric regions and the core centromere (Fukagawa et al. 2004). RNAi-deficient cells cause defective localization of heterochromatin proteins, cohesin protein Rad21, and checkpoint protein BubR1, while the localization of centromere proteins, including CENP-A and CENP-C, are normal (Corless et al. 2020; Fukagawa et al. 2004; Johnson et al. 2017). Such result indicates the differences in the requirement of the RNA interference or Dicer pathway between these two regions (Fukagawa et al. 2004; Hall et al. 2002; Provost et al. 2002; Volpe et al. 2002).
Implications from centromeric and pericentric RNA misregulation and diseases
The association and consequences of cenRNA and pericenRNA upregulation with stresses, cancers, and diseases have been investigated and extensively reviewed (Arunkumar and Melters 2020; Hernández-Saavedra et al. 2017; Smurova & De Wulf 2018). Pericentric HSAT II are overexpressed in some cancers, which indicates its potential application in cancer diagnosis (Hall et al. 2017). Overexpression of pericentric satellite RNA can even induce breast cancer (Zhu et al. 2018b). In general, high levels of cenRNAs promote chromosomal instability (CIN), which correlates with tumor metastasis (Chan et al. 2017; Zhu et al. 2011). Understanding how different organisms fine tune the expression of cenRNAs and pericenRNAs and determining the relationship between cenRNA regulation in stressed and disease conditions will help to apply cenRNAs and pericenRNA as biomarkers for diagnosis or prognosis in cancers and other diseases (Arunkumar and Melters 2020; Smurova & De Wulf 2018). Modulation of cenRNA or pericenRNA levels could also be important for cancer prevention or treatment.
Causes and functions of cenRNA transcript variants
Different start and end sites, and different cell cycle timings
The timing of centromeric transcription vary among organisms, but appears to correlate with the timing of CENP-A loading, as discussed briefly in b1 above and more extensively reviewed previously (Arunkumar and Melters 2020; Perea-Resa et al. 2018). Yet, within the same organism, cenRNA variants may be observed from different stages of the cell cycle, as evidenced by isoform RNA sequencing using long-read sequencing techniques in G1 and S phase in S. cerevisiae (Hedouin et al. 2022). The most abundant transcription isoform of each chromosome can be identified and classified based on the TSS: they can be initiated at nearby gene promoter or terminator sites, from anti-sense initiation of a neighboring gene, transcription readthrough from a neighboring gene, or initiation in an intergenic region (Fig. 2A) (Hedouin et al. 2022). However, they usually end upstream of the centromere in G1 (Hedouin et al. 2022; Ling & Yuen 2019). Only in S phase, the transcription leakage through into CDEI or CDEIII, due to centromere DNA replication and removal of CBF1 from CDEI, leading to the generation of cenRNAs (Hedouin et al. 2022). Even so, most identified cenRNAs do not encompass the full centromere (all three CDEs) (Hedouin et al. 2022). By 5′ and 3′ rapid amplification of cDNA ends (RACE), multiple S. cerevisiae centromeric transcript variants with different lengths (462–1754 nt) derived from both sense and anti-sense strands from three chromosomes have been characterized (Ling & Yuen 2019), consistent with the isoform RNA sequencing (Hedouin et al. 2022).
Different stabilities
In human cells, cenRNAs derived from different alpha-satellite arrays show different stability (Arunkumar and Melters 2020). CenRNA alpha-satellite generated within “active centromere” arrays (e.g., DXZ1 or D17Z1) associate with CENP-A and CENP-C, and are more stable compared to the transcripts from “inactive pericentric” arrays (e.g., D17Z1-B), which associate with CENP-B (McNulty et al. 2017).
Processing into shorter cenRNAs
Only a few studies have unveiled the details about cenRNA processing, possibly because it is difficult to trace all cenRNA variants, which may be short-lived or unstable. In mouse cells, while the core centromere precursor transcripts are about 2–4 kb, 120 nt of minor satellite transcripts are also identified (Bouzinba-Segard et al. 2006; Ferri et al. 2009). These data indicate that after being transcribed as a RNA precursor, cenRNA might be processed, by alternative splicing and post-transcriptional modifications (Wilkinson et al. 2020). In rice, > 3-kb transcripts derived from centromere retrotransposon (CRR) arrays have been found to be processed into small RNA (sRNA) of about 24 nt in length, which plays critical roles in the RNAi pathway to maintain heterochromatin (Neumann et al. 2007). In addition, splicing of intron-like elements in pericentric RNAs in fission yeast has been observed. Mutation of a splicing factor, PRP16, leads to pericentric heterochromatin defects (Vijayakumari et al. 2019).
m6A methylation and splicing
Recently, researchers have explored the impact of pericenRNA levels on mRNA splicing (Ninomiya et al. 2021; Vourc’h et al. 2022). Many target mRNAs contain N(6)-methyladenosine (m6A). m6A modification recruits a m6A reader, YTHDC1, to promote splicing (Xiao et al. 2016). Upon thermal stress, HSATIII RNAs, which are generated from pericentromeric regions, are highly expressed in HeLa cells. This might sequester the m6A writer complex and YTHDC1 in nuclear stress bodies (nSB) (Fig. 2B). This process will diminish m6A methylation and YTHDC1 binding to the target mRNAs, which in turn repress the splicing of target mRNAs (Ninomiya et al. 2021).
Polyadenylation
Telomeric repeat-containing RNA (TERRA) is a lncRNA generated from telomere DNA. About 7% of human TERRA is polyadenylated, whereas most of the yeast TERRA is polyadenylated (Feuerhahn et al. 2010). CenRNA polyadenylation has also been found in different organisms, including budding and fission yeast, rice, mice, and humans (Arunkumar and Melters 2020; Choi et al. 2011; Ling & Yuen 2019; Neumann et al. 2007). In budding yeast, centromeric transcripts with polyadenylated tails have been detected by 3′ RACE with the oligo (dT) primer (Ling & Yuen 2019). Similarly, various poly (A) tails with different termination sites were identified in rice by 3′ RACE (Fig. 2B) (Neumann et al. 2007). However, it is possible that not all cenRNA and pericenRNAs are polyadenylated. Similarly, 5′ capping of cenRNAs or pericenRNAs has also been reported in fission yeast (Arunkumar and Melters 2020; Choi et al. 2011). However, the percentage of 5′ capping or 3′ polyadenylated cenRNA and pericenRNAs might also vary among different species, and whether polyadenylation of RNA Pol II-derived centromeric transcripts contributes only to their stability or has additional functions needs further exploration.
In summary, many cenRNA and pericenRNA variants are generated due to multiple processes during and after transcription, including different transcription start and termination sites, alternative splicing, and modifications. Whether the cells need to have a repertoire of cenRNA variants in different abundancies remains unclear. Some cenRNAs could be more highly expressed only because of the leaky nearby gene expression. Studying different transcriptional variants will be useful to determine whether cenRNAs function interchangeably, potentially in trans, or whether individual cenRNA variants have unique functions, possibly in cis.
Future perspectives
Many previous studies analyzed centromere transcription and cenRNAs used population-based assays, including reverse transcription-quantitative polymerase chain reaction (RT-qPCR) and RNA short-read sequencing (RNA-seq). The average behaviors of cenRNAs across a cell population may not allow accurate characterization of the nature, number, and variants of cenRNAs in individual cells, especially many cenRNAs are lowly expressed and induced only in specific cell cycle timing (Biscotti et al. 2015; Blower 2016; Fachinetti et al. 2013; McNulty et al. 2017; Quénet & Dalal 2014). Knowledge on centromeric RNA has been increasing very quickly with the advancing single-molecule long-read and modification sequencing techniques. Microscopy approaches to visualize cenRNA transcripts in individual cells may help to resolve this averaging effect problem. These single-cell techniques may include detecting cenRNAs by LNA probes in mitotic chromosome spreads, RNA-fluorescence in-situ hybridization combined with immunofluorescence (RNA-FISH-IF) (Kochan et al. 2015, aptamer-tagged non-coding RNA (Autour et al. 2018), and single-molecule fluorescence in-situ hybridization (smFISH) (Raj et al. 2008; Rošić et al. 2014), a strategy that has been used to detect unique mRNAs and long non-coding RNAs (lncRNAs).
Experiments that affect the transcription process will unavoidably affect the corresponding transcript level as well. Directly comparing the centromere, CIN or cell cycle phenotypes by manipulating centromeric transcription and just the cenRNA level can further elucidate the function of each component. The additional phenotypes changes from manipulating transcription could be attributed to the function of the act of transcription, but of course perturbation of transcription has pleiotropic effects that have to be taken into account. Alternatively, manipulation of a particular transcription stage or RNA processing event may unveil the importance of each step. Exploring the characteristics and mechanism of cenRNA in more diverse model organisms will also provide a wider perspective for a potential conserved function.
Data availability
Not applicable.
References
Abdulhay NJ, McNally CP, Hsieh LJ, Kasinathan S, Keith A, Estes LS, Ramani V (2020) Massively multiplex single-molecule oligonucleosome footprinting. Elife 9:e59404
Akay A, Jordan D, Navarro IC, Wrzesinski T, Ponting CP, Miska EA, Haerty WJB (2019) Identification of functional long non-coding RNAs in C elegans. BMC Biol 17:1–14
Ali-Ahmad A, Sekulić N (2020) CENP-A nucleosome—a chromatin-embedded pedestal for the centromere: Lessons learned from structural biology. Essays Biochem 64(2):205–221
Allison DF, Wang GG (2019) R-loops formation, function and relevance to cell stress. Cell stress 3(2):38–46. https://doi.org/10.15698/cst2019.02.175
Allshire RC, Karpen GHJNRG (2008) Epigenetic regulation of centromeric chromatin: Old dogs, new tricks? Nat Rev Genet 9(12):923–937
Altemose N, Logsdon GA, Bzikadze AV, Sidhwani P, Langley SA, Caldas GV, Miga KH (2022a) Complete genomic and epigenetic maps of human centromeres. Science 376(6588):eabl4178. https://doi.org/10.1126/science.abl4178
Altemose N, Maslan A, Smith OK, Sundararajan K, Brown RR, Mishra R, Streets A (2022b) DiMeLo-seq: A long-read, single-molecule method for mapping protein–DNA interactions genome wide. Nature methods 19(6):711–723
Arunkumar G, Melters DP (2020) Centromeric transcription: a conserved Swiss-Army knife. Genes 11(8):911
Autour A, CY Jeng SD, Cawte A, Abdolahzadeh A, Galli A, Panchapakesan SS, ..., Unrau PJ (2018) Fluorogenic RNA Mango aptamers for imaging small non-coding RNAs in mammalian cells. Nature Comm 9(1):656
Baker RE, Rogers K (2005) Genetic and Genomic Analysis of the AT-Rich Centromere DNA Element II of Saccharomyces Cerevisiae. Genetics 171(4):1463–1475
Barbosa AC, Xu Z, Karari K, Williams W, Hauf S, Brown WR (2022) Mutation and selection explain why many eukaryotic centromeric DNA sequences are often A+ T rich. Nucleic Acid Res 50(1):579–596
Barski A, Cuddapah S, Cui K, Roh TY, Schones DE, Wang Z, ..., Zhao K (2007) High-resolution profiling of histone methylations in the human genome. Cell 129(4):823–837
Barth TK, Schade GO, Schmidt A, Vetter I, Wirth M, Heun P, ..., Imhof A (2014) Identification of novel Drosophila centromere‐associated proteins. Proteomics 14(19):2167–2178
Bell JC, Jukam D, Teran NA, Risca VI, Smith OK, Johnson WL, ..., Straight AF (2018) Chromatin-associated RNA sequencing (ChAR-seq) maps genome-wide RNA-to-DNA contacts. Elife 7:e27024
Belotserkovskaya R, Oh S, Bondarenko VA, Orphanides G, Studitsky VM, Reinberg D (2003) FACT Facilitates Transcription-Dependent Nucleosome Alteration. Science 301(5636):1090–1093
Bergmann JH, Rodríguez MG, Martins NM, Kimura H, Kelly DA, Masumoto H, ..., Earnshaw WC (2011) Epigenetic engineering shows H3K4me2 is required for HJURP targeting and CENP‐A assembly on a synthetic human kinetochore. EMBO J 30(2):328–340
Bergmann JH, Jakubsche JN, Martins NM, Kagansky A, Nakano M, Kimura H, ..., Earnshaw WC (2012) Epigenetic engineering: Histone H3K9 acetylation is compatible with kinetochore structure and function. J Cell Sci 125(2):411–421
Biscotti MA, Canapa A, Forconi M, Olmo E, Barucca MJCR (2015) Transcription of tandemly repetitive DNA: functional roles. Chromosome Res 23:463–477
Bloom KS (2014) Centromeric heterochromatin: the primordial segregation machine. Annu Rev Genet 48:457
Blower, MD (2016) Centromeric transcription regulates Aurora-B localization and activation. Cell Reports 15(8):1624–1633
Bobkov GO, Gilbert N, Heun P (2018) Centromere transcription allows CENP-A to transit from chromatin association to stable incorporation. J Cell Biol 217(6):1957–1972
Bouzinba-Segard H, Guais A, Francastel C (2006) Accumulation of small murine minor satellite transcripts leads to impaired centromeric architecture and function. Proc Natl Acad Sci 103(23):8709–8714
Camacho OV, Galan C, Swist-Rosowska K, Ching R, Gamalinda M, Karabiber F, ..., Jenuwein T (2017) Major satellite repeat RNA stabilize heterochromatin retention of Suv39h enzymes by RNA-nucleosome association and RNA: DNA hybrid formation. Elife 6:e25293
Candelli T, Challal D, Briand JB, Boulay J, Porrua O, Colin J, Libri D (2018) High-resolution transcription maps reveal the widespread impact of roadblock termination in yeast. EMBO J 37(4):e97490
Carbon J, Clarke L (1984) Structural and functional analysis of a yeast centromere. J Cell Sci 1:43–58
Carone DM, Longo MS, Ferreri GC, Hall L, Harris M, Shook N, ..., O’Neill RJ (2009) A new class of retroviral and satellite encoded small RNAs emanates from mammalian centromeres. Chromosoma 118:113–125
Carrozza MJ, Li B, Florens L, Suganuma T, Swanson SK, Lee K K, ..., Workman JL (2005) Histone H3 methylation by Set2 directs deacetylation of coding regions by Rpd3S to suppress spurious intragenic transcription. Cell 123(4):581–592
Catania S, Pidoux AL, Allshire RCJP (2015) Sequence features and transcriptional stalling within centromere DNA promote establishment of CENP-A chromatin. PLOs Genet 11(3):e1004986
Caudron-Herger M, Rusin SF, Adamo ME, Seiler J, Schmid VK, Barreau E, Diederichs S (2019) R-DeeP: proteome-wide and quantitative identification of RNA-dependent proteins by density gradient ultracentrifugation. Molecular cell 75(1):184–199. e110
Chan FL, Wong LH (2012) Transcription in the Maintenance of Centromere Chromatin Identity. Nucleic Acids Res 40(22):11178–11188
Chan FL, Marshall OJ, Saffery R, Won Kim B, Earle E, Choo KA, Wong LH (2012) Active transcription and essential role of rna polymerase II at the centromere during mitosis. Proceedings of the National Academy of Sciences 109(6):1979–1984
Chan DY, Moralli D, Khoja S, Monaco ZL (2017) Noncoding centromeric RNA expression impairs chromosome stability in human and murine stem cells. Disease Markers 2017:1–8
Chang CH, Chavan A, Palladino J, Wei X, Martins NM, Santinello B, ..., Mellone BG (2019) Islands of retroelements are major components of Drosophila centromeres. PLoS Biol 17(5):e3000241
Chantalat S, Depaux A, Héry P, Barral S, Thuret J-Y, Dimitrov S, Gérard MJGR (2011) Histone H3 trimethylation at lysine 36 is associated with constitutive and facultative heterochromatin. Genome Res 21(9):1426–1437
Chedin F, Benham CJ (2020) Emerging roles for R-Loop structures in the management of topological stress. J Biol Chem 295(14):4684–4695
Chen CC, Bowers S, Lipinszki Z, Palladino J, Trusiak S, Bettini E, ..., Mellone BG (2015) Establishment of centromeric chromatin by the CENP-A assembly factor CAL1 requires FACT-mediated transcription. Developmental Cell 34(1):73–84
Chen L, Zhang P, Fan Y, Lu Q, Li Q, Yan J, ..., Li L (2018) Circular RNAs mediated by transposons are associated with transcriptomic and phenotypic variation in maize. New Phytologist 217(3):1292–1306
Choi ES, Strålfors A, Castillo AG, Durand-Dubief M, Ekwall K, Allshire RC (2011) Identification of noncoding transcripts from within CENP-A chromatin at fission yeast centromeres. J Biol Chem 286(26):23600–23607
Clarke L, Carbon J (1980) Isolation of a yeast centromere and construction of functional small circular chromosomes. Nature 287(5782):504–509
Claycomb JM, Batista PJ, Pang KM, Gu W, Vasale JJ, van Wolfswinkel JC, ..., Mello CC (2009) The Argonaute CSR-1 and its 22G-RNA cofactors are required for holocentric chromosome segregation. Cell 139(1):123–134
Corless S, Höcker S, Erhardt S (2020) Centromeric RNA and its function at and beyond centromeric chromatin. J Mol Biol 432(15):4257–4269
Crossley MP, Bocek M, Cimprich KA (2019) R-Loops as cellular regulators and genomic threats. Mol Cell 73(3):398–411
Cumberledge S, Carbon J (1987) Mutational analysis of meiotic and mitotic centromere function in saccharomyces cerevisiae. Genetics 117(2):203–212
Davidovich C, Zheng L, Goodrich KJ, Cech TRJNS, Biology M (2013) Promiscuous RNA binding by Polycomb repressive complex 2. Nat Struct Mol Biol 20(11):1250–1257
Dawe RKJTPC (2003) RNA interference, transposons, and the centromere. Plant Cell 15(2):297–301
Deyter GM, Biggins SJG (2014) The FACT complex interacts with the E3 ubiquitin ligase Psh1 to prevent ectopic localization of CENP-A. Genes Dev 28(16):1815–1826
Dorman C (2019) DNA supercoiling and transcription in bacteria: A two-way street. BMC Mol Cell Biol 20. https://doi.org/10.1186/s12860-019-0211-6
Du Y, Topp CN, Dawe RK (2010) DNA binding of centromere protein C (CENPC) is stabilized by single-stranded RNA. PLoS Genet 6(2):e1000835. https://doi.org/10.1371/journal.pgen.1000835
Duda Z, Trusiak S, O’Neill R (2017) Centromere transcription: Means and motive. Centromeres and Kinetochores: Discovering the Molecular Mechanisms Underlying Chromosome Inheritance 257–281
Escudero M, Márquez-Corro JI, Hipp AL (2016) The phylogenetic origins and evolutionary history of holocentric chromosomes. Syst Bot 41(3):580–585
Fachinetti D, Diego Folco H, Nechemia-Arbely Y, Valente LP, Nguyen K, Wong AJ, ..., Cleveland DW (2013) A two-step mechanism for epigenetic specification of centromere identity and function. Nature Cell Biol 15(9):1056–1066
Fang Y, Chen L, Lin K, Feng Y, Zhang P, Pan X, ..., Zhang W (2019) Characterization of functional relationships of R-loops with gene transcription and epigenetic modifications in rice. Genome Res 29(8):1287–1297
Ferrari KJ, Scelfo A, Jammula S, Cuomo A, Barozzi I, Stützer A, ..., Pasini D (2014) Polycomb-dependent H3K27me1 and H3K27me2 regulate active transcription and enhancer fidelity. Molecular Cell 53(1):49–62
Ferri F, Bouzinba-Segard H, Velasco G, Hube F, Francastel C (2009) Non-coding murine centromeric transcripts associate with and potentiate aurora B kinase. Nucleic Acids Res 37(15):5071–5080
Feuerhahn S, Iglesias N, Panza A, Porro A, Lingner J (2010) TERRA biogenesis, turnover and implications for function. FEBS Lett 584(17):3812–3818. https://doi.org/10.1016/j.febslet.2010.07.032
Flemming W (1880) Beitrage zur kenntniss der zelle und ihrer lebenserscheinungen. Arch Mikroskop Anat 1879:302–436
Folco HD, Pidoux AL, Urano T, Allshire RC (2008) Heterochromatin and RNAi are required to establish CENP-A chromatin at centromeres. Science 319(5859):94–97
Foltz DR, Jansen LE, Black BE, Bailey AO, Yates JR, Cleveland DW (2006) The human CENP-A centromeric nucleosome-associated complex. Nature Cell Biol 8(5):458–469
Formosa T (2012) The role of fact in making and breaking nucleosomes. Biochimica Et Biophysica Acta (BBA)-Gene Regulatory Mechanisms 1819(3–4):247–255
Freire-Benéitez V, Price RJ, Buscaino A (2016) The chromatin of candida albicans pericentromeres bears features of both euchromatin and heterochromatin. Frontiers in Microbiol 7:759
Fukagawa T, Nogami M, Yoshikawa M, Ikeno M, Okazaki T, Takami Y, . . . Oshimura M (2004) Dicer is essential for formation of the heterochromatin structure in vertebrate cells. Nat Cell Biol 6(8):784–791
Furuyama T, Henikoff S (2009) Centromeric nucleosomes induce positive DNA supercoils. Cell 138(1):104–113. https://doi.org/10.1016/j.cell.2009.04.049
Gassmann R, Rechtsteiner A, Yuen KW, Muroyama A, Egelhofer T, Gaydos L, ..., Desai A (2012) An inverse relationship to germline transcription defines centromeric chromatin in C. elegans. Nature 484(7395):534–537
Ghosh S, Lehner CF (2022) Incorporation of CENP-A/CID into centromeres during early Drosophila embryogenesis does not require RNA polymerase II-mediated transcription. Chromosoma 131(1):1–17. https://doi.org/10.1007/s00412-022-00767-2
Giaimo BD, Ferrante F, Herchenröther A, Hake SB, Borggrefe T (2019) The histone variant H2A. Z in gene regulation. Epigenetics & Chromatin 12(1):1–22
Gibson BA, Doolittle LK, Schneider MW, Jensen LE, Gamarra N, Henry L, Rosen MK (2019) Organization of chromatin by intrinsic and regulated phase separation. Cell 179(2):470–484. e421
Gong Z, Wu Y, Koblížková A, Torres G A, Wang K, Iovene M, Jiang J (2012) Repeatless and repeat-based centromeres in potato: implications for centromere evolution. Plant Cell 24(9):3559–3574
Gopalakrishnan S, Sullivan BA, Trazzi S, Della Valle G, Robertson KDJHMG (2009) DNMT3B interacts with constitutive centromere protein CENP-C to modulate DNA methylation and the histone code at centromeric regions. Human Mol Genet 18(17):3178–3193
Grenfell AW, Heald R, Strzelecka M (2016) Mitotic Noncoding RNA Processing Promotes Kinetochore and Spindle Assembly in Xenopus. J Cell Biol 214(2):133–141
Hall IM, Shankaranarayana GD, Noma K-I, Ayoub N, Cohen A, Grewal SI (2002) Establishment and maintenance of a heterochromatin domain. Science 297(5590):2232–2237
Hall LL, Byron M, Carone DM, Whitfield TW, Pouliot GP, Fischer A, Lawrence JB (2017) Demethylated HSATII DNA and HSATII RNA foci sequester PRC1 and MeCP2 into cancer-specific nuclear bodies. Cell Rep 18(12):2943–2956
Hamperl S, Bocek MJ, Saldivar JC, Swigut T, Cimprich KAJC (2017) Transcription-replication conflict orientation modulates R-loop levels and activates distinct DNA damage responses. Cell 170(4):774-786.e19
Hansen AS, Hsieh THS, Cattoglio C, Pustova I, Saldaña-Meyer R, Reinberg D, ..., Tjian R (2019) Distinct classes of chromatin loops revealed by deletion of an RNA-binding region in CTCF. Molecular Cell 76(3):395–411.e313
Hartley G, O’Neill RJ (2019) Centromere repeats: hidden gems of the genome. Genes 10(3):223
Hedouin S, Logsdon GA, Underwood JG, Biggins S (2022) A transcriptional roadblock protects yeast centromeres. Nucleic Acids Res 50(14):7801–7815
Henikoff S, Ahmad K, Malik HS (2001) The centromere paradox: Stable inheritance with rapidly evolving DNA. Science 293(5532):1098–1102
Hentze MW, Castello A, Schwarzl T, Preiss T (2018) A brave new world of RNA-binding proteins. Nature Rev Mol Cell Biol 19(5):327–341
Hernández-Saavedra D, Strakovsky RS, Ostrosky-Wegman P, Pan YX (2017) Epigenetic regulation of centromere chromatin stability by dietary and environmental factors. Adv Nutr 8(6):889–904
Hezroni H, Koppstein D, Schwartz MG, Avrutin A, Bartel DP, Ulitsky IJCR (2015) Principles of long noncoding RNA evolution derived from direct comparison of transcriptomes in 17 species. Cell Rep 11(7):1110–1122
Hill A, Bloom K (1987) Genetic manipulation of centromere function. Mol Cell Biol 7(7):2397–2405
Hofstatter PG, Thangavel G, Lux T, Neumann P, Vondrak T, Novak P, ..., Marques A (2022) Repeat-based holocentromeres influence genome architecture and karyotype evolution. Cell 185(17):3153–3168.e3118
Hori T, Shang WH, Toyoda A, Misu S, Monma N, Ikeo K, ..., Fukagawa T (2014) Histone H4 Lys 20 monomethylation of the CENP-A nucleosome is essential for kinetochore assembly. Developmental Cell 29(6):740–749
Ishii M, Akiyoshi B (2022) Plasticity in centromere organization and kinetochore composition: Lessons from diversity. Curr Opin Cell Biol 74:47–54
Ishii K, Ogiyama Y, Chikashige Y, Soejima S, Masuda F, Kakuma T, ..., Takahashi K (2008) Heterochromatin integrity affects chromosome reorganization after centromere dysfunction. Science 321(5892):1088–1091
Jain M, Olsen HE, Turner DJ, Stoddart D, Bulazel KV, Paten B, ..., Miga KH (2018) Linear assembly of a human centromere on the Y chromosome. Nature Biotechnol 36(4):321–323
Jiang F, Zhang J, Liu Q, Liu X, Wang H, He J, Kang LJRB (2019) Long-read direct RNA sequencing by 5′-Cap capturing reveals the impact of Piwi on the widespread exonization of transposable elements in locusts. RNA Biol 16(7):950–959
Jin Q-W, Trelles-Sticken E, Scherthan H, Loidl J (1998) Yeast nuclei display prominent centromere clustering that is reduced in nondividing cells and in meiotic prophase. J Cell Biol 141(1):21–29
Johnson WL, Yewdell WT, Bell JC, McNulty SM, Duda Z, O’Neill RJ, Straight AF (2017) RNA-dependent stabilization of SUV39H1 at constitutive heterochromatin. Elife 6: e25299
Kabeche L, Nguyen HD, Buisson R, Zou L (2018) A mitosis-specific and r loop-driven atr pathway promotes faithful chromosome segregation. Science 359(6371):108–114
Kagansky A, Folco HD, Almeida R, Pidoux AL, Boukaba A, Simmer F, ..., Allshire RC (2009) Synthetic heterochromatin bypasses RNAi and centromeric repeats to establish functional centromeres. Science 324(5935):1716–1719
Karmodiya K, Krebs AR, Oulad-Abdelghani M, Kimura H, Tora LJBG (2012) H3K9 and H3K14 acetylation co-occur at many gene regulatory elements, while H3K14ac marks a subset of inactive inducible promoters in mouse embryonic stem cells. BMC Genom 13:1–18
Ketel C, Wang HS, McClellan M, Bouchonville K, Selmecki A, Lahav T, ..., Berman J (2009) Neocentromeres form efficiently at multiple possible loci in Candida albicans. PLoS Genetics 5(3):e1000400
Kixmoeller K, Allu PK, Black BE (2020) The centromere comes into focus: From CENP-A nucleosomes to kinetochore connections with the spindle. Open Biol 10(6):200051
Kochan J, Wawro M, Kasza AJB (2015) Simultaneous detection of mRNA and protein in single cells using immunofluorescence-combined single-molecule RNA FISH. BioTechniques 59(4):209–221
Krasikova A, Fukagawa T, Zlotina A (2012) High-resolution mapping and transcriptional activity analysis of chicken centromere sequences on giant lampbrush chromosomes. Chromosome Res 20:995–1008
Kulish D, Struhl K (2001) TFIIS enhances transcriptional elongation through an artificial arrest site in vivo. Mol Cell Biol 21(13):4162–4168
Kundaje A, Meuleman W, Ernst J, Bilenky M, Yen A, Heravi-Moussavi A, Ziller MJ (2015) Integrative analysis of 111 reference human epigenomes. Nature 518(7539):317–330
Kvint K, Uhler JP, Taschner MJ, Sigurdsson S, Erdjument-Bromage H, Tempst P, Svejstrup JQJMC (2008) Reversal of RNA polymerase II ubiquitylation by the ubiquitin protease Ubp3. Mol Cell 30(4):498–506
Lehnertz B, Ueda Y, Derijck AA, Braunschweig U, Perez-Burgos L, Kubicek S, ..., Peters AH (2003) Suv39h-mediated histone H3 lysine 9 methylation directs DNA methylation to major satellite repeats at pericentric heterochromatin. Curr Biol 13(14):1192–1200
Lennartsson A, Ekwall K (2009) Histone modification patterns and epigenetic codes. Biochimica et biophysica acta (BBA)-general subjects 1790(9):863–868
Li B, Jackson J, Simon MD, Fleharty B, Gogol M, Seidel C, Shilatifard AJJOBC (2009) Histone H3 lysine 36 dimethylation (H3K36me2) is sufficient to recruit the Rpd3s histone deacetylase complex and to repress spurious transcription. J Biol Chem 284(12):7970–7976
Li X, Yang L, Chen L-LJMC (2018) The biogenesis, functions, and challenges of circular RNAs. Mol Cell 71(3):428–442
Lin Z, Xie Y, Nong W, Ren X, Li R, Zhao Z, ..., Yuen KWY (2021) Formation of artificial chromosomes in Caenorhabditis elegans and analyses of their segregation in mitosis, DNA sequence composition and holocentromere organization. Nucleic Acids Res 49(16):9174–9193
Ling YH, Yuen KWY (2019) Point centromere activity requires an optimal level of centromeric noncoding RNA. Proceedings of the National Academy of Sciences 116(13):6270–6279
Liu H, Qu Q, Warrington R, Rice A, Cheng N, Yu HJMC (2015) Mitotic transcription installs Sgo1 at centromeres to coordinate chromosome segregation. Mol Cell 59(3):426–436
Liu Y, Su H, Zhang J, Liu Y, Feng C, Han F (2020) Back-spliced rna from retrotransposon binds to centromere and regulates centromeric chromatin loops in maize. PLoS Biol 18(1):e3000582
Liu Y, Liu Q, Su H, Liu K, Xiao X, Li W, Han F (2021) Genome-wide mapping reveals R-loops associated with centromeric repeats in maize. Genome Res 31(8):1409–1418. https://doi.org/10.1101/gr.275270.121
Lu WT, Hawley BR, Skalka GL, Baldock RA, Smith EM, Bader AS, ..., Bushell M (2018) Drosha drives the formation of DNA: RNA hybrids around DNA break sites to facilitate DNA repair. Nature Communications 9(1):1–13
Maldonado R, Schwartz U, Silberhorn E, Längst GJMC (2019) Nucleosomes stabilize ssRNA-dsDNA triple helices in human cells. Mol Cell 73(6):1243-1254.e1246
Marques A, Ribeiro T, Neumann P, Macas J, Novák P, Schubert V, ..., Houben A (2015) Holocentromeres in Rhynchospora are associated with genome-wide centromere-specific repeat arrays interspersed among euchromatin. Proceedings of the National Academy of Sciences 112(44):13633–13638
Martinez-Rucobo FW, Cramer P (2013) Structural basis of transcription elongation. Biochimica et Biophysica Acta (BBA)-Gene Regulatory Mechanisms 1829(1):9–19
McGuinness BE, Hirota T, Kudo NR, Peters J-M, Nasmyth K (2005) Shugoshin prevents dissociation of cohesin from centromeres during mitosis in vertebrate cells. PLoS Biol 3(3):e86
McNulty SM, Sullivan BA (2018) Alpha satellite DNA biology: Finding function in the recesses of the genome. Chromosome Res 26:115–138
McNulty SM, Sullivan LL, Sullivan BA (2017) Human centromeres produce chromosome-specific and array-specific alpha satellite transcripts that are complexed with CENP-A and CENP-C. Dev Cell 42(3):226–240.e226. https://doi.org/10.1016/j.devcel.2017.07.001
Mellone BG, Grive KJ, Shteyn V, Bowers SR, Oderberg I, Karpen GH (2011) Assembly of drosophila centromeric chromatin proteins during mitosis. PLoS Genetics 7(5):e1002068
Melters DP, Paliulis LV, Korf IF, Chan SWJCR (2012) Holocentric chromosomes: convergent evolution, meiotic adaptations, and genomic analysis. Chromosome Res 20:579–593
Memczak S, Jens M, Elefsinioti A, Torti F, Krueger J, Rybak A, ..., Rajewsky N (2013) Circular RNAs are a large class of animal RNAs with regulatory potency. Nature 495(7441):333–338
Miga KH, Koren S, Rhie A, Vollger MR, Gershman A, Bzikadze A, ..., Phillippy AM (2020) Telomere-to-telomere assembly of a complete human X chromosome. Nature 585(7823):79–84
Mishra PK, Chakraborty A, Yeh E, Feng W, Bloom KS, Basrai MA (2021) R-Loops at centromeric chromatin contribute to defects in kinetochore integrity and chromosomal instability in budding yeast. Mol Biol Cell 32(1):74–89
Molina O, Vargiu G, Abad MA, Zhiteneva A, Jeyaprakash AA, Masumoto H, ..., Earnshaw WC (2016) Epigenetic engineering reveals a balance between histone modifications and transcription in kinetochore maintenance. Nature Communications 7(1):1–16
Mravinac B, Sullivan LL, Reeves JW, Yan CM, Kopf KS, Farr C J, ..., Sullivan BA (2009) Histone modifications within the human X centromere region. PloS One 4(8):e6602
Nakama M, Kawakami K, Kajitani T, Urano T, Murakami Y (2012) DNA–RNA hybrid formation mediates RNAi-directed heterochromatin formation. Genes Cells 17(3):218–233. https://doi.org/10.1111/j.1365-2443.2012.01583.x
Nam J-W, Bartel DP (2012) Long Noncoding RNAs in C. elegans. Genome Res 22(12):2529–2540
Naughton C, Huidobro C, Catacchio CR, Buckle A, Grimes GR, Nozawa RS, ..., Gilbert N (2022) Human centromere repositioning activates transcription and opens chromatin fibre structure. Nature Communications 13(1):5609
Neumann P, Yan H, Jiang J (2007) The centromeric retrotransposons of rice are transcribed and differentially processed by RNA interference. Genetics 176(2):749–761
Ng R, Ness J, Carbon J (1986) Structural studies on centromeres in the yeast Saccharomyces cerevisiae. Basic Life Sci 40:479–492
Ninomiya K, Iwakiri J, Aly MK, Sakaguchi Y, Adachi S, Natsume T, Hirose T (2021) m(6) A modification of HSATIII lncRNAs regulates temperature-dependent splicing. Embo J 40(15):e107976. https://doi.org/10.15252/embj.2021107976
Nurk S, Koren S, Rhie A, Rautiainen M, Bzikadze AV, Mikheenko A, ..., Phillippy AM (2022) The complete sequence of a human genome. Science 376(6588):44–53
Ohkuni K, Kitagawa K (2012) Role of transcription at centromeres in budding yeast. Transcription 3(4):193–197
Ohzeki JI, Bergmann JH, Kouprina N, Noskov VN, Nakano M, Kimura H, ..., Masumoto H (2012) Breaking the HAC barrier: Histone H3K9 acetyl/methyl balance regulates CENP‐A assembly. EMBO J 31(10):2391–2402
Okada M, Okawa K, Isobe T, Fukagawa TJMBOTC (2009) CENP-H-containing complex facilitates centromere deposition of CENP-A in cooperation with FACT and CHD1. Mol Biol Cell 20(18):3986–3995
Perea-Resa C, Blower MD (2018) Centromere biology: Transcription goes on stage. Mol Cell Biol 38(18):e00263-e218
Potapova T, Gorbsky GJ (2017) The Consequences of chromosome segregation errors in mitosis and meiosis. Biology 6(1):12
Provost P, Silverstein RA, Dishart D, Walfridsson J, Djupedal I, Kniola B, Ekwall K (2002) Dicer is required for chromosome segregation and gene silencing in fission yeast cells. Proc Nat Acad Sci 99(26):16648–16653
Quénet D, Dalal Y (2014) A long non-coding RNA is required for targeting centromeric protein A to the human centromere. Elife 3:e26016
Racca C, Britton S, Hédouin S, Francastel C, Calsou P, Larminat F (2021) BRCA1 prevents R-loop-associated centromeric instability. Cell Death Dis 12(10):896. https://doi.org/10.1038/s41419-021-04189-3
Rada-Iglesias AJNG (2018) Is H3K4me1 at enhancers correlative or causative? Nat Genet 50(1):4–5
Raj A, Van Den Bogaard P, Rifkin SA, Van Oudenaarden A, Tyagi S (2008) Imaging individual mRNA molecules using multiple singly labeled probes. Nature Methods 5(10):877–879
Ribeiro SA, Vagnarelli P, Dong Y, Hori T, McEwen BF, Fukagawa T, ..., Earnshaw WC (2010) A super-resolution map of the vertebrate kinetochore. Proceedings of the National Academy of Sciences 107(23):10484–10489
Rošić S, Köhler F, Erhardt S (2014) Repetitive centromeric satellite RNA is essential for kinetochore formation and cell division. J Cell Biol 207(3):335–349
Rougeulle C, Chaumeil J, Sarma K, Allis CD, Reinberg D, Avner P, ..., Heard E (2004) Differential histone H3 Lys-9 and Lys-27 methylation profiles on the X chromosome. Mol Cell Biol 24(12):5475–5484
Saffery R, Sumer H, Hassan S, Wong LH, Craig JM, Todokoro K, ..., Choo KA (2003) Transcription within a functional human centromere. Mol Cell 12(2):509–516
Saldaña-Meyer R, Rodriguez-Hernaez J, Escobar T, Nishana M, Jácome-López K, Nora EP, ..., Reinberg D (2019) RNA interactions are essential for CTCF-mediated genome organization. Mol Cell 76(3):412–422.e415
Salzman J, Gawad C, Wang PL, Lacayo N, Brown PO (2012) Circular RNAs are the predominant transcript isoform from hundreds of human genes in diverse cell types. PloS One 7(2):e30733
Santaguida S, Amon A (2015) Short-and long-term effects of chromosome mis-segregation and aneuploidy. Nature Rev Mol Cell Biol 16(8):473–485
Sanyal K, Baum M, Carbon J (2004) Centromeric DNA sequences in the pathogenic yeast candida albicans are all different and unique. Proceedings of the National Academy of Sciences 101(31):11374–11379
Schubert V, Neumann P, Marques A, Heckmann S, Macas J, Pedrosa-Harand A, ..., Houben A (2020) Super-resolution microscopy reveals diversity of plant centromere architecture. Int J Mol Sci 21(10):3488
Schwartz S, Meshorer E, Ast G (2009) Chromatin organization marks exon-intron structure. Nat Struct Mol Biol 16(9):990–995
Scott KC, Sullivan BAJTIG (2014) Neocentromeres: a place for everything and everything in its place. Trend Genet 30(2):66–74
Senaratne AP, Muller H, Fryer KA, Kawamoto M, Katsuma S, Drinnenberg IAJCB (2021) Formation of the CenH3-deficient holocentromere in Lepidoptera avoids active chromatin. Curr Biol 31(1):173-181.e177
Shandilya J, Senapati P, Hans F, Menoni H, Bouvet P, Dimitrov S, ..., Kundu TK (2014) Centromeric histone variant CENP-A represses acetylation-dependent chromatin transcription that is relieved by histone chaperone NPM1. J Biochem 156(4):221–227
Shang WH, Hori T, Toyoda A, Kato J, Popendorf K, Sakakibara Y, ..., Fukagawa T (2010) Chickens possess centromeres with both extended tandem repeats and short non-tandem-repetitive sequences. Genome Res 20(9):1219–1228
Shang WH, Hori T, Westhorpe FG, Godek KM, Toyoda A, Misu S, ..., Fukagawa T (2016) Acetylation of histone H4 lysine 5 and 12 is required for CENP-A deposition into centromeres. Nature Communications 7(1):1–13
Sharma U, Stefanova D, Holmes SG (2013) Histone variant H2A. Z functions in sister chromatid cohesion in saccharomyces cerevisiae. Mol Cell Biol 33(17):3473–3481
Smurova K, De Wulf PJFIG (2018) Centromere and pericentromere transcription: roles and regulation in sickness and in health. Front Genet 9:674
Soares LM, He PC, Chun Y, Suh H, Kim T, Buratowski SJMC (2017) Determinants of histone H3K4 methylation patterns. Mol Cell 68(4):773–785.e776
Sridhar B, Rivas-Astroza M, Nguyen TC, Chen W, Yan Z, Cao X, Zhong S (2017) Systematic mapping of RNA-chromatin interactions in vivo. Curr Biol 27(4):602–609
Stolz R, Sulthana S, Hartono SR, Malig M, Benham CJ, Chedin F (2019) Interplay between DNA sequence and negative superhelicity drives R-loop structures. Proc Natl Acad Sci 116(13):6260–6269. https://doi.org/10.1073/pnas.1819476116
Sullivan BA, Karpen GH (2004) Centromeric Chromatin Exhibits a Histone Modification Pattern That is Distinct from Both Euchromatin and Heterochromatin. Nat Struct Mol Biol 11(11):1076–1083
Talbert PB, Henikoff S (2020) What makes a centromere? Exp Cell Res 389(2):111895
Talbert PB, Kasinathan S, Henikoff S. (2018) Simple and Complex Centromeric Satellites in Drosophila Sibling Species. Genetics 208(3):977–990
Thakur J, Sanyal K (2013) Efficient neocentromere formation is suppressed by gene conversion to maintain centromere function at native physical chromosomal loci in candida albicans. Genome Res 23(4):638–652
Thakur J, Henikoff SJG, Development (2018) Unexpected conformational variations of the human centromeric chromatin complex. Genes Dev 32(1):20–25
Topp CN, Zhong CX, Dawe RK (2004) Centromere-encoded RNAs are integral components of the maize kinetochore. Proc Natl Acad Sci U S A 101(45):15986–15991. https://doi.org/10.1073/pnas.0407154101
Trivedi P, Stukenberg PT (2016) A centromere-signaling network underlies the coordination among mitotic events. Trends in Biochem Sci 41(2):160–174
Vijayakumari D, Sharma AK, Bawa PS, Kumar R, Srinivasan S, Vijayraghavan U (2019) Early splicing functions of fission yeast Prp16 and its unexpected requirement for gene silencing is governed by intronic features. RNA Biol 16(6):754–769
Volpe TA, Kidner C, Hall IM, Teng G, Grewal SI, Martienssen RA (2002) Regulation of heterochromatic silencing and histone H3 lysine-9 methylation by RNAi. Science 297(5588):1833–1837
Vourc'h C, Dufour S, Timcheva K, Seigneurin-Berny D, Verdel A (2022) HSF1-activated non-coding stress response: Satellite lncRNAs and beyond, an emerging story with a complex scenario. Genes (Basel):13(4). https://doi.org/10.3390/genes13040597
Wang Z, Zang C, Rosenfeld JA, Schones DE, Barski A, Cuddapah S, ..., Zhao K (2008) Combinatorial patterns of histone acetylations and methylations in the human genome. Nature Genetics 40(7):897–903
Weinberg DN, Papillon-Cavanagh S, Chen H, Yue Y, Chen X, Rajagopalan KN, ..., Lu C (2019) The histone mark H3K36me2 recruits DNMT3A and shapes the intergenic DNA methylation landscape. Nature 573(7773):281–286
Westholm JO, Miura P, Olson S, Shenker S, Joseph B, Sanfilippo P, ..., Lai EC (2014) Genome-wide analysis of Drosophila circular RNAs reveals their structural and sequence properties and age-dependent neural accumulation. Cell Reports 9(5):1966–1980
Wilkinson ME, Charenton C, Nagai K (2020) RNA splicing by the spliceosome. Annu Rev Biochem 89(1):359–388
Williams BC, Murphy TD, Goldberg ML, Karpen GH (1998) Neocentromere activity of structurally acentric mini-chromosomes in drosophila. Nature Genetics 18(1):30–38
Wong CYY, Ling YH, Mak JKH, Zhu J, Yuen KWY (2020) Lessons from the extremes: Epigenetic and genetic regulation in point monocentromere and holocentromere establishment on artificial chromosomes. Exp Cell Res 390(2):111974
Wong CYY, Yuen KWYY (2022) CSR-1 RNA interference pathway restricts holocentromere protein CENP-A/HCP-3 localization in Caenorhabditis elegans. https://doi.org/10.1101/2022.05.26.493264
Wong LH, Brettingham-Moore KH, Chan L, Quach JM, Anderson M A, Northrop EL, Choo K A (2007) Centromere RNA is a key component for the assembly of nucleoproteins at the nucleolus and centromere. Genome Res 17(8):1146–1160
Xiao W, Adhikari S, Dahal U, Chen YS, Hao YJ, Sun BF, Yang YG (2016) Nuclear m(6)A reader YTHDC1 regulates mRNA splicing. Mol Cell 61(4):507–519. https://doi.org/10.1016/j.molcel.2016.01.012
Xu W, Xu H, Li K, Fan Y, Liu Y, Yang X, Sun Q (2017) The R-Loop is a Common Chromatin Feature of the Arabidopsis Genome. Nature Plants 3(9):704–714
Ye CY, Chen L, Liu C, Zhu QH, Fan L (2015) Widespread Noncoding Circular RNA s in Plants. New Phytologist 208(1):88–95
Yilmaz D, Furst A, Meaburn K, Lezaja A, Wen Y, Altmeyer M, Soutoglou E (2021) Activation of homologous recombination in G1 preserves centromeric integrity. Nature 600(7890):748–753. https://doi.org/10.1038/s41586-021-04200-z
Yu K, Chedin F, Hsieh C-L, Wilson TE, Lieber MR (2003) R-Loops at immunoglobulin class switch regions in the chromosomes of stimulated B cells. Nature Immunol 4(5):442–451
Zhang K, Hodge J, Chatterjee A, Moon TS, Parker KM (2021) Duplex structure of double-stranded RNA provides stability against hydrolysis relative to single-stranded RNA. Environ Sci Technol 55(12):8045–8053. https://doi.org/10.1021/acs.est.1c01255
Zhang Q, McKenzie NJ, Warneford-Thomson R, Gail EH, Flanigan SF, Owen M, Davidovich C (2019) RNA exploits an exposed regulatory site to inhibit the enzymatic activity of PRC2. Nature Struct Mol Biol 26(3):237–247
Zhu J, Cheng KCL, Yuen KWY (2018a) Histone H3K9 and H4 acetylations and transcription facilitate the initial CENP-AHCP-3 deposition and de novo centromere establishment in Caenorhabditis elegans artificial chromosomes. Epigenetics Chromatin 11(1):1–20
Zhu Q, Pao GM, Huynh AM, Suh H, Tonnu N, Nederlof PM, ..., Verma IM (2011) BRCA1 tumour suppression occurs via heterochromatin-mediated silencing. Nature 477(7363):179–184
Zhu Q, Hoong N, Aslanian A, Hara T, Benner C, Heinz S, Verma IM (2018b) Heterochromatin-encoded satellite RNAs induce breast cancer. Mol Cell 70(5):842–853.e847. https://doi.org/10.1016/j.molcel.2018.04.023
Acknowledgements
We thank the Yuen lab for the discussion.
Funding
This work was supported by Ministry of Science and Technology of the People’s Republic of China (project number 2021YFA0909300); Research Grants Council, University Grants Committee (project number 17116520); and Research Grants Council, University Grants Committee (project number C7058-18G).
Author information
Ethics declarations
Ethics approval
Not applicable.
Consent to participate
Not applicable.
Consent for publication
Not applicable.
Conflict of interest
The authors declare no competing interests.
Additional information
Publisher's note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Zhu, J., Guo, Q., Choi, M. et al. Centromeric and pericentric transcription and transcripts: their intricate relationships, regulation, and functions. Chromosoma 132, 211–230 (2023). https://doi.org/10.1007/s00412-023-00801-x
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s00412-023-00801-x