Previous Article | Next Article ![]()
Eukaryotic Cell, March 2007, p. 361-377, Vol. 6, No. 3
1535-9778/07/$08.00+0 doi:10.1128/EC.00296-06
Copyright © 2007, American Society for Microbiology. All Rights Reserved.
,
,
Avraham Hury,
Ehud Hoze,
Shai Uliel,
Inna Myslyuk,
Avihay Apatoff,
Ron Unger, and
Shulamit Michaeli*
The Mina & Everard Goodman Faculty of Life Sciences, Bar-Ilan University, Ramat-Gan 52900, Israel
Received 13 September 2006/ Accepted 7 December 2006
|
|
|---|
|
|
|---|
In most eukaryotes, the snoRNAs that guide pseudouridylation consist of two hairpin domains connected by a single-stranded hinge, the H (AnAnnA) domain, and by a tail, the ACA box. Two short rRNA recognition motifs of the snoRNA that base pair with rRNA sequences flanking the uridine to be converted to pseudouridine have been characterized. The pseudouridine is always located 14 to 16 nt upstream from the H box or the ACA box of the snoRNA (14, 36). The two hairpin loops share structural and functional similarities, and functional pseudouridylation pockets are found with equal frequencies at the 5' and 3' ends of the molecule; in many cases, the RNAs direct pseudouridylation of rRNA at two different sites (14).
Most of the C/D and H/ACA box RNAs characterized to date are from humans, Saccharomyces cerevisiae, plants, and archaea (2, 14, 33, 37, 39). Only recently has information on these RNAs in unicellular eukaryotic organisms, such as the amoeba Dictyostelium discoideum (1) and the diplomonad Giardia lamblia, become available (46).
Trypanosomatids are unicellular parasitic protozoa which are the causative agents of several infamous parasitic diseases, such as African trypanosomiasis, caused by Trypanosoma brucei; Chagas' disease, caused by Trypanosoma cruzi; and leishmaniasis, caused by Leishmania species. The genomes of these three trypanosomatids were recently published, and their DNA sequences highlight many unique features of these organisms (5, 12, 16). Trypanosomatids are unique because they engage in two unusual RNA processing events, namely, trans-splicing (21) and mitochondrial RNA editing (34). In addition, the large subunit rRNA undergoes trypanosome-specific cleavages during rRNA maturation, yielding two large rRNA molecules and four small RNAs, ranging in size from 76 to 226 nt (43). Most relevant to this study are the Leishmania parasites. Leishmania spp. are obligatory intracellular parasites that cause a spectrum of human diseases, with an annual incidence of 2 million cases in 88 countries (16). The parasite cycles between two hosts, namely, the phagolysosomes of mammalian macrophages and the midguts of sand flies. In the insect host, Leishmania parasites grow as flagellated extracellular promastigotes; in the mammalian host, they proliferate as aflagellated intracellular amastigotes. The Leishmania species are divided into Old World Leishmania spp., including species such as Leishmania infantum, Leishmania donovani, and Leishmania major, and New World Leishmania spp., including species such as Leishmania mexicana and Leishmania braziliensis. The gene order and sequence are highly conserved among the 30 known Leishmania species, and diagnostic tools are in demand to distinguish between these related species (16).
While relatively little is known about snoRNAs in trypanosomatids, some unique features were found for snoRNAs in these species (11, 38). Most, if not all, trypanosome H/ACA RNAs are composed of a single hairpin RNA and carry an AGA box instead of an ACA box (22, 38). The first discovered trypanosome H/ACA-like RNA, the spliced leader-associated RNA 1 (SLA1), guides modification on a short-lived RNA (25), the spliced leader RNA (SL RNA). This RNA is the donor of the spliced leader sequence to all trypanosome mRNAs (21). Silencing of the pseudouridine synthase (CBF5) by RNA interference in T. brucei provided evidence for the role of SLA1 in trans-splicing (4). We proposed that SLA1 has a unique chaperone function and escorts the SL RNA early in its biogenesis until it is assembled with Sm proteins (4).
Most recently, using bioinformatics and experimental tools, we performed a genome-scale analysis of snoRNAs that guide methylations and pseudouridylations on rRNAs in T. brucei (24). Our data suggested that most snoRNAs are clustered in reiterated repeats that carry a mixed population of C/D and H/ACA-like RNAs. Predicting the modifications guided by these RNAs and using partial mapping data, we identified 84 2'-O-methyls and 32 pseudouridines on rRNA, suggesting a high occurrence of Nms compared to pseudouridines on rRNA (24). Many of these modifications are species specific and increase modifications at domains which are already modification-rich. About 40% of the trypanosome-specific modifications are situated in unique positions outside the highly conserved domains of the rRNA (24).
In this study, we expanded our analysis to Leishmania species. By searching homologues of the T. brucei snoRNA genes and then evaluating the clusters, we identified 23 clusters carrying 62 C/D snoRNAs and 37 H/ACA-like RNAs. Several snoRNAs that were not revealed in T. brucei were identified here. However, in general, the pattern of Nm modifications is highly conserved between L. major and T. brucei. The L. major clusters are highly repeated compared to those of T. brucei. Surprisingly, the expression of snoRNAs in high-copy-number clusters is not necessarily abundant. Factors that may affect the abundance of C/D snoRNAs, such as copy number, the potential to form the K-turn motif, intragenic and/or extragenic base pairing, and conservation of the boxes, were examined. Sequence conservation of H/ACAs among Leishmania species and other trypanosomatids allowed us to identify structural features which are conserved in these RNAs and to highlight the high degree of phylogenetic conservation among related species for structure-function analysis of these guide RNAs.
|
|
|---|
Prediction of targets in rRNA. The potential targets (for 2'-O-methylation) in rRNA were determined using the computer program BestFit (from the GCG package), which searches for the optimal local alignment between two sequences. For this study, the program was used to search for complementarity to rRNA that complies with the +5 guiding rule. Additionally, the targets were also predicted based on the data available from their yeast homologues. To predict the pseudouridines guided by H/ACA RNAs, the secondary structure of H/ACA RNA was predicted using the mfold program (http://www.bioinfo.rpi.edu/applications/mfold/old/rna/form1.cgi), and the sequences from the internal loop were used to search for complementarity with rRNA, based on the guiding rules established for yeast, mammals, and plants (http://www.bio.umass.edu/biochem/rna-sequence/Yeast_snoRNA_Database/snoRNA_DataBase.html; http://bioinf.scri.sari.ac.uk/cgi-bin/plant_snorna/conservation).
Mapping of modified nucleotides. 2'-O-Methylations on rRNA were mapped by primer extension with different concentrations of deoxynucleoside triphosphates (dNTPs), as described by Xu et al. (45), using primers specific to the relevant region of rRNA. Primer extension products were analyzed in 6% polyacrylamide-7 M urea gel, next to sequencing reactions performed using the same primer.
H/ACA snoRNA folding and conservation in trypanosomatids. The L. major H/ACA snoRNA secondary structure was predicted using the mfold program (http://www.bioinfo.rpi.edu/applications/mfold/old/rna/form1.cgi). Homologues for the selected H/ACA snoRNA were screened from the T. brucei genome (http://www.genedb.org/genedb/tryp/), the L. braziliensis genome (http://www.genedb.org/genedb/lbraziliensis/), and the L. infantum genome (http://www.genedb.org/genedb/linfantum/) and were subjected to multiple alignment (http://www.ebi.ac.uk/clustalw/).
RNA preparation and primer extension analysis. RNAs were prepared from T. brucei cells, using TRI Reagent (Sigma). Primer extension analysis was performed as described previously (22, 45), using 5'-end-labeled oligonucleotides specific to target RNAs, as indicated in the figure legends. The extension products were analyzed in 6% polyacrylamide-7 M urea gels and visualized by autoradiography.
Northern analysis. Total RNA (20 µg/lane) was fractionated in a 10% polyacrylamide gel containing 7 M urea. RNAs were transferred to a nylon membrane (Hybond; Amersham Biosciences) and probed with [32P]ATP-labeled oligonucleotides.
|
|
|---|
![]() View larger version (42K): [in a new window] |
FIG. 1. Schematic representation of snoRNA clusters in L. major. (A) Nomenclature of L. major snoRNAs. (B) snoRNA clusters. The C/D snoRNAs are shown as pink boxes, whereas H/ACA-like snoRNAs are shown as green boxes. Thin lines indicate intergenic regions, and their lengths are indicated below the lines. The lengths of the snoRNA genes (±3 bp) are indicated above the boxes. The number on the right side of each cluster indicates the number of times the cluster is repeated in the genome database. The small numbers below the name of each cluster indicate the position of the cluster in the genome database in L. major GeneDB, release 5 (http://www.genedb.org/genedb/leish). A minus sign indicates the chromosomal location on an opposite strand. Parentheses indicate tandem repeats of the cluster.
|
To examine the validity of the copy numbers of the different clusters, we searched the database of L. major containing the raw sequencing data obtained from the shotgun clones (ftp:/ftp.sanger.ac.uk/pub/databases/L.major_sequences/shotgun_READS/), using BLASTN. This analysis enabled us to determine the relative abundance of each snoRNA in the genome. The number of independent clones found in the shotgun library should reflect the number of times this gene is repeated in the genome. Indeed, a correlation of 0.77 was found between the information in the shotgun library and the data found in the published genome, suggesting that the information in the genome was valid for determination of the copy numbers of the snoRNAs within each cluster (not shown). Although in certain cases the information in the published genome may not reflect the exact repeat structure, a comparative analysis among different repeats can be used to estimate the relative copy number of each of the snoRNAs we identified. Inspecting the chromosomal locations of the snoRNAs suggests that their genes reside mainly on chromosomes 14 to 36, except for one cluster that was found on chromosome 5, carrying the SLA1 locus.
The clusters were always found flanked by protein coding genes. As observed in T. brucei, the distance between the cluster and the upstream open reading frame (>650 bp) is generally larger than the distance between the 3' end of the cluster and the downstream open reading frame (
200 bp). This observation may suggest the existence of a regulatory region upstream of the cluster that may have promoter-like activity, as recently described for the monogenetic trypanosomatid Leptomonas collosoma (23).
Repertoire and properties of L. major C/D and H/ACA RNAs and their potential targets. The repertoire of the C/D snoRNAs is illustrated in Table 1. All of the C/D snoRNAs range in size from 72 to 148 nt. Their potential targets were predicted using bioinformatic tools (BestFit from GCG software) and are presented in Figure S1A in the supplemental material. Of the 62 C/D snoRNAs, 20 have the potential to guide two modification sites. The percentage of snoRNAs that can potentially guide two modifications (double guiders) is slightly lower than that in T. brucei, where there are 27 double guiders out of 57 snoRNAs. We were able to predict targets for all C/D snoRNAs except four. In addition, we identified 14 novel C/D snoRNAs whose homologues were not identified in T. brucei. Of the 62 snoRNAs, 31 have homologues in other eukaryotes (Table 1). As already published for the T. brucei snoRNAs, we found that the 5' end of C/D snoRNA is located 1 to 5 nt upstream from the C box (24). We therefore provide the sequence of the 5 nt upstream of the C box. Based on the experimental data for T. brucei, the 3' ends of C/D molecules were usually found 1 to 3 nt downstream from the D box (11); we therefore provide the sequence of the 3 nt downstream from this box (Table 1).
|
View this table: [in a new window] |
TABLE 1. L. major C/D snoRNAsa
|
|
View this table: [in a new window] |
TABLE 2. L. major H/ACA-like RNAsa
|
![]() View larger version (26K): [in a new window] |
FIG. 2. Conservation of H/ACA secondary structures between different trypanosomatid species. The proposed H/ACA-like snoRNA secondary structure was determined by mfold, taking into consideration the complementarity with the target and the distance between AGA and . The sequences of L. major snoRNAs are given, and differences from the L. major sequences are marked in colored circles for T. brucei (red), T. cruzi (green), and L. infantum (yellow). Deletions are indicated by triangles. The species designations are as follows: TB, Trypanosoma brucei; TC, Trypanosoma cruzi; LM, Leishmania major; and Linfant, Leishmania infantum. The AGA box and the conserved nucleotide upstream from stem I are boxed. A schematic representation of the H/ACA consensus pattern is boxed, and the structural motifs are marked.
|
![]() View larger version (53K): [in a new window] |
FIG. 3. Sequence alignment of snoRNA clusters from different Leishmania species. Multiple alignment of the various Leishmania species was performed using CLUSTAL W on the entire LM34Cs1 cluster. The conserved nucleotides are shown as dots, and gaps are indicated by hyphens. The snoRNA consensus sequence is marked below in bold. The coding regions are shown in capital letters, while intragenic regions are marked in lowercase letters. The species designations are as follows: Lmajor, Leishmania major (LmjF34.snoRNA.0003); Linfantum, Leishmania infantum (LinJ34_20050901_V2.0); and Lbrasiliensis, Leishmania braziliensis (brazil865f07.p1k).
|
![]() View larger version (58K): [in a new window] |
FIG. 4. Conservation of modified Nm nts on T. brucei and L. major rRNAs. Total RNAs from L. major and T. brucei were subjected to primer extension using different concentrations of dNTPs (0.05, 0.5, and 5 mM) to detect the Nm modification. The primers used for this mapping were specific to 5' LSU rRNA (A) and 3' LSU rRNA (B) (using oligonucleotides 1238 and 1217, respectively). Extension products were analyzed in a 6% polyacrylamide-7 M urea gel next to DNA sequencing products. Sequencing was performed using plasmids containing T. brucei rRNA as a template, with the same oligonucleotides as those used for primer extension. A partial DNA sequence is given on the left, and the modified nts are denoted by asterisks. The nts that differ in L. major from T. brucei are circled. The stops representing 1 nt before the modified nts are indicated on the right by arrows. The positions of the modified nts in T. brucei and L. major are indicated. The names of L. major snoRNAs that potentially guide the modification are given in parentheses.
|
![]() View larger version (33K): [in a new window] |
FIG. 5. Expression of selected snoRNAs. (A) Levels of C/D snoRNAs. Total RNA from L. major was subjected to primer extension using primers specific to LM14Cs1C1, LM14Cs1C2, LM20Cs1C1, LM20Cs1C2, LM20Cs1C3, LM20Cs1C4, LM20Cs1C5, LM23Cs1C1, LM23Cs1C2, LM23Cs1C3, LM26Cs1C1, LM26Cs1C3, and LM35Cs2C1. The level of snoRNA LM33Cs1H1 was used to control the amount of RNA in each sample. Extension products were analyzed in a 6% polyacrylamide-7 M urea gel. (B) Levels of H/ACA-like snoRNAs. Total RNA from L. major was separated in a 10% polyacrylamide-7 M urea gel. The RNAs were electroblotted and hybridized with probes complementary to LM26Cs1H1, LM26Cs1H4, LM26Cs1H8, LM26Cs1H9, LM33Cs1H1, and LM33Cs2H1. The level of SL RNA was used to control for equal loading. The numbers at the bottom indicate the copy numbers of the clusters.
|
![]() View larger version (27K): [in a new window] |
FIG. 6. Relationships among predicted secondary structure, K-turn motif, and structural features of C/D snoRNAs to their expression level. (A) C/D snoRNA secondary structures were predicted using the mfold program (http://www.bioinfo.rpi.edu/applications/mfold/old/rna/). Flanking sequences appear in small letters, and coding sequences appear in large letters. The conserved C and D sequences are boxed and indicated. The extragenic stem, K-turn motif, and intragenic stem are indicated by arrows on the right. Noncanonical base pairing of the K turn is indicated by a dashed line. (B) Schematic representation of K-turn motifs. (a) Consensus motif of rRNA; (b) consensus motif of E. gracilis snoRNA (E.g.) (32); (c) K turn from T. brucei snoRNA; (d) K turn from L. major snoRNA. (C) Table summarizing the structural features of C/D snoRNAs. The degree of expression is given in relative numbers. The G values (kcal/mol) of the duplexes (external and internal stems) were calculated using mfold. The copy numbers of the genes are indicated. +, C/D boxes which agree with the consensus (cited in the text); , boxes that deviate from the consensus. The degree of resemblance to the K-turn consensus was graded as follows: , strong deviation from the consensus; +, partial resemblance to the canonical structure; and ++, cases that fully obey the consensus. Expression levels were calculated using densitometric analysis and are presented in arbitrary units (1 represents the snoRNA expressed at its lowest level).
|
G values of the extragenic stems were calculated using the mfold program (Fig. 6C). The
G values ranged from 0.7 to 30 kcal/mole (average of 16.5 ± 7.8 kcal/mol). The
G values of the external stems suggest that such stems most probably exist, unlike the putative internal stems, whose
G values are higher (3.9 ± 3.4 kcal/mol) and which most probably are not formed (Fig. 6C). Thus, we suggest that the external stem is formed and is a real structural feature of C/D snoRNAs in trypanosomes. In contrast, the stability of the internal stem is very weak, and this stem either does not form at all or exists only transiently. However, no direct correlation was found between the strength of the external stem and RNA abundance (Fig. 5A and 6C). Another structural feature of the C/D snoRNAs that may affect the expression level is the kink-turn motif (K turn), formed by interactions between the C and D boxes located near the ends of the RNAs (35, 40). The putative canonical K-turn structure (32) is depicted in Fig. 6B, comparing the consensus K turn with the potential of LM20Cs1H1 to form an analogous structure. This motif is composed of stems I and II (Fig. 6B). Noncanonical base pairing, such as G-A and U-U pairs, have been shown to be important for the formation of the K turn and to determine the binding of Snu13 to box C/D (26). Efficiency of binding of Snu13 may affect the fate of snoRNA and determine whether the snoRNA will assemble to snoRNP or be degraded by the exosome during processing (8). We therefore examined whether there is a correlation between the strength of conservation of the K-turn motif and the level of the snoRNA. The potential of several snoRNAs to form a K turn is indicated in Fig. 6A and was qualitatively assessed as shown in Fig. 6C. Other factors that may affect snoRNA processing and binding of Snu13 are the C and D box sequences (26, 40). To this end, we inspected the conservation of the C and D boxes, i.e., CUGA and UGAUGA, respectively. Those that are conserved are marked with plus signs, and those that deviate by at least 1 nt are marked with minus signs (Fig. 6C). We found that if the two boxes deviate from the consensus, the snoRNA is poorly expressed. However, high conservation of the boxes is not sufficient to ensure high expression, as in the case of Lm35Cs2C1 (Fig. 6C). Inspection of the data in Fig. 6A and C suggests that there is no single structural feature that fully correlates with the abundance of the snoRNA. For instance, the K turns of all molecules inspected agree with the canonical structure mostly in stem II (A-G pairing), but there are cases of K turns which fit closely with the consensus and yet the snoRNAs harboring them are poorly expressed (Fig. 6C).
|
|
|---|
Comparative genomics between L. major and T. brucei snoRNA clusters. For the most part, the repertoire of snoRNA genes is conserved between T. brucei and L. major. However, the snoRNA sequences described here and previously described for T. brucei represent only part of the estimated repertoire. Mapping of Nms in T. brucei revealed the existence of an additional 52 novel Nms that were found mainly in domains already rich in modifications. These modifications were reduced in cells depleted of fibrillarin by RNA interference, suggesting that they are guided by C/D snoRNPs (S. Barth and S. Michaeli, unpublished data). Among these modifications are those conserved in plants, yeast, and mammals, suggesting that snoRNAs that guide these modifications most probably exist. Why were these snoRNAs not detected in the entire genomes of both L. major and T. brucei? Our search for T. brucei snoRNAs was based on the identification of repeats, using the SnoScan program and comparative genomics with T. cruzi (24). This suggests that our screen was biased towards the identification of repeated snoRNAs, but snoRNAs which are not encoded in repeated clusters may also exist.
Finding these missing snoRNAs by either experimental or computational strategies turned out to be complicated. Examining the scores of individual snoRNAs by SnoScan (http://lowelab.UCSC.edu/snoscan) suggested that many known snoRNAs in trypanosomes receive low scores, explaining why it is very difficult to find additional snoRNAs by performing a genome search using this program. The finding of novel C/D snoRNAs using such a genome-scale search seems to be a complicated task because the conservation of these RNAs is relatively low and exists only in the boxes and in the regions of complementarity to the targets (24, 38). Thus, we cannot rule out the possibility of the existence of a novel family of guide RNAs that contain consensus boxes which strongly deviate from the C/D consensus and guide modifications present on rRNAs. A program to search for H/ACA-like molecules in the T. brucei genome is being developed (I. Myslyuk, Y. Horesh, S. Michaeli, and R. Unger, unpublished results). There is great hope that the identification of novel H/ACA RNA-containing clusters will reveal some of the missing C/D snoRNAs as well, since most snoRNA genes identified so far are present in clusters that contain both H/ACA and C/D snoRNA genes.
It is interesting that some C/D and H/ACA RNAs were identified in L. major but not in T. brucei. Among these RNAs are two snoRNAs with no predicted targets (Lm25Cs1C4 and Lm35Cs2C3) and three snoRNAs that can potentially guide modifications conserved in other organisms (LM30Cs1C2, LM33Cs3C1, and LM18Cs1C2). Seven snoRNAs with no known targets in T. brucei (Lm14Cs1C1, Lm35Cs2C2, Lm25Cs1C2, Lm36Cs2C3, Lm26Cs2C1, Lm20Cs1C4, and Lm20Cs1C5) were also identified and were found to be poorly expressed. In addition, we could not detect the predicted modifications on the rRNA. These could represent pseudogenes, very weak modifications, or alternatively, snoRNAs which are developmentally regulated.
Factors that may affect the differential abundance of snoRNAs present in Leishmania.
The data presented in Fig. 5 and 6 highlight the observation already reported that the levels of snoRNAs expressed from the same gene cluster can vary considerably (22, 23, 24). This raises a question regarding the factors that affect the abundance of a certain snoRNA within the cell. The transcriptional regulation of snoRNA genes cannot be disputed at this point, since a genomic region (
700 bp) upstream of a snoRNA gene cluster enhances the expression of snoRNA genes present on a multicopy plasmid (23). However, we could not identify consensus sequences upstream of the snoRNA clusters in either of these organisms. A rigorous experimental approach is needed to identify such sequences. Interestingly, we recently identified snoRNAs whose levels are enhanced in the bloodstream form of T. brucei compared to the procyclic state, suggesting that snoRNA gene expression might be developmentally regulated (Barth and Michaeli, unpublished data).
The expression of a snoRNA cluster can be influenced by its location near a "real" promoter. Recently, it was shown that promoter-like regions exist in L. major and are located in a strand-switching region between two long polycistronic transcripts (27). Indeed, in one special case, i.e., LM5Cs1, which carries SLA1, the cluster is located near an inflection point. This location enhances the transcription of SLA1 and accompanying snoRNAs, and indeed these RNAs are the most abundant snoRNAs (except for U3). It will therefore be of interest to examine the "strength" of this "promoter-like" element and to compare it to sequences present upstream of other snoRNA clusters which are not present at inflection points.
Most intriguing is the differential expression of snoRNA genes present in the same cluster, as clearly demonstrated in this study (Fig. 5 and 6). Transcription regulation cannot explain these differences, since these snoRNAs are processed from the same transcripts. At least two factors may influence the levels of such snoRNAs, namely, the efficiency of processing and the strength by which these snoRNAs bind their cognate binding proteins. To date, none of the factors that mediate snoRNA processing have been identified in trypanosomes. The only study performed with L. collosoma suggests that 10 extragenic flanking nucleotides are sufficient to govern efficient processing of the snoRNA (23, 45). Studies from mammals and yeast suggest that base pairing of the 5'-3'-terminal stem of the snoRNA coding sequence is required for the processing and accumulation of box C/D snoRNAs (7, 15, 41, 44). However, many intron-encoded box C/D snoRNAs expressed in mammalian cells or polycistronic snoRNAs in yeast lack the canonical 5'-3'-terminal helix (8). Studies on such snoRNAs indicated that the processing of these C/D snoRNAs is supported by external intronic stem structures that are fully or partially degraded during exonucleolytic cleavage (8). The trypanosome C/D snoRNAs resemble such snoRNAs, lacking the 5'-3' stem but having, instead, extragenic flanking helices. However, the data presented in Fig. 5A and 6C indicate that although the external stems are thermodynamically favorable, no significant correlation can be found between the abundance of the snoRNA and the stability of the external duplex. Other structural factors, such as the presence of a canonical K turn and conserved C and D boxes, also influence snoRNA abundance. The conclusion from our analysis suggests that efficient expression of trypanosomatid snoRNAs takes place if the extragenic flanking sequences of the snoRNAs form an extended stem and/or the snoRNAs can form a canonical K turn with conserved boxes. None of these factors exclusively affects snoRNA abundance. However, it is possible that regulatory sequences situated in the flanking sequences or other structural factors that were overlooked in this study affect snoRNA abundance. The level of snoRNA may be governed by combinatorial effects of all the factors discussed.
Even less is known about factors that affect the processing and stability of H/ACA RNAs (20). The phylogenetic analysis in Fig. 3 highlights the most important structural features of these RNAs, which are the conserved AGA box, the distance between the AGA box and
, and the lengths of stem I and part of stem II. These are the structural features that were shown to dictate the in vitro binding of CBF5 to archaeal H/ACA RNAs (3). The differential expression of H/ACA RNAs present in the same cluster observed in this study may result from the differential binding of CBF5 to the H/ACA RNA molecule. In addition, the sequences flanking the coding region may also affect snoRNA processing.
Conservation of rRNA Nm modifications among the trypanosomatids and their biological role. For the most part, the modifications mapped in L. major coincide with the modifications present in T. brucei rRNAs, except for a few cases where the modification was slightly shifted. As in T. brucei, the number of Nms in L. major is almost double the number of pseudouridines. We previously suggested that the need for a large number of Nms may help the parasite to cope with growth at an elevated temperature in its mammalian host (37°C) compared to 26°C in the insect host (24). Support for this role of Nms in trypanosomes was recently obtained when we observed an elevation in the level of certain modifications in bloodstream-stage T. brucei parasites compared to the procyclic stage (Barth and Michaeli, unpublished data). It will be interesting to examine whether the level of modification is also higher in L. major amastigotes than in promastigotes. Large numbers of Nms were also reported for plants, and this finding was rationalized by the fact that plants are exposed to large temperature changes, during which the ribosomes must be produced and remain active. Interestingly, plants (6) and Euglena gracilis (31) are the only eukaryotes studied so far that carry mixed clusters containing both C/D and H/ACA RNAs. The mixed snoRNA cluster may have developed independently at least twice in evolution, since there is no evidence for large-scale horizontal transfer of genetic material from plants to trypanosomes (13).
In sum, this study highlights the strengths of comparative genomics among related trypanosomatid species for the identification of snoRNA genes, and most probably other noncoding RNAs. The unique features of trypanosome snoRNAs, i.e., a greater number of C/D than H/ACA RNAs, single-hairpin H/ACA RNAs, and reiterated clusters composed of C/D and H/ACA RNAs, are all conserved in this family of parasites. The pattern of Nm modification is also highly conserved. Based on conserved features and deviations among the H/ACA RNAs, a canonical trypanosome H/ACA RNA was established that is currently in use for developing an algorithm to search for these RNAs on a genomic scale. The nonconserved snoRNA intergenic regions present in Old and New World Leishmania species could be used to differentiate Leishmania-related species and to identity novel noncoding RNAs in the Leishmania genome.
Published ahead of print on 22 December 2006. ![]()
Supplemental material for this article may be found at http://ec.asm.org/. ![]()
X.-H.L. and A.H. contributed equally to this study. ![]()
Present address: Department of Biochemistry and Molecular Biology, University of Massachusetts, Amherst, MA. ![]()
|
|
|---|
This article has been cited by other articles:
| |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Copyright © 2010 by the American Society for Microbiology. For an alternate route to Journals.ASM.org, visit: http://intl-journals.asm.org | More Info»